Home Knowledge Base Pyramid Transformer

Pyramid Transformer

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Multi-view Pyramid Transformer: Look Coarser to See Broader

arXiv:2512.07806v2 Announce Type: replace Abstract: We propose Multi-view Pyramid Transformer (MVP), a scalable multi-view transformer architecture that directly reconstructs large 3D scenes from tens to hundreds of images in a single forward pass. Drawing on the idea of ``looking broader to see the whole, looking finer to see the details," MVP is built on two core design principles: 1) a local-to-global inter-view hierarchy that gradually broadens the model's perspective from local views to...

arXiv CS 8d ago

Contrastive Augmented Transformer with Domain-specific Enhancement for Robust Multi-scenario Metal Surface Defect Detection

arXiv:2606.01962v1 Announce Type: new Abstract: Metal surface defect detection is critical for maintaining product quality in industrial manufacturing. However, it faces significant challenges, including limited annotated data, difficulty in identifying subtle multi-scale defects, and poor generalization across diverse scenarios. To address these issues, this paper proposes a novel Contrastive Augmented Transformer (CAT) framework for robust defect detection.

arXiv CS 8d ago

Contrastive Augmented Transformer with Domain-specific Enhancement for Robust Multi-scenario Metal Surface Defect Detection

arXiv:2606.01962v2 Announce Type: replace Abstract: Metal surface defect detection is critical for maintaining product quality in industrial manufacturing. However, it faces significant challenges, including limited annotated data, difficulty in identifying subtle multi-scale defects, and poor generalization across diverse scenarios. To address these issues, this paper proposes a novel Contrastive Augmented Transformer (CAT) framework for robust defect detection.

arXiv CS 7d ago

SemDINO: A DINOv3-Driven Network for Cross-Temporal Semantic Alignment in Change Detection

arXiv:2606.09772v1 Announce Type: new Abstract: Semantic change detection (SCD) aims to simultaneously locate land-cover changes and identify semantic categories before and after transition. However, existing methods suffer from insufficient cross-temporal alignment, weak multi-scale representation, and poor robustness to pseudo-changes caused by illumination, season, and registration noise. To address these issues, we propose a novel end-to-end semantic change detection network named...

arXiv CS 1d ago

‘Like a Klingon prison’: inside Barack Obama’s audacious, near-windowless, $850m presidential library

Towering over a low-income area of Chicago, and wrapped in a speech that’s hard to decipher, this controversial monolith feels like a menacing sci-fi HQ. Is it a monument – or a mausoleum?The Egyptians had their pyramids. The Anglo-Saxons had their barrows.

The Guardian Culture 8d ago

‘Like a Klingon prison’: inside Barack Obama’s audacious, near-windowless, $850m presidential library

Towering over a low-income area of Chicago, and wrapped in a speech that’s hard to decipher, this controversial monolith feels like a menacing sci-fi HQ. Is it a monument – or a mausoleum?The Egyptians had their pyramids. The Anglo-Saxons had their barrows.

The Guardian UK 8d ago

Raphael Lets Loose

Plenty of faces keep you company in the Metropolitan Museum of Art’s exhibition “Raphael: Sublime Poetry”—saints and sinners, popes and poets, ladies in posh frocks or nothing at all—but the most disarming is the first to greet you, that of a boy in a fun hat. With a long, straight nose; soft, bright eyes; and an uplifted chin, he carries the wary confidence of a teenage heartthrob. It isn’t just the face that makes you pause.

The Atlantic 8d ago

Great mysteries of archaeology: An ancient Amazonian world revealed from the sky

Great mysteries of archaeology: An ancient Amazonian world revealed from the sky Gaby Clark Scientific Editor Andrew Zinin Lead Editor From the air, you see it only through the constant jolt, tilt, and shudder of the low-flying Cessna aircraft. The landscape of the Llanos de Moxos, northern Bolivia, appears as a disconnected patchwork of open grassland savannahs, forest islands, and lakes. It feels random, almost unreadable.

Phys.org 1d ago

When AI Builds Itself: Our progress toward recursive self-improvement

For most of AI’s history, humans drove every step in its development cycle. But at Anthropic, we are delegating a growing share of AI development to AI systems themselves, which is speeding up our work. Taken far enough, and given enough compute, that trend points to an AI system capable of fully autonomously designing and developing its own successor.

Hacker News 6d ago

Folding Beijing

At ten of five in the morning, Lao Dao crossed the busy pedestrian lane on his way to find Peng Li. After the end of his shift at the waste processing station, Lao Dao had gone home, first to shower and then to change. He was wearing a white shirt and a pair of brown pants—the only decent clothes he owned.

Hacker News 10d ago