Pyramid Transformer
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
Multi-view Pyramid Transformer: Look Coarser to See Broader
arXiv:2512.07806v2 Announce Type: replace Abstract: We propose Multi-view Pyramid Transformer (MVP), a scalable multi-view transformer architecture that directly reconstructs large 3D scenes from tens to hundreds of images in a single forward pass. Drawing on the idea of ``looking broader to see the whole, looking finer to see the details," MVP is built on two core design principles: 1) a local-to-global inter-view hierarchy that gradually broadens the model's perspective from local views to...
Contrastive Augmented Transformer with Domain-specific Enhancement for Robust Multi-scenario Metal Surface Defect Detection
arXiv:2606.01962v1 Announce Type: new Abstract: Metal surface defect detection is critical for maintaining product quality in industrial manufacturing. However, it faces significant challenges, including limited annotated data, difficulty in identifying subtle multi-scale defects, and poor generalization across diverse scenarios. To address these issues, this paper proposes a novel Contrastive Augmented Transformer (CAT) framework for robust defect detection.
Contrastive Augmented Transformer with Domain-specific Enhancement for Robust Multi-scenario Metal Surface Defect Detection
arXiv:2606.01962v2 Announce Type: replace Abstract: Metal surface defect detection is critical for maintaining product quality in industrial manufacturing. However, it faces significant challenges, including limited annotated data, difficulty in identifying subtle multi-scale defects, and poor generalization across diverse scenarios. To address these issues, this paper proposes a novel Contrastive Augmented Transformer (CAT) framework for robust defect detection.
SemDINO: A DINOv3-Driven Network for Cross-Temporal Semantic Alignment in Change Detection
arXiv:2606.09772v1 Announce Type: new Abstract: Semantic change detection (SCD) aims to simultaneously locate land-cover changes and identify semantic categories before and after transition. However, existing methods suffer from insufficient cross-temporal alignment, weak multi-scale representation, and poor robustness to pseudo-changes caused by illumination, season, and registration noise. To address these issues, we propose a novel end-to-end semantic change detection network named...
‘Like a Klingon prison’: inside Barack Obama’s audacious, near-windowless, $850m presidential library
Towering over a low-income area of Chicago, and wrapped in a speech that’s hard to decipher, this controversial monolith feels like a menacing sci-fi HQ. Is it a monument – or a mausoleum?The Egyptians had their pyramids. The Anglo-Saxons had their barrows.
‘Like a Klingon prison’: inside Barack Obama’s audacious, near-windowless, $850m presidential library
Towering over a low-income area of Chicago, and wrapped in a speech that’s hard to decipher, this controversial monolith feels like a menacing sci-fi HQ. Is it a monument – or a mausoleum?The Egyptians had their pyramids. The Anglo-Saxons had their barrows.
Raphael Lets Loose
Plenty of faces keep you company in the Metropolitan Museum of Art’s exhibition “Raphael: Sublime Poetry”—saints and sinners, popes and poets, ladies in posh frocks or nothing at all—but the most disarming is the first to greet you, that of a boy in a fun hat. With a long, straight nose; soft, bright eyes; and an uplifted chin, he carries the wary confidence of a teenage heartthrob. It isn’t just the face that makes you pause.
Great mysteries of archaeology: An ancient Amazonian world revealed from the sky
Great mysteries of archaeology: An ancient Amazonian world revealed from the sky Gaby Clark Scientific Editor Andrew Zinin Lead Editor From the air, you see it only through the constant jolt, tilt, and shudder of the low-flying Cessna aircraft. The landscape of the Llanos de Moxos, northern Bolivia, appears as a disconnected patchwork of open grassland savannahs, forest islands, and lakes. It feels random, almost unreadable.
When AI Builds Itself: Our progress toward recursive self-improvement
For most of AI’s history, humans drove every step in its development cycle. But at Anthropic, we are delegating a growing share of AI development to AI systems themselves, which is speeding up our work. Taken far enough, and given enough compute, that trend points to an AI system capable of fully autonomously designing and developing its own successor.
Folding Beijing
At ten of five in the morning, Lao Dao crossed the busy pedestrian lane on his way to find Peng Li. After the end of his shift at the waste processing station, Lao Dao had gone home, first to shower and then to change. He was wearing a white shirt and a pair of brown pants—the only decent clothes he owned.