A Mixed Diet Makes DINO An Omnivorous Vision Encoder

arXiv CS, Tuesday 09 June 2026, 04:00 UTC. By Rishabh Kabra, Maks Ovsjanikov, Drew A. Hudson, Ye Xia, Skanda Koppula, Andre Araujo, Joao Carreira, Niloy J. Mitra

Key Points

arXiv:2602.24181v2 (announce type: replace)

Abstract: Pre-trained vision encoders like DINOv2 have demonstrated exceptional performance on unimodal tasks. However, we observe that their features are poorly aligned across different visual modalities. For instance, the feature embeddings of an RGB image and the corresponding depth map of the same scene exhibit a cosine similarity that is nearly identical to that of two random, unrelated images. To address this, we propose the Omnivorous Vision Encoder, a post-training framework that learns a modality-agnostic feature space. We fine-tune the encoder with a dual objective: first, an alignment objective that maximizes the feature alignment between different modalities of the same scene; and second, a distillation objective that anchors the learned representations to a fully frozen teacher. The resulting student encoder becomes "omnivorous" by producing more consistent embeddings for a given scene, regardless of the input modality (RGB, depth, segmentation, etc.). This approach enables robust cross-modal understanding while retaining the discriminative semantics of the original foundation model. Omnivorous model weights are available at https://github.com/google-deepmind/representations4d.
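The dual objective described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the loss formulas, so everything specific here is an assumption made for illustration: both terms are written as one minus cosine similarity, the teacher is assumed to see the RGB input, and the names `cosine_similarity`, `dual_objective`, and `distill_weight` are hypothetical rather than taken from the released code.

```python
import numpy as np


def cosine_similarity(a, b, eps=1e-8):
    """Row-wise cosine similarity between two (batch, dim) arrays."""
    a_n = a / (np.linalg.norm(a, axis=-1, keepdims=True) + eps)
    b_n = b / (np.linalg.norm(b, axis=-1, keepdims=True) + eps)
    return np.sum(a_n * b_n, axis=-1)


def dual_objective(student_rgb, student_depth, teacher_rgb, distill_weight=1.0):
    """Illustrative two-term loss (assumed form, not the paper's exact loss).

    student_rgb, student_depth: student embeddings of the same scenes in two
        modalities, shape (batch, dim).
    teacher_rgb: embeddings from the frozen teacher, shape (batch, dim).
        Feeding the teacher RGB only is an assumption.
    distill_weight: hypothetical weight balancing the two terms.
    """
    # Alignment term: pull the two modalities of each scene together.
    align_loss = np.mean(1.0 - cosine_similarity(student_rgb, student_depth))
    # Distillation term: keep the student close to the frozen teacher.
    distill_loss = np.mean(1.0 - cosine_similarity(student_rgb, teacher_rgb))
    total = align_loss + distill_weight * distill_loss
    return total, align_loss, distill_loss


# Toy usage with random vectors standing in for encoder outputs.
rng = np.random.default_rng(0)
teacher_rgb = rng.normal(size=(4, 16))
student_rgb = teacher_rgb.copy()            # student starts as a copy of the teacher
student_depth = rng.normal(size=(4, 16))    # unaligned depth embeddings
total, align, distill = dual_objective(student_rgb, student_depth, teacher_rgb)
print(f"total={total:.3f} align={align:.3f} distill={distill:.3f}")
```

In this toy setup the distillation term starts near zero because the student equals the teacher, while the alignment term is large because the depth embeddings are unrelated to the RGB ones. Under the assumptions above, training would trade these two terms off against each other.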

Originally published by arXiv CS.
