Home Knowledge Base MLLM Merging

MLLM Merging

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

ES-Merging: Biological MLLM Merging via Embedding Space Signals

arXiv:2603.14405v2 Announce Type: replace Abstract: Biological multimodal large language models (MLLMs) have emerged as powerful foundation models for scientific discovery. However, existing models are specialized to a single modality, limiting their ability to solve inherently cross-modal scientific problems. While model merging is an efficient method to combine the different modalities into a unified MLLM, existing methods rely on input-agnostic parameter space heuristics that fail to...

arXiv CS 8d ago

EGOSTREAM: A Diagnostic Benchmark for Streaming Episodic Memory in Egocentric Vision

arXiv:2605.31557v1 Announce Type: new Abstract: Continuous episodic memory is a core capability for autonomous agents operating in dynamic, real-world environments, yet current streaming video benchmarks provide limited tools for diagnosing what models remember and for how long. We introduce \egostream, a diagnostic benchmark for streaming episodic memory evaluation in egocentric vision. \egostream organizes 2,250 curated questions along seven cognitive dimensions: detail, spatial, temporal,...

arXiv CS 9d ago

EGOSTREAM: A Diagnostic Benchmark for Streaming Episodic Memory in Egocentric Vision

arXiv:2605.31557v2 Announce Type: replace Abstract: Continuous episodic memory is a core capability for autonomous agents operating in dynamic, real-world environments, yet current streaming video benchmarks provide limited tools for diagnosing what models remember and for how long. We introduce Egostream, a diagnostic benchmark for streaming episodic memory evaluation in egocentric vision. \egostream organizes 2,250 curated questions along seven cognitive dimensions: detail, spatial,...

arXiv CS 8d ago