MOSAIC: Efficient Mixture-of-Agent Scheduling via Adaptive Aggregation and Inference Concurrency

arXiv CS, Wednesday 03 June 2026, 04:00 UTC
By Saptarshi Mitra, Yifan Zhang, Rachid Karami, Phyo Pyae Moe Aung, Nazmul Takbir, Sreetama Sarkar, Souvik Kundu, Sitao Huang

Key Points

arXiv:2606.03014v1. Abstract: Mixture-of-Agents (MoA) systems improve reasoning accuracy by routing each query to multiple expert LLMs and aggregating their outputs. Executing this workload efficiently on limited GPU resources runs into two bottlenecks: skill-based routing creates skewed expert demand, and combining instruction-tuned LLMs with long-reasoning models results in extreme variability in generation lengths. Consequently, traditional scheduling strategies suffer from significant GPU idling and throughput collapse due to load imbalance. We present MOSAIC, a scheduling framework to accelerate MoA workloads. First, we formulate an Integer Linear Program (ILP) based scheduler that jointly optimizes expert placement and per-worker prompt assignment from offline-profiled costs, replicating reasoning experts across workers while pinning lightweight ones. Second, MOSAIC uses confidence-aware adaptive aggregation, leveraging inter-expert agreement to bypass the heavy final aggregator LLM for consensus queries. On our 4-GPU system, MOSAIC achieves up to 2.5x expert-stage, up to 4.23x aggregator-stage, and 1.7x to 2.3x end-to-end speedups over the baseline scheduler, while matching accuracy within 0.1 pp.
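The aggregator bypass described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how inter-expert agreement is measured, so this sketch assumes a simple majority vote over normalized answer strings. The function name `aggregate`, the `heavy_aggregator` callback, and the `agreement_threshold` value are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import Callable, Sequence, Tuple


def aggregate(
    expert_answers: Sequence[str],
    heavy_aggregator: Callable[[Sequence[str]], str],
    agreement_threshold: float = 0.75,  # hypothetical value, not from the paper
) -> Tuple[str, bool]:
    """Return (final_answer, used_heavy_aggregator).

    Assumption: agreement is the share of experts that gave the most
    common answer after trivial normalization. The paper's actual
    confidence measure may differ.
    """
    if not expert_answers:
        raise ValueError("need at least one expert answer")

    # Normalize so that whitespace and case differences do not break consensus.
    normalized = [answer.strip().lower() for answer in expert_answers]
    top_answer, votes = Counter(normalized).most_common(1)[0]
    agreement = votes / len(normalized)

    if agreement >= agreement_threshold:
        # Consensus query: skip the expensive aggregator call entirely.
        return top_answer, False

    # Experts disagree: fall back to the heavy aggregator LLM.
    return heavy_aggregator(expert_answers), True
```

In this sketch, a query where three of four experts answer "42" is resolved without calling `heavy_aggregator`, while a query with split answers is forwarded to it. The saving comes from the fraction of queries that reach consensus, which depends on the workload and on the chosen threshold.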


Originally published by arXiv CS