The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

arXiv CS Monday 08 June 2026, 04:00 UTC By Rahul Nair, Chun Tao 1 min read

Key Points

arXiv:2606.06920v1 Announce Type: new Abstract: Deploying Small Language Models (SLMs) on edge devices requires efficient fine-tuning strategies that adapt models to new tasks without degrading their general capabilities. In this study, we benchmark five sub-1B models (135M-1B) on mathematical reasoning tasks and uncover a critical vulnerability: Full Fine-Tuning (Full FT) actively harms performance in models under 300M parameters, often dropping accuracy below zero-shot baselines. This "negative transfer" makes Parameter-Efficient Fine-Tuning (PEFT) not just an efficiency preference, but a stability requirement. We find that while Low-Rank Adaptation (LoRA) and Weight-Decomposed LoRA (DoRA) perform comparably, their strengths vary by task; DoRA excels in complex reasoning (GSM8K), while LoRA dominates pattern matching (OrcaMath). In particular, Full FT is outperformed by LoRA on aligned models (Qwen2.5-0.5B) and even by simple 5-shot In-Context Learning on the smallest architectures (SmolLM2-135M). Based on these findings, we recommend defaulting to PEFT for all aligned sub-1B models and caution against Full FT for any architecture smaller than 500M parameters to prevent catastrophic forgetting. Reproduction of this work can be found at https://github.com/gulguluu/tiny-slm-finetune-compare.

Mathematical Reasoning arXiv:2606.06920v1 Announce Type (ORG) DoRA (ORG) PEFT (ORG)

Originally published by arXiv CS Read original →

Nasa chief defends choice of all-male Artemis III crew Critics fear the agency is following Trump’s order to eliminate diversity and inclusion efforts despite its vow to put a woman on the moon Nasa’s administrator Jared Isaacman on Wednesday defended the make-up of the space agency’s latest Artemis crew, an all-male group. The nominations have earned criticism that Nasa may have acted in accordance with US President Donald Trump’s direction to eliminate diversity and inclusion efforts....

South China Morning Post 17m ago

The asteroid that wiped out the dinosaurs may have created a vast underground habitat for life that lasted 8 million years

The asteroid that wiped out the dinosaurs may have created a vast underground habitat for life that lasted 8 million years The Chicxulub impact may have actually helped nurture life while destroying it, too. The asteroid impact that doomed the dinosaurs may also have built one of Earth's longest-lasting underground ecosystems. When a roughly 6-mile-wide (10-kilometer-wide) asteroid slammed into what is now Mexico's Yucatán Peninsula 66 million years ago, it triggered a global catastrophe...

Space.com 19m ago

See the 'crawling,' ball-shaped robot that rolled around the moon during Japan's historic first landing

See the 'crawling,' ball-shaped robot that rolled around the moon during Japan's historic first landing A morphable moon robot operated for 100 minutes in 2024, allowing investigators to get images of an upside-down spacecraft on the lunar surface. When the Japanese Smart Lander for Investigating Moon (SLIM) spacecraft, nicknamed the "Moon Sniper," face-planted onto the lunar surface in 2024, an experimental rover told Earth scientists what happened. Rolling autonomously through the lunar...

Live Science 19m ago

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

Related Stories

'Worrying' pollution in Cotswolds river - volunteers

Nasa chief defends choice of all-male Artemis III crew

The asteroid that wiped out the dinosaurs may have created a vast underground habitat for life that lasted 8 million years

See the 'crawling,' ball-shaped robot that rolled around the moon during Japan's historic first landing