Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States

arXiv CS Friday 05 June 2026, 04:00 UTC By Subramanyam Sahoo, Vinija Jain, Aman Chadha, Divya Chaudhary 1 min read

Key Points

Announce Type: replace Abstract: Linear probing of large language model (LLM) hidden states is widely used to claim that models learn distinct representations for different reasoning types. We test this by probing Qwen3-14B on three benchmarks spanning the classical trichotomy: LogiQA 2.0 (deductive), ARC-Challenge (inductive), and $\alpha$NLI (abductive). At layer 32 of 40, linear probes achieve 100\% cross-validated accuracy with well-separated geometry (intrinsic dimensionalities: 20.6,...

arXiv:2606.02907v2 Announce Type: replace Abstract: Linear probing of large language model (LLM) hidden states is widely used to claim that models learn distinct representations for different reasoning types. We test this by probing Qwen3-14B on three benchmarks spanning the classical trichotomy: LogiQA 2.0 (deductive), ARC-Challenge (inductive), and $\alpha$NLI (abductive). At layer 32 of 40, linear probes achieve 100\% cross-validated accuracy with well-separated geometry (intrinsic dimensionalities: 20.6, 28.5, 33.6; convex hull contamination $\leq$1.5\%). However, this separation is entirely driven by format confounds. Residualizing source identity, option count, and response length reduces accuracy to chance. Trace-anchor similarity indicates largely shared reasoning across tasks (42.5\% agreement vs.\ 33.3\% chance), and causal steering with random controls ($n=20$) shows no functional link between geometry and reasoning mode ($p=0.286$). Thus, high probe accuracy reflects task format rather than computational structure, motivating routine format deconfounding in mechanistic interpretability.

Linear (ORG) LLM (ORG) Qwen3-14B (PERSON) ARC-Challenge (ORG)

Originally published by arXiv CS Read original →

Starlink rival Qianfan hits satellite milestone, but is it too slow and costly? Constellation now has 201 satellites in orbit but the company is said to be under pressure to ramp up launches The constellation now has 201 satellites after a successful launch on board a Zhuque-2E rocket from the Gobi Desert at 4.23pm Beijing time on Tuesday. The mission delivered Qianfan DTC-01 – a direct-to-cell test satellite – alongside a satellite from China Mobile, state broadcaster CCTV reported.

South China Morning Post 55m ago

Violent Anti-Immigration Protests Erupt Across Northern Ireland

Here Are the Best Ways to Clean Stains and Save Your Money 04:47 Serena Williams Wins After 4 Years Away From Competition 00:25 Pope Leo XIV to Hold Mass at Spain’s Iconic Basilica 02:34 Now Playing Violent Anti-Immigration Protests Erupt Across Northern Ireland 00:26 UP NEXT Who Are the Nuns Praying for the San Antonio Spurs at Games? 01:12

NBC News 1h ago

Wall Street Braces for SpaceX With Stress Test, ‘Watch Parties’

Wall Street Braces for SpaceX With Stress Test, ‘Watch Parties’ Wall Street has spent months debating how much SpaceX is worth. Behind the scenes, a different challenge has occupied the institutions responsible for bringing it public: preparing the plumbing systems needed to support what could become the largest IPO in history. S&P Global Inc.’s Equity Bookbuild group, which helps underwriters capture and allocate investor demand during initial public offerings, has spent weeks expanding the...

Bloomberg Markets 1h ago

NASA names crew for Artemis III lunar lander rehearsal

NASA has named the four astronauts set to fly the Artemis III mission in an announcement that raised as many questions as it answered. The quartet is comprised of a Space Shuttle veteran, Randy Bresnik, as commander, and the European Space Agency's Luca Parmitano, whose helmet filled with water during an International Space Station (ISS) spacewalk. NASA astronauts Frank Rubio and Andre Douglas will serve as mission specialists.

The Register 2h ago

Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States

Related Stories

Starlink rival Qianfan hits satellite milestone, but is it too slow and costly?

Violent Anti-Immigration Protests Erupt Across Northern Ireland

Wall Street Braces for SpaceX With Stress Test, ‘Watch Parties’

NASA names crew for Artemis III lunar lander rehearsal