Home › Knowledge Base › PSM

PSM

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Emergent alignment and the projectability of ethical personas

arXiv:2606.09475v1 Announce Type: new Abstract: Work on `emergent misalignment' shows that finetuning LLMs on narrow tasks can induce broadly misaligned behavior. This supports the `persona selection' (PSM) hypothesis: during pre-training, LLMs learn to simulate different characters and perspectives, which can be elicited and refined during post-training.

arXiv CS 1d ago

An efficient Progressive Swapping to the Middle distribution protocol adapted to imperfect quantum memories in quantum networks

arXiv:2605.31493v1 Announce Type: cross Abstract: The distribution of entangled pairs of photons on the links composing a quantum network, combined with Bell state measurements and teleportation, is the basic apparatus to transfer quantum bits (qubits) over long distances. Entanglement distribution establishes an end-to-end entangled pair while consuming intermediate pairs on links and holding them for a certain time period. The technical literature identifies two main kinds of protocols,...

arXiv CS 9d ago

Former WA Police officer 'gutted' after compensation for horrific injury denied

Former WA Australian of the Year Paul Litherland sustained severe physical and psychological injuries when he was hit by a car while serving in the WA Police in 2004. He says he was told that when he left the force, he would be eligible for post-service medical compensation for his ongoing medical costs, but his claim was recently denied by the state government's insurer. Police Minister Reece Whitby says he has asked the insurer to look into Mr Litherland's case, describing him as a...

ABC Australia 1d ago

Predictive Style Matching: Natural and Robust Humanoid Locomotion

arXiv:2606.07083v1 Announce Type: new Abstract: Reinforcement learning has become the prevailing approach to humanoid locomotion control: policies transfer reliably from simulation to hardware and recover gracefully from disturbances. Motion quality, however, still lags behind: task-only rewards often converge to stiff, asymmetric gaits, while motion imitation methods improve appearance but become more sensitive to external disturbances because reference signals can oppose the transient...

arXiv CS 2d ago

Imbuing Large Language Models with Bidirectional Logic for Robust Chain Repair

Announce Type: new Abstract: Autoregressive chain-of-thought (CoT) reasoning in large language models (LLMs) is fundamentally forward-directed: each step conditions only on prior tokens. This unidirectional inductive bias renders even capable models susceptible to error snowballing, wherein a single logical or arithmetic mistake in an early step irreversibly corrupts the entire reasoning chain. We introduce Teleological Reasoning Infilling (\TRI{}), a training framework that endows...

arXiv CS 6d ago