Home Knowledge Base Kahneman

Kahneman

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Social networks outsmart cognitive biases: How herding in networks makes populations more rational

Social networks outsmart cognitive biases: How herding in networks makes populations more rational Stephanie Baum Scientific Editor Andrew Zinin Lead Editor In 2010, the New York City-based restaurant Serendipity 3 revealed its $69 hot dog, winning the Guinness World Record for the world's most expensive hot dog. Served on a toasted pretzel roll with truffle butter and covered in foie gras, the award-winning hot dog made the restaurant's $18 cheeseburger seem like a steal. That's the point,...

Phys.org 6d ago

MDP-GRPO: Stabilized Group Relative Policy Optimization for Multi-Constraint Instruction Following

arXiv:2606.06058v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards is ideal for multi-constraint instruction following, yet standard group-relative policy optimization (GRPO) becomes unstable under discrete, low-dispersion rewards, where within-group reward distributions are frequently homogeneous. We identify and formalize three pathologies of z-score group normalization in this regime: low-variance amplification, mean-centering blindness, and zero-variance...

arXiv CS 5d ago

ToolRec: Calibrated Preference Alignment for Query Recommendation in On-Device Assistants

arXiv:2606.08466v1 Announce Type: new Abstract: Large Language Models (LLMs) have significantly advanced generative query recommendation. However, existing alignment methods primarily focus on standard chatbot scenarios, falling short in on-device intelligent assistants where users predominantly expect the rapid invocation of system-level tools. Moreover, directly aligning LLMs with real-world click logs introduces severe noise due to varying user activity levels and the failure to emphasize...

arXiv CS 1d ago