Home Knowledge Base S$^3$E (Structured Semantic Stress Evaluation

S$^3$E (Structured Semantic Stress Evaluation

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

When Correct Decisions Hide Internal Stress: Decision-State Probing in Multimodal Language Models

Announce Type: new Abstract: Multimodal language models are typically evaluated through external behavior: selecting the correct image--text match, rejecting unsupported captions, or answering visual queries correctly. However, correct behavior alone does not show that the model's internal decision state remains stable under controlled semantic stress. We study this gap through S$^3$E (Structured Semantic Stress Evaluation), a framework for analyzing behavior-internal decoupling in...

arXiv CS 1d ago