Auditing Proprietary Alignment in Large Language Models:

Related Articles from SNS

Auditing Proprietary Alignment in Large Language Models: A Comparative Framework Without a Ground-Truth Standard

arXiv:2606.08381v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly released and deployed through opaque development and deployment pipelines, enabling model providers to inject intentional, provider-specific policies without officially announcing them. As a result, various models have been reported to generate responses reflecting proprietary rules and organizational interests, leading to censorship or misinformation on controversial topics. However, systematic...

arXiv CS 1d ago

Assessing and Mitigating Miscalibration in LLM-Based Social Science Measurement

arXiv:2605.11954v2 Announce Type: replace Abstract: Large language models (LLMs) are increasingly used in social science as scalable measurement tools for converting unstructured text into variables that can enter standard empirical designs. Measurement validity demands more than high average accuracy: it also requires well-calibrated confidence that faithfully reflects the empirical probability of each measurement being correct. This paper studies model miscalibration in LLM-based social...

arXiv CS 7d ago