Home Knowledge Base the `Constitutional AI'

the `Constitutional AI'

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Glass Box at Orbit: A Constitutional AI Verification Framework for Trustworthy Autonomous CubeSat Intelligence

arXiv:2606.02967v1 Announce Type: new Abstract: The space industry is quietly building toward something nobody has fully reckoned with: orbital data centers running thousands of autonomous AI workloads with no human in the loop, 550 km above the Earth. Microsoft, AWS, and a growing list of orbital computing ventures are moving cloud-scale processing off the ground and into orbit. What none of them have answered yet is the governance question -- when autonomous AI systems at orbital data...

arXiv CS 7d ago

The Human-AI Delegation-Verification Dilemma: Individual Strategies, Collective Equilibria and Sociotechnical Lock-in

Announce Type: replace Abstract: This paper takes an ecological approach toward large-scale models of hybrid human-AI intelligence. Emerging models of human-AI interaction predominantly advance the complementarity thesis variously dubbed human-AI collaboration and human-AI hybrid intelligence. However, this constitutes an over-simplification of the modalities of human-AI interaction and possibility-space for both individual and collective action that human-AI interaction potentiates.

arXiv CS 2d ago

'Your AI Text is not Mine': Redefining and Evaluating AI-generated Text Detection under Realistic Assumptions

arXiv:2606.04906v1 Announce Type: new Abstract: Although it is generally agreed that AI-generated text poses a broad societal risk, there is no common understanding in the AI-generated text detection literature on what constitutes harmful use. Rather, existing datasets and approaches often define their own criteria and make their own assumptions, sometimes implicitly, and often only loosely related to real-world needs and applications.

arXiv CS 6d ago

Emergent alignment and the projectability of ethical personas

arXiv:2606.09475v1 Announce Type: new Abstract: Work on `emergent misalignment' shows that finetuning LLMs on narrow tasks can induce broadly misaligned behavior. This supports the `persona selection' (PSM) hypothesis: during pre-training, LLMs learn to simulate different characters and perspectives, which can be elicited and refined during post-training.

arXiv CS 1d ago

Reproducibility is the New Copyleft: Defining AGI-oriented Reproducible Builds

arXiv:2606.03019v1 Announce Type: new Abstract: Copyleft, as implemented in licenses such as the GNU General Public License, was a legal hack that used copyright to guarantee user freedom by tying the availability of source code to every act of distribution. Its normative force rested on an implicit technical premise: that source code and object code stand in a well-defined, humanly auditable, and reproducible relationship. Large language models and, prospectively, Artificial General...

arXiv CS 7d ago

America Has a Pangram Problem

Basically every recent, high-profile accusation of someone passing off AI-generated writing as their own has started in the same way: with a tool called Pangram. In March, when a horror novel from a major publishing house was pulled just days before its scheduled U.S. release date, it was in part because Pangram, an AI-detection program, had identified the text as AI-generated. Other people have fed text into Pangram to suggest that chatbots have been used to write articles in major...

The Atlantic 11d ago

No, Artificial Intelligence Is Not Conscious

Anthropic is regarded as a giant among AI companies, but perhaps what it really excels in is anthropomorphism. Earlier this year the company released an 84-page document titled Claude’s “constitution,” Claude being the name of the large language model that is the company’s flagship product. The first sentence reads, “Claude’s constitution is a detailed description of Anthropic’s intentions for Claude’s values and behaviors.”

The Atlantic 7d ago

Artificial intelligence is not conscious – Ted Chiang

No, Artificial Intelligence Is Not Conscious Taken to its logical conclusion, this line of thinking is absurd—and damning. Anthropic is regarded as a giant among AI companies, but perhaps what it really excels in is anthropomorphism. Earlier this year, the company released an 84-page document titled Claude’s “constitution,” Claude being the name of the large language model that is the company’s flagship product.

Hacker News 6d ago

The Feeling of Control Slipping Away

Back in the web-traffic-obsessed days of 2018, at a time of dawning awareness of how easily audiences online could be manipulated and spoofed by bots, the writer Max Read argued that the internet had crossed a threshold known as “the Inversion.” Not only had bots proliferated across the internet; they had come to constitute it. In outnumbering humans, bots were also loosening everyone’s grasp on the very reality of online experience.

The Atlantic 11d ago

Event Detection for Parameter-to-KPI Dependency Learning for AI-RAN

arXiv:2606.06459v1 Announce Type: new Abstract: Next-generation wireless networks are expected to rely on multiple concurrent AI-driven control functions that optimize different network objectives simultaneously, particularly in AI-integrated and open radio access network architectures such as AI Radio Access Network (AI-RAN) and Open Radio Access Network (O-RAN). When these functions interact, they can interfere with one another in ways that are difficult to detect from raw network data...

arXiv CS 5d ago