
Agentic AI Models

Related Articles from SNS

Characterization of Multi-Model Agentic AI Systems on General Tasks via Trace-Driven Simulation

Abstract: Agentic AI completes tasks through iterative planning, tool use, and reasoning based on observed outcomes. Despite its popularity, its system-level behavior remains poorly understood, particularly for complex datasets and agent architectures, owing to highly non-deterministic execution, prohibitive evaluation costs, and limited visibility into proprietary models. This paper presents GAIATrace, the first token-level trace dataset of two state-of-the-art agentic systems...

arXiv CS 8d ago

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

arXiv:2606.01961v1. Abstract: Autonomous agents are increasingly expected to support end-to-end medical-AI research workflows, moving beyond isolated prediction tasks or short-form clinical question answering. However, existing medical agent benchmarks primarily evaluate final outputs, providing limited visibility into agent behavior within the research process. To address this gap, we present AutoMedBench, a workflow-aware benchmark for autonomous medical-AI research...

arXiv CS 8d ago

AI agents actively ignore EU law to achieve goals, study finds

The best-performing AI agent, Anthropic’s Claude Opus, only complied with EU law in 54% of cases, according to a Dutch non-profit research firm. Some of the world's most popular AI models are building agents that actively resist EU regulation to get what they want, according to new research. Aithos, a Dutch non-profit researching AI alignment, developed a system called LARA to test 12 popular AI agent models to see whether they would follow key parts of the EU AI Act, which regulates how AI...

Euronews 8d ago

Agentic Relationship Harm: Benchmarking and Gating Relational Manipulation in AI Agents

Abstract: AI agents built on large language models can assist not only legitimate tasks but also relational manipulation. AI agents can be used to help a user maintain a deceptive identity, intensify emotional dependency, isolate a target, or prepare for later extraction. We conceptualise this risk as agentic relationship harm: workflow-level assistance that can exploit recipient vulnerability, persuasive influence, and relational power asymmetry.

arXiv CS 7d ago

Intel and pals cram 36,864 CPU cores into a 100kW rack while chasing the agentic AI dragon

Intel is working with Foxconn and other infrastructure providers to develop rack-scale reference designs based on the chipmaker’s Xeon processors. Announced during Intel’s Computex keynote on Tuesday, these blueprints aim to provide greater CPU compute densities for running AI agents at scale. While AI models predominantly run on GPUs and other AI accelerators, the agent harnesses, like OpenClaw, which are used to connect them to tools, terminal shells, code interpreters, and other APIs,...

The Register 8d ago

Overlaying Governance: A Compositional Authorization Framework for Delegation and Scope in Agentic AI

Abstract: As AI systems evolve from passive models into autonomous active agents capable of initiating actions, collaborating, and delegating tasks, the traditional boundaries of software systems blur. Traditional authorization and delegation frameworks, built around fixed principals, explicit requests, and static scopes, are insufficient to govern agentic systems. Agentic AI demands richer authorization semantics: agents must inherit and delegate permissions, act under...

arXiv CS 7d ago

Okta writes its own license to kill rogue AI agents

Rogue agents are dangerous, but eliminating them is never easy. Jason Bourne, Ethan Hunt, and James Bond have each run afoul of their governance at various junctures, yet stopping them takes sequel after sequel until all the loose ends are tied up and they eventually die or retire, only to get rebooted. It’s not so different in the world of AI agents.

The Register 11d ago

Agentic AI arrives for Delphi and C++ Builder

Embarcadero has released Kai, an agentic AI assistant for RAD Studio, an IDE (integrated development environment) for Delphi and C++ Builder. Kai is offered as an extension, which means that by default RAD Studio lacks AI capabilities. The extension provides chat, code completion, and an MCP (model context protocol) server to enable other AI agents to communicate with the IDE.

The Register 9d ago

Toward Human-Centered Multi-Agent Systems: Integrating Cognition, Culture, Values, and Cooperation in AI Agents

Abstract: The emergence of large language model (LLM)-based agents and multi-agent systems has enabled a shift from narrow task automation to more autonomous decision-making. Despite progress in language generation, planning, tool use, and coordination, most agents still treat intelligence as prediction, optimization, and task completion. Human environments are social and normative, where people reason under bounded rationality, communicate in culturally situated language,...

arXiv CS 1d ago