Home Knowledge Base LangChain

LangChain

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Apache Burr: Build reliable AI agents and applications

Build reliable AI agents and applications Apache Burr (Incubating) makes it easy to develop applications that make decisions, from simple chatbots to complex multi-agent systems. Pure Python, no magic. Simple, powerful Python API Build anything from chatbots to multi-agent systems with a clean, composable interface.

Hacker News 3h ago

From Storage to Steering: Memory Control Flow Attacks on LLM Agents

Announce Type: replace Abstract: Modern agentic systems allow Large Language Model (LLM) agents to tackle complex tasks through extensive tool usage, forming structured control flows of tool selection and execution. Existing security analyses often treat these control flows as ephemeral, one-off sessions, overlooking the persistent influence of memory. This paper identifies a new threat from Memory Control Flow Attacks (MCFA) that memory can dominate the control flow, forcing unintended tool...

arXiv CS 2d ago

Entropy-Based Evaluation of AI Agents: A Lightweight Framework for Measuring Behavioral Patterns

Announce Type: new Abstract: AI agents are commonly evaluated using task success, reward, latency, and cost. These metrics are useful, but they often miss important aspects of agent behavior: whether an agent explores too much, repeats itself too rigidly, uses tools effectively, reduces uncertainty over time, or remains robust across repeated runs. This paper proposes Entropy-Based Evaluation of AI Agents (EEA), a lightweight framework for measuring agent behavior through entropy.

arXiv CS 5d ago

Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and Mitigation

Announce Type: new Abstract: Multi-step agentic retrieval-augmented generation (RAG) pipelines have demonstrated significant capability for complex reasoning tasks, yet remain vulnerable to a class of failure that existing hallucination detection mechanisms systematically miss: cascading hallucination, where errors introduced at early pipeline stages propagate and amplify across successive reasoning steps, producing confident but factually incorrect final outputs. To address this...

arXiv CS 6d ago

The End of Software Engineering: How AI Agents Are Fundamentally Restructuring the Software Paradigm

arXiv:2606.05608v1 Announce Type: new Abstract: For over half a century, software engineering has operated on a foundational premise: human engineers decompose problems, encode decision logic into static code, and manually adapt that code as requirements evolve. This paper argues that the emergence of AI agents -- systems where large language models serve as the primary reasoning engine, dynamically generating and discarding code as an instrumental resource -- constitutes not an incremental...

arXiv CS 5d ago

Maris: A Formally Verifiable Privacy Policy Enforcement Paradigm for Multi-Agent Collaboration Systems

arXiv:2505.04799v4 Announce Type: replace Abstract: Multi-agent collaboration systems (MACS), powered by large language models (LLMs), solve complex problems efficiently by leveraging each agent's specialization and communication between agents. However, the inherent exchange of information between agents and their interaction with external environments, such as LLM, tools, and users, inevitably introduces significant risks of sensitive data leakage, including vulnerabilities to attacks such...

arXiv CS 8d ago

Entropy-Based Evaluation of AI Agents: A Lightweight Framework for Measuring Behavioral Patterns

arXiv:2606.05872v2 Announce Type: replace Abstract: AI agents are commonly evaluated using task success, reward, latency, and cost. These metrics are useful, but they often miss important aspects of agent behavior: whether an agent explores too much, repeats itself too rigidly, uses tools effectively, reduces uncertainty over time, or remains robust across repeated runs. This paper proposes Entropy-Based Evaluation of AI Agents (EEA), a lightweight framework for measuring agent behavior...

arXiv CS 1d ago

Maris: A Formally Verifiable Privacy Policy Enforcement Paradigm for Multi-Agent Collaboration Systems

Announce Type: replace Abstract: Multi-agent collaboration systems (MACS), powered by large language models (LLMs), solve complex problems efficiently by leveraging each agent's specialization and communication between agents. However, the inherent exchange of information between agents and their interaction with external environments, such as LLM, tools, and users, inevitably introduces significant risks of sensitive data leakage, including vulnerabilities to attacks such as eavesdropping...

arXiv CS 1d ago