Home Knowledge Base TRIAD

TRIAD

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

XCR-Bench: Benchmarking Cross-Cultural Reasoning in LLMs via Culture-Specific Items and Hall's Triad

Announce Type: replace Abstract: Cross-cultural competence in large language models (LLMs) requires understanding and adapting Culture-Specific Items (CSIs) across varying cultural contexts. However, progress in evaluating this capability remains limited by the lack of high-quality CSI-annotated corpora with parallel cross-cultural sentence pairs. We introduce XCR-Bench, a Cross(X)-Cultural Reasoning Benchmark containing 4.1k parallel sentences and 1,098 CSIs across three reasoning tasks.

arXiv CS 1d ago

RunAgent SuperBrowser: A Theory of Autonomous Web Navigation Grounded in Human Browsing Behaviour

arXiv:2606.09399v1 Announce Type: new Abstract: We present SUPERBROWSER, an autonomous web-navigation agent designed against a single guiding hypothesis: a web agent should browse the way a person browses. A human reading a page does not retain every pixel they have seen; they look at a few candidate targets, decide on one, and remember only what is needed to keep the goal alive. We operationalize this perception-cognition-action triad as three coupled mechanisms.

arXiv CS 1d ago

From Risk Classification to Action Plan Remediation: A Guardrail Feedback Driven Framework for LLM Agents

Announce Type: new Abstract: LLM-based guardrails typically safeguard agents by evaluating proposed actions or inputs before execution, producing safety signals such as binary allow/deny decisions, risk categories, and/or explanatory rationales about potential policy violations. However, agent risks often arise when otherwise benign tasks are contaminated by untrusted external content, unsafe instructions, or risky tool use. Existing guardrails often flag the entire task uniformly as unsafe,...

arXiv CS 5d ago

Personality Shapes Gender Bias in Persona-Conditioned LLM Narratives Across English and Hindi: An Empirical Investigation

arXiv:2604.23600v2 Announce Type: replace Abstract: Large Language Models (LLMs) are increasingly deployed in persona-driven applications such as education, customer service, and social platforms, where models are prompted to adopt specific personas when interacting with users. While persona conditioning can improve user experience and engagement, it also raises concerns about how personality cues may interact with gender biases and stereotypes. In this work, we present a controlled study of...

arXiv CS 5d ago

Two-Phase Simulated Annealing for Equitable Team Formation: Eliminating Complaints in Large Engineering Cohorts

Announce Type: new Abstract: Contribution: This paper presents a novel two-phase algorithmic approach that decouples preference satisfaction from fairness optimization in student team formation, achieving both objectives without compromise. The method applies simulated annealing -- a core materials science technique -- to an educational challenge, demonstrating pedagogical integration of administrative processes. Background: Forming effective teams in large engineering cohorts (100+...

arXiv CS 2d ago

India now has 190 nuclear warheads: What's driving New Delhi's atomic buildup?

India’s estimated nuclear arsenal has increased from 180 warheads to 190 warheads, according to the latest assessment by the Stockholm International Peace Research Institute (SIPRI), reflecting New Delhi’s continued efforts to modernise its strategic deterrent amid a rapidly evolving security environment. The findings were released as part of SIPRI Yearbook 2026, which warns that the world is entering a new era of nuclear competition, with major powers increasingly relying on atomic weapons...

Times of India 2d ago

Beyond Knowledge to Agency: Evaluating Expertise, Autonomy, and Integrity in Finance with CNFinBench

arXiv:2512.09506v5 Announce Type: replace Abstract: As large language models (LLMs) become high-privilege agents in risk-sensitive settings, they introduce systemic threats beyond hallucination, where minor compliance errors can cause critical data leaks. However, existing benchmarks focus on rule-based QA, lacking agentic execution modeling, overlooking compliance drift in adversarial interactions, and relying on binary safety metrics that fail to capture behavioral degradation. To bridge...

arXiv CS 9d ago

Being Towards Death review – Chinese hospital comedy drama uses plucky patients to ask big questions

A debt-laden caregiver attempting suicide is the catalyst for him finding new meaning to life from a ward of terminally ill patients in touching ensemble drama‘You know the law of entropy? Life is a process of constant decay,” ssays a doctor in this Chinese hospital comedy drama – but not that you’d know it from the gabbling, frenetic first half-hour of director Chen Sicheng’s death-fixated film. Being Towards Death kicks off with caregiver Xiaobing (Jiang Long) about to throw himself off...

The Guardian UK 5d ago

The American Missile Crisis

Recent global conflicts, from Russia and Ukraine to Iran and Israel, have seen a resurgent awareness of the frailty of US munitions stock, which has been drawn down by both direct and indirect involvement in these events. While exact stockpile volumes are not disclosed, it is estimated that supplies of US warheads and the missiles that carry them have declined by nearly an order of magnitude since their peak during the Cuban Missile Crisis. Analysts have estimated that in the event of a...

Hacker News 7d ago

SIPRI: With peace elusive, nuclear weapons make a comeback

With peace elusive, nuclear weapons make a comeback June 8, 2026Many countries are ramping up their military capabilities — and nuclear weapons are back on the agenda. According to the Stockholm International Peace Research Institute (SIPRI), all nine nuclear-armed countries modernized and expanded their arsenals in 2025. In addition to new nuclear weapons, additional delivery systems have been introduced that can be equipped with both conventional and nuclear warheads.

Deutsche Welle 2d ago