Grounded but Misleading: Evaluating Semantic Alignment in AI-Generated Security Explanations

arXiv CS Friday 05 June 2026, 04:00 UTC By Heajun An, Connor Ng, Sandesh Sharma Dulal, Junghwan Kim, Jin-Hee Cho 1 min read

Key Points

arXiv:2602.05056v2 Announce Type: replace Abstract: Online scams increasingly leverage fluent and context-aware social engineering strategies, creating growing demand for AI systems that explain why a message may be risky. However, explanations that cite detector-derived evidence may still semantically weaken or redirect the intended risk interpretation. We introduce VEXA: Verifying Semantic Explanation Alignment, a controlled testbed for studying the gap between lexical grounding and semantic risk alignment in AI-generated scam-risk explanations. VEXA generates ungrounded, risk-aligned, and risk-diluting explanations by independently controlling evidence grounding and semantic framing. Through LLM-as-a-judge and human evaluations, we show that explanations may continue to appear comparatively grounded even when their semantic interpretation weakens the detector's intended risk assessment. In human evaluation, risk-diluting XAI-grounded explanations retained comparatively elevated Perceived Evidence Grounding scores (3.66) despite lower Helpfulness (3.00) and Reasoning Support (3.14) scores. These findings provide controlled evidence of grounding illusion effects in AI-generated security explanations and suggest that trustworthy explanation evaluation must verify not only whether evidence is cited, but also how that evidence is interpreted.

LLM (ORG)

Originally published by arXiv CS Read original →

Nasa chief defends choice of all-male Artemis III crew Critics fear the agency is following Trump’s order to eliminate diversity and inclusion efforts despite its vow to put a woman on the moon Nasa’s administrator Jared Isaacman on Wednesday defended the make-up of the space agency’s latest Artemis crew, an all-male group. The nominations have earned criticism that Nasa may have acted in accordance with US President Donald Trump’s direction to eliminate diversity and inclusion efforts....

South China Morning Post 19m ago

The asteroid that wiped out the dinosaurs may have created a vast underground habitat for life that lasted 8 million years

The asteroid that wiped out the dinosaurs may have created a vast underground habitat for life that lasted 8 million years The Chicxulub impact may have actually helped nurture life while destroying it, too. The asteroid impact that doomed the dinosaurs may also have built one of Earth's longest-lasting underground ecosystems. When a roughly 6-mile-wide (10-kilometer-wide) asteroid slammed into what is now Mexico's Yucatán Peninsula 66 million years ago, it triggered a global catastrophe...

Space.com 21m ago

See the 'crawling,' ball-shaped robot that rolled around the moon during Japan's historic first landing

See the 'crawling,' ball-shaped robot that rolled around the moon during Japan's historic first landing A morphable moon robot operated for 100 minutes in 2024, allowing investigators to get images of an upside-down spacecraft on the lunar surface. When the Japanese Smart Lander for Investigating Moon (SLIM) spacecraft, nicknamed the "Moon Sniper," face-planted onto the lunar surface in 2024, an experimental rover told Earth scientists what happened. Rolling autonomously through the lunar...

Live Science 21m ago

Grounded but Misleading: Evaluating Semantic Alignment in AI-Generated Security Explanations

Related Stories

'Worrying' pollution in Cotswolds river - volunteers

Nasa chief defends choice of all-male Artemis III crew

The asteroid that wiped out the dinosaurs may have created a vast underground habitat for life that lasted 8 million years

See the 'crawling,' ball-shaped robot that rolled around the moon during Japan's historic first landing