MechaRule

Related Articles from SNS

Neuron-Anchored Rule Extraction for Large Language Models via Contrastive Hierarchical Ablation

arXiv:2605.03058v2 (announce type: replace)

Abstract: A central goal of explainable AI is to express large language model (LLM) decision logic symbolically and ground it in internal mechanisms. Existing rule-extraction methods usually learn ungrounded symbolic surrogates, while mechanistic interpretability links behavior to neurons but often requires hand-crafted hypotheses and costly interventions. We introduce MechaRule, a pipeline that grounds rule extraction in LLM circuits by localizing...

arXiv CS 1d ago