Wikidata

Related Articles from SNS

Improving Cross-Lingual Factual Recall via Consistency-Driven Reinforcement Learning

arXiv:2606.06586v1. Abstract: Large language models (LLMs) trained predominantly on English data encode substantial world knowledge, yet often fail to express it reliably in other languages, a phenomenon known as cross-lingual factual inconsistency. To study and address this, we introduce PolyFact, a large-scale parallel multilingual factual QA dataset containing 100K Wikidata-grounded facts across 12 typologically diverse languages. Using PolyFact, we compare light...

arXiv CS 2d ago

Anatomy of Unlearning: The Dual Impact of Fact Salience and Model Fine-Tuning

Abstract: Machine Unlearning (MU) enables Large Language Models (LLMs) to remove unsafe or outdated information. However, existing work assumes that all facts are equally forgettable and largely ignores whether the forgotten knowledge originates from pretraining or supervised fine-tuning (SFT). In this paper, we introduce DUET (Dual Unlearning Evaluation across Training Stages), a benchmark of 28.6k Wikidata-derived triplets annotated with fact popularity using...

arXiv CS 8d ago

Institutions and the transmission of upper-tail human capital: scientific lineages across a millennium

Abstract: What made useful knowledge cumulative was not discovery alone but the institutions that transmitted it. We provide the first exhaustive structural measurement of the network through which upper-tail human capital passed from master to student across a millennium. Using 470,000 mentor-student records from Wikidata (which integrates the Mathematics Genealogy Project and MacTutor Archive), and all 64 historical Fields Medalists as a fixed, ex ante tracer set,...

arXiv CS 9d ago

Speaker Mining -- FAIR Data on Public Broadcasts for Question Answering

arXiv:2606.02905v1. Abstract: Public broadcasts are at the center of civic discourse: traditional television talk shows, alongside emerging podcast and web video formats, capture and guide the attention of our societies, shaping how citizens encounter politics, science, and societal issues. Yet, systematic or even simple analyses of these formats face similar challenges: guest and content metadata are scarce, fleeting, fragmented, and not standardized.

arXiv CS 7d ago

All 9,300 Japanese train stations, animated by the year each opened (1872–2026)

Eki · 駅 150 years of Japan, drawn in stations. On a June morning in 1872, Japan’s entire railway was a single line between Shimbashi and Yokohama. A century and a half later the map carries more than nine thousand stations.

Hacker News 3h ago