Home › Knowledge Base › Data

Data

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data

Announce Type: replace Abstract: The growing demand for data-driven decision-making has created an urgent need for data agents that can reason over heterogeneous data (databases, documents, web content, images, videos, and audio) to answer complex analytical queries. However, evaluating such agents remains challenging: existing benchmarks often focus on isolated agent capabilities or limited data modalities, lacking comprehensive coverage of heterogeneous data and rigorous evaluation across...

arXiv CS 9d ago

DAD4TS: Data-Augmentation-Oriented Diffusion Model for Time-Series Forecasting with Small-Scale Data

arXiv:2605.17866v2 Announce Type: replace Abstract: Small-scale data is a critical problem in time-series forecasting tasks. Data augmentation is an effective strategy for this task, but it has a limitation in generating meaningful data. To address this limitation, we propose DAD4TS, a diffusion-model-based data augmentation method with reinforcement learning, designed for time-series forecasting with small-scale data.

arXiv CS 7d ago

KDH-CAD: Knowledge-data hybrid CAD learning under data scarcity

arXiv:2606.01702v1 Announce Type: new Abstract: Deep learning in computer-aided design (CAD) remains fundamentally constrained by the data scarcity challenge: authentic CAD data is difficult to collect at scale, while synthetic data may not faithfully reflect real design practice. Rather than pursuing ever-larger CAD datasets, this paper alternatively treats CAD learning as a knowledge completion and calibration problem. It introduces KDH-CAD, a knowledge-data hybrid framework that...

arXiv CS 8d ago

Data Want to be Free: An Innovation Resistance Theory Model for Identifying Barriers to Government Data Sharing

arXiv:2407.10883v2 Announce Type: replace Abstract: Data sharing is increasingly essential for digital government and data-driven innovation, yet many public organizations remain reluctant to make their data openly available. While prior research has examined factors influencing open data adoption, little theoretical work explores why resistance persists within public agencies.

arXiv CS 1d ago

Community Services Data Set and healthy child programme: data quality review, 2015 to 2025

Community Services Data Set and healthy child programme: data quality review, 2015 to 2025 This review is a summary of analysis of the completeness of the CSDS for children aged 0 to 4 years. Applies to England Documents Details This review is a summary of analysis of how complete the Community Services Data Set (CSDS) is for children aged 0 to 4 years.

GOV.UK Statistics 23h ago

Phantom Transfer: Data Poisoning can Survive Data-Level Defences

arXiv:2602.04899v2 Announce Type: replace Abstract: We present a data poisoning attack -- Phantom Transfer -- with the property that, even if you know precisely how the poison was placed into an otherwise benign dataset, you cannot filter it out. We achieve this by modifying subliminal learning to work in real-world contexts and demonstrate that the attack works regardless of which model produced the data, which model is trained on the data or what the attack target is. Furthermore, the...

arXiv CS 7d ago

Leopards, tigers and AI data, oh my! Nashville Zoo tries to halt proposed data center

A nationwide backlash against artificial intelligence data centers has a new ally: the leopards of the Nashville Zoo. The zoo, a popular destination in Tennessee’s capital city, is trying to block a proposed 69,000-square-foot data center from being built next door. The zoo says that the facility would be about 50 yards from some of its animals and that the noise could disturb its residents, including a leap of leopards that hail originally from Southeast Asia.

NBC News 4d ago

Data Flow Control: Data Safety Policies for AI Agents

Announce Type: new Abstract: Agents increasingly generate SQL, orchestrate pipelines, and automate data analysis on behalf of users. While recent work improves query correctness, correctness is not safety. A query may be semantically valid yet violate regulatory, privacy, or business constraints that govern how data may be combined and released.

arXiv CS 5d ago

Implicit Data Synthesis for Contrastive Unsupervised Data Augmentation

Announce Type: new Abstract: Scientific observations generate large quantities of unlabeled data which is laborious to hand-label, making unsupervised learning techniques valuable for processing datasets. Among these approaches, contrastive learning provides a convenient mechanism for extracting structural representations from unannotated datasets. For natural imagery, the general approach is to use a variety of data-space augmentation methods in order to generate synthetic samples; however,...

arXiv CS 2d ago

Data-efficient semi-supervised learning for flow estimation using unlabelled probe data

arXiv:2605.28245v2 Announce Type: replace Abstract: Estimating time-resolved velocity and pressure fields from Particle Image Velocimetry (PIV) remains challenging due to its limited temporal resolution in many applications. Data-driven approaches that combine snapshot PIV with high-frequency probe data have shown great promise in reconstructing the flow dynamics for advection-dominated flows; however, they typically exploit only the probe measurements directly synchronized with the PIV...

arXiv Physics 9d ago