Data
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data
Announce Type: replace Abstract: The growing demand for data-driven decision-making has created an urgent need for data agents that can reason over heterogeneous data (databases, documents, web content, images, videos, and audio) to answer complex analytical queries. However, evaluating such agents remains challenging: existing benchmarks often focus on isolated agent capabilities or limited data modalities, lacking comprehensive coverage of heterogeneous data and rigorous evaluation across...
DAD4TS: Data-Augmentation-Oriented Diffusion Model for Time-Series Forecasting with Small-Scale Data
arXiv:2605.17866v2 Announce Type: replace Abstract: Small-scale data is a critical problem in time-series forecasting tasks. Data augmentation is an effective strategy for this task, but it has a limitation in generating meaningful data. To address this limitation, we propose DAD4TS, a diffusion-model-based data augmentation method with reinforcement learning, designed for time-series forecasting with small-scale data.
KDH-CAD: Knowledge-data hybrid CAD learning under data scarcity
arXiv:2606.01702v1 Announce Type: new Abstract: Deep learning in computer-aided design (CAD) remains fundamentally constrained by the data scarcity challenge: authentic CAD data is difficult to collect at scale, while synthetic data may not faithfully reflect real design practice. Rather than pursuing ever-larger CAD datasets, this paper alternatively treats CAD learning as a knowledge completion and calibration problem. It introduces KDH-CAD, a knowledge-data hybrid framework that...
Data Want to be Free: An Innovation Resistance Theory Model for Identifying Barriers to Government Data Sharing
arXiv:2407.10883v2 Announce Type: replace Abstract: Data sharing is increasingly essential for digital government and data-driven innovation, yet many public organizations remain reluctant to make their data openly available. While prior research has examined factors influencing open data adoption, little theoretical work explores why resistance persists within public agencies.
Community Services Data Set and healthy child programme: data quality review, 2015 to 2025
Community Services Data Set and healthy child programme: data quality review, 2015 to 2025 This review is a summary of analysis of the completeness of the CSDS for children aged 0 to 4 years. Applies to England Documents Details This review is a summary of analysis of how complete the Community Services Data Set (CSDS) is for children aged 0 to 4 years.
Phantom Transfer: Data Poisoning can Survive Data-Level Defences
arXiv:2602.04899v2 Announce Type: replace Abstract: We present a data poisoning attack -- Phantom Transfer -- with the property that, even if you know precisely how the poison was placed into an otherwise benign dataset, you cannot filter it out. We achieve this by modifying subliminal learning to work in real-world contexts and demonstrate that the attack works regardless of which model produced the data, which model is trained on the data or what the attack target is. Furthermore, the...
Leopards, tigers and AI data, oh my! Nashville Zoo tries to halt proposed data center
A nationwide backlash against artificial intelligence data centers has a new ally: the leopards of the Nashville Zoo. The zoo, a popular destination in Tennessee’s capital city, is trying to block a proposed 69,000-square-foot data center from being built next door. The zoo says that the facility would be about 50 yards from some of its animals and that the noise could disturb its residents, including a leap of leopards that hail originally from Southeast Asia.
Data Flow Control: Data Safety Policies for AI Agents
Announce Type: new Abstract: Agents increasingly generate SQL, orchestrate pipelines, and automate data analysis on behalf of users. While recent work improves query correctness, correctness is not safety. A query may be semantically valid yet violate regulatory, privacy, or business constraints that govern how data may be combined and released.
Implicit Data Synthesis for Contrastive Unsupervised Data Augmentation
Announce Type: new Abstract: Scientific observations generate large quantities of unlabeled data which is laborious to hand-label, making unsupervised learning techniques valuable for processing datasets. Among these approaches, contrastive learning provides a convenient mechanism for extracting structural representations from unannotated datasets. For natural imagery, the general approach is to use a variety of data-space augmentation methods in order to generate synthetic samples; however,...
Data-efficient semi-supervised learning for flow estimation using unlabelled probe data
arXiv:2605.28245v2 Announce Type: replace Abstract: Estimating time-resolved velocity and pressure fields from Particle Image Velocimetry (PIV) remains challenging due to its limited temporal resolution in many applications. Data-driven approaches that combine snapshot PIV with high-frequency probe data have shown great promise in reconstructing the flow dynamics for advection-dominated flows; however, they typically exploit only the probe measurements directly synchronized with the PIV...