Home › Knowledge Base › Data Shapley

Data Shapley

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

SurrogateSHAP: Training-Free Contributor Attribution for Text-to-Image (T2I) Models

Announce Type: replace Abstract: As Text-to-Image (T2I) diffusion models are increasingly used in real-world creative workflows, a principled framework for valuing contributors who provide a collection of data is essential for fair compensation and sustainable data marketplaces. While the Shapley value offers a theoretically grounded approach to attribution, it faces a dual computational bottleneck: (i) the prohibitive cost of exhaustive model retraining for each sampled subset of players...

arXiv CS 8d ago

From data to decisions: Bayesian modelling and global sensitivity analysis for flotation control

arXiv:2606.06173v1 Announce Type: new Abstract: This work presents a data-driven framework for interpretable modelling and decision support in flotation systems, integrating Gaussian Process (GP) regression with Global Sensitivity Analysis (GSA) via Sobol indices and local interpretability using SHapley Additive exPlanations (SHAP). Based on laboratory-scale experimental data, a static GP surrogate model is developed to capture how superficial air velocity, overflowing froth velocity, froth...

arXiv CS 5d ago

ShapDBM: Exploring Decision Boundary Maps in Shapley Space

Announce Type: replace Abstract: Decision Boundary Maps (DBMs) are an effective tool for visualising machine learning classification boundaries. Yet, DBM quality strongly depends on the dimensionality reduction (DR) technique and high dimensional space used for the data points. For complex ML data, DR can create many mixed classes which yield DBMs that are hard to use or even misleading.

arXiv CS 8d ago

ShapDBM: Exploring Decision Boundary Maps in Shapley Space

arXiv:2603.22235v2 Announce Type: replace Abstract: Decision Boundary Maps (DBMs) are an effective tool for visualising machine learning classification boundaries. Yet, DBM quality strongly depends on the dimensionality reduction (DR) technique and high dimensional space used for the data points. For complex ML data, DR can create many mixed classes which yield DBMs that are hard to use or even misleading.

arXiv CS 9d ago

Unifying and Optimizing Data Values for Selection via Sequential Decision-Making

arXiv:2502.04554v2 Announce Type: replace Abstract: Data selection has emerged as a crucial downstream application of data valuation, yet the theoretical foundations for using data values in selection remain underexplored. We reformulate data selection as a sequential decision-making problem where the optimal selection sequence arises from dynamic programming, and data values can be understood as encodings of this optimal sequence. This framework unifies and reinterprets existing methods...

arXiv CS 9d ago

Explaining a probabilistic prediction on the simplex with Shapley compositions

arXiv:2408.01382v3 Announce Type: replace Abstract: Originating in game theory, Shapley values are widely used for explaining a machine learning model's prediction by quantifying the contribution of each feature's value to the prediction. This requires a scalar prediction as in binary classification, whereas a multiclass probabilistic prediction is a discrete probability distribution, living on a multidimensional simplex. In such a multiclass setting the Shapley values are typically computed...

arXiv CS 6d ago

An Odd Estimator for Shapley Values

Announce Type: replace Abstract: The Shapley value is a ubiquitous framework for attribution in machine learning, encompassing feature importance, data valuation, and causal inference. However, its exact computation is generally intractable, necessitating efficient approximation methods. While the most effective and popular estimators leverage the paired sampling heuristic to reduce estimation error, the theoretical mechanism driving this improvement has remained opaque.

arXiv CS 9d ago

Beyond Additive Decompositions: Interpretability Through Separability

Announce Type: replace Abstract: Interpretable machine learning requires models that are accurate and structurally faithful to the data. Existing explainability methods rely heavily on additive representations (e.g., Generalized Additive Models (GAMs), SHapley Additive exPlanations (SHAP), functional ANOVA), which can suffer from signal cancellation and off-support extrapolation in the presence of strong interactions. We propose Tensor Separation Learning (TSL), a regression model that...

arXiv CS 8d ago

Beyond Additive Decompositions: Interpretability Through Separability

arXiv:2605.31200v1 Announce Type: new Abstract: Interpretable machine learning requires models that are accurate and structurally faithful to the data. Existing explainability methods rely heavily on additive representations (e.g., Generalized Additive Models (GAMs), SHapley Additive exPlanations (SHAP), functional ANOVA), which can suffer from signal cancellation and off-support extrapolation in the presence of strong interactions. We propose Tensor Separation Learning (TSL), a regression...

arXiv CS 9d ago

A Framework for Graph-Conditioned Hierarchical Shapley Attribution in Patent Valuation

arXiv:2606.01632v1 Announce Type: new Abstract: Estimating the economic contribution of a single patent inside a product that embodies tens of thousands of patents is a long-standing unsolved problem in intellectual property economics. We propose PatentXAI, a framework that treats patent valuation as a problem of explainable AI: given a characteristic function v(S) encoding the revenue achievable by patent subset S, a patent's Shapley value measures its fair share of product profit in a way...

arXiv CS 8d ago