Home Knowledge Base Q&A

Q&A

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Q-VGM: Q-Guided Value-Gradient Matching for Flow-Matching VLA Policies

Announce Type: new Abstract: We propose Q-Guided Value-Gradient Matching (Q-VGM), an off-policy reinforcement learning (RL) method that tackles a long-standing challenge in fine-tuning flow-matching vision-language-action (VLA) policies: efficiently improving an expressive flow-matching action expert with respect to a learned Q-function. Effective improvement must exploit the first-order (gradient) information of the critic, but this is difficult for flow policies, because directly...

arXiv CS 1d ago

SDM-Q: Cost-Aware Staged Decision-Making for Multi-Omics Classification with Deep Q-Learning

arXiv:2605.31014v1 Announce Type: new Abstract: Multi-omics data provide complementary molecular characterizations of disease phenotypes and play an important role in disease diagnosis and subtype classification in precision medicine. However, acquiring complete multi-omics profiles is expensive and time-consuming, while most existing deep learning methods assume full modality availability during inference, resulting in substantial redundancy and limited practicality in clinical settings. To...

arXiv CS 9d ago

SDM-Q: Cost-Aware Staged Decision-Making for Multi-Omics Classification with Deep Q-Learning

Announce Type: replace Abstract: Multi-omics data provide complementary molecular characterizations of disease phenotypes and play an important role in disease diagnosis and subtype classification in precision medicine. However, acquiring complete multi-omics profiles is expensive and time-consuming, while most existing deep learning methods assume full modality availability during inference, resulting in substantial redundancy and limited practicality in clinical settings. To address this...

arXiv CS 1d ago

A geometric $q$-analogue of Hamiltonian Monte Carlo

arXiv:2512.13246v3 Announce Type: replace Abstract: Hamiltonian Monte Carlo (HMC) generates efficient Markov transitions by combining Hamiltonian dynamics with a Metropolis correction. This paper develops a geometric \(q\)-analogue of HMC by replacing classical Hamiltonian dynamics with a \(q\)-deformed Hamiltonian system arising from \(q\)-calculus. Starting from a Lagrangian formulation, we derive the corresponding \(q\)-Hamiltonian equations and prove the formal invariance of the...

arXiv CS 2d ago

Late-Time Cosmology and Structure Formation in Quadratic $f(Q)$ Gravity

arXiv:2606.02660v1 Announce Type: new Abstract: We investigate the cosmological evolution associated with the quadratic symmetric teleparallel gravity framework, \( f(Q)=Q+\alpha Q^{2}+\beta \) where the relation \(Q\propto H^{2}\) generates an additional \(H^{4}\) contribution to the Friedmann equation. Using the exact algebraic solution for $H(z)$, we reconstruct the effective dark-energy sector and compare the background evolution with $\Lambda$CDM using Type Ia supernovae, BAO, and...

arXiv Physics 7d ago

A q-Tsallis Safe Approximation for Chance-Constrained Programs

Announce Type: new Abstract: Classical chance-constrained programs are solved by safe approximations based on the empirical CVaR, which uses a uniform measure over scenarios and systematically underweights tail events under heavy-tailed distributions. We introduce \emph{q-CCP}, a non-extensive safe approximation grounded in the Riemannian geometry of the Tsallis statistical manifold: the rank-based q-CVaR escort weights are the $g^{(q)}$-geodesic projection onto the tail simplex face, and...

arXiv CS 5d ago

Target Updates May Stabilize Linear Q-Learning: Periodic and Soft Dynamics

arXiv:2606.02645v1 Announce Type: cross Abstract: Periodic target updates in Q-learning and soft target updates in actor-critic methods are empirically well established stabilization mechanisms, but their precise theoretical explanation is still incomplete. This paper gives a rigorous and exact analysis of these mechanisms for Q-learning with linear function approximation (linear Q-learning) using the exact switched linear system (SLS) dynamics induced by the Bellman maximum and the joint...

arXiv CS 7d ago

A new open-shell CCSDTQ implementation and its application to the basis set convergence of post-CCSDT(Q) corrections in computational thermochemistry

arXiv:2605.19860v2 Announce Type: replace Abstract: We extend the CCSDTQ implementation in CFOUR to UHF and ROHF references and demonstrate its efficiency. We apply it to basis set convergence of post-CCSDT(Q) corrections for the W4-08 thermochemical dataset. Convergence of (Q)$_\Lambda$--(Q) is relatively rapid.

arXiv Physics 7d ago

On the Complexity of Offline Reinforcement Learning with $Q^\star$-Approximation and Partial Coverage

Announce Type: replace Abstract: We study offline reinforcement learning under $Q^\star$-approximation and partial coverage, a setting that motivates practical algorithms such as Conservative $Q$-Learning (CQL; Kumar et al., 2020) but has received limited theoretical attention. Our work is inspired by the following open question: "Are $Q^\star$-realizability and Bellman completeness sufficient for sample-efficient offline RL under partial coverage?" We answer in the negative via an...

arXiv CS 1d ago

Formal Foundations and Proof-Carrying Certificates for q-ary Covering Codes in Lean 4

Announce Type: new Abstract: Covering codes in finite Hamming spaces ask for small sets of words whose Hamming balls cover the whole space. This paper presents a Lean 4 formalization of the elementary theory of q-ary covering codes, centered on certificate predicates for upper bounds, lower bounds, and exact covering numbers $K_q(n,r)$. The formalization proves the q-ary Hamming-ball volume formula, the sphere-covering lower bound, elementary exact cases, product and relation rules, and...

arXiv CS 1d ago