TD

Related Articles from SNS

Fast and Robust Convergence Rate for TD(0) with Linear Function Approximation, Universal Learning Steps and I.I.D. Samples

arXiv:2606.05967v2 Announce Type: replace-cross Abstract: In this paper, we study the finite-time behavior of the TD(0) temporal-difference method with linear function approximation (LFA). We consider on-policy independent and identically distributed (i.i.d.) samples, a constant learning step, and the Polyak-Juditsky averaging method.

arXiv CS 2d ago
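
The setting named in the abstract above (TD(0) with linear function approximation, i.i.d. on-policy samples, a constant learning step, and iterate averaging) can be illustrated with a minimal sketch. This is not the paper's algorithm or analysis: the function name `td0_lfa_averaged`, its parameters, and the small Markov reward process used to generate samples are all hypothetical choices made for illustration, and the averaging shown is a plain running mean of the iterates.

```python
import numpy as np


def td0_lfa_averaged(P, r, Phi, gamma, step, n_samples, seed=0):
    """TD(0) with linear features, constant step, and averaged iterates.

    Hypothetical illustration: P is the state-transition matrix under the
    evaluated policy, r the per-state reward, Phi the feature matrix
    (one row per state). Samples are drawn i.i.d. from the stationary
    distribution rather than along a single trajectory.
    """
    rng = np.random.default_rng(seed)
    n_states = P.shape[0]

    # Approximate the stationary distribution by power iteration
    # (assumes the chain is ergodic).
    mu = np.full(n_states, 1.0 / n_states)
    for _ in range(1000):
        mu = mu @ P
    mu = mu / mu.sum()

    theta = np.zeros(Phi.shape[1])      # last iterate
    theta_bar = np.zeros_like(theta)    # running average of iterates

    for t in range(1, n_samples + 1):
        # i.i.d. sample: state from mu, next state from P[s].
        s = rng.choice(n_states, p=mu)
        s_next = rng.choice(n_states, p=P[s])

        # TD error and semi-gradient update with a constant step.
        delta = r[s] + gamma * Phi[s_next] @ theta - Phi[s] @ theta
        theta = theta + step * delta * Phi[s]

        # Running mean of the iterates.
        theta_bar += (theta - theta_bar) / t

    return theta, theta_bar
```

With one-hot (tabular) features the TD fixed point coincides with the true value function, which gives a simple sanity check for the averaged iterate on a toy chain.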

A Robust $\widetilde{\mathcal{O}}(1/\sqrt{T})$ Rate for Unprojected TD Learning with Linear Function Approximation

Announce Type: replace Abstract: We investigate the finite-time convergence properties of Temporal Difference (TD) learning with linear function approximation, a cornerstone of reinforcement learning. We are interested in the so-called "robust" setting, where the convergence guarantee does not depend on the potential function's minimal curvature. While prior work has established convergence guarantees in this setting, these results typically rely on the artificial assumption that each...

arXiv CS 1d ago

BMO Hires TD’s van Arragon for Domestic Business Banking Role

A Bank of Montreal (BMO) branch in Vancouver, British Columbia, Canada, on Wednesday, March 18, 2026. Canada's finance minister Francois-Philippe Champagne announced the government's new $10 cap on non-sufficient funds (NSF) fees, which is expected to save Canadians more than $600 million annually.

Bloomberg Markets 8d ago

Estimating spatially adjusted temperature-dependent time-varying reproduction numbers for vector-borne diseases

Estimating the effective reproduction number is crucial for understanding and managing infectious disease outbreaks. For vector-borne diseases like dengue, transmission depends on environmental and spatial conditions: temperature affects the extrinsic incubation period in mosquitoes, altering transmission timing, while spatial proximity can lead to clusters of transmission. We integrated a temperature-dependent (TD) generation time (GT) distribution and a spatial decay function weighting...

bioRxiv 6d ago

Bayesian Tensor Decomposition with Diffusion Model Prior

Announce Type: new Abstract: Low-rank tensor decomposition (TD) is usually effective on clean, fully observed data, but it often degrades under severe missingness or noise. Low-rankness is itself a useful but limited structural prior, and additional handcrafted priors (e.g., sparsity or smoothness) still fall short of capturing the rich statistics of real-world data. To compensate for this weak inductive bias under heavy corruption, one would like to inject a learned, data-driven prior;...

arXiv CS 7d ago

NavOne: One-Step Global Planning for Vision-Language Navigation on Top-Down Maps

arXiv:2605.06317v4 Announce Type: replace Abstract: Existing Vision-Language Navigation (VLN) methods typically adopt an egocentric, step-by-step paradigm, which struggles with error accumulation and limits efficiency. While recent approaches attempt to leverage pre-built environment maps, they often rely on incrementally updating memory graphs or scoring discrete path proposals, which restricts continuous spatial reasoning and creates discrete bottlenecks. We propose Top-Down VLN (TD-VLN),...

arXiv CS 1d ago

Neutrophil-Derived Oncostatin M Contributes to Endothelial Cell Dysfunction During Treponema denticola interaction

Periodontitis (PD) is a common chronic inflammatory condition and a risk factor for cardiovascular diseases (CVD), yet the underlying linking mechanisms remain unclear. The cytokine Oncostatin M (OSM) is elevated in both PD and CVD and has emerged as a potential mediator linking oral inflammation to vascular dysfunction. Neutrophils represent a prominent source of OSM during PD, and OSM production is elevated by the periodontal pathobiont Treponema denticola (Td).

bioRxiv 8d ago

Brain dynamics supporting high cognitive performance reorganize after midlife

Quantifying functional brain aging trajectories at scale remains a fundamental challenge due to the scanner-bound limitations of traditional neuroimaging. Here, we deploy whole-head Time-Domain functional Near-Infrared Spectroscopy (TD-fNIRS) to map task-evoked cortical dynamics during a 30-minute cognitive battery across the adult lifespan (N = 302, age 18 to 87, 45% racial or ethnic minority). We developed a robust General Cognitive Factor (GCF) tracking age-related performance decline (r...

bioRxiv 5d ago

Curriculum-Adapted Robust Reinforcement Learning for UAV Deconfliction in Adversarial Environments

Announce Type: replace Abstract: Autonomous unmanned aerial vehicles (UAVs) increasingly rely on reinforcement learning (RL) for navigation. However, global navigation satellite system (GNSS) spoofing attacks can induce out-of-distribution observation shifts that corrupt value estimation and degrade mission performance. Existing robust RL approaches typically improve resilience against specific attack models but often fail to generalize to attacks not encountered during training.

arXiv CS 7d ago