Home › Knowledge Base › \(L_1\

\(L_1\

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Decentralized Stochastic Nonconvex Optimization under the $(L_0,L_1)$-Smoothness

arXiv:2509.08726v3 Announce Type: replace-cross Abstract: This paper focuses on the decentralized stochastic optimization problem $f(\mathbf{x})=\frac{1}{m}\sum_{i=1}^m f_i(\mathbf{x})$ over a connected network of $n$ agents, where each local function has the form of $f_i(\mathbf{x}) = {\mathbb E}\left[F(\mathbf{x};{\boldsymbol \xi}_i)\right]$ which satisfies the $(L_0,L_1)$-smooth condition but possibly nonconvex and each random variable ${\boldsymbol \xi}_i$ follows distribution ${\mathcal...

arXiv CS 7d ago

Truncated Huber Penalty for Sparse Signal Recovery with Convergence Analysis

arXiv:2504.04509v2 Announce Type: replace Abstract: Sparse signal recovery from under-determined systems presents significant challenges when using conventional L_0 and L_1 penalties, primarily due to computational complexity and estimation bias. This paper introduces a truncated Huber penalty, a non-convex metric that effectively bridges the gap between unbiased sparse recovery and differentiable optimization. The proposed penalty applies quadratic regularization to small entries while...

arXiv CS 5d ago

Sharp lower error bounds for strong approximation of SDEs with a drift coefficient of H\"older or Sobolev regularity using a Weierstra{\ss} scale

arXiv:2504.20728v2 Announce Type: replace-cross Abstract: We study strong approximation of solutions of SDEs with bounded $\alpha$-H\"older continuous drift coefficient and constant diffusion coefficient at time point $1$. Recently, it was shown in [arXiv:1909.07961v4 (2021)] that for such SDEs the equidistant Euler scheme achieves an $L^p$-error rate of at least $(1+\alpha)/2$, up to an arbitrary small $\varepsilon$, for all $p\geq 1$ and $\alpha\in (0,1]$, in terms of the number of...

arXiv CS 5d ago

Which Defense Closes Which Threat? Attributing OWASP-LLM-Top-10 Coverage and Its Brittleness Under Paraphrasing

arXiv:2606.02822v1 Announce Type: new Abstract: Production LLM applications stack several defense families -- refusal-phrase filters, token-budget controls, model allowlists, rate limits, tool-registry authentication -- yet existing breach-and-attack-simulation (BAS) benchmarks report a single aggregate coverage number, hiding which family closes which threat. We measure attribution. We add four OWASP-LLM-Top-10-aware agents to a 21-agent baseline scanner and target a lattice of four...

arXiv CS 7d ago

Song Yadong holds strong at bantamweight

Song Yadong and Deiveson Figueiredo fought on Saturday in a meeting of top-10 bantamweights, and while Song secured his No. 7 position in the ESPN divisional rankings, Figueiredo fell only one spot to No. 9 with the second-round submission loss. It was Song's fourth straight fight against a former UFC champion, and he is 2-2 in those contests. At 28 years old, the Chinese fighter has time to make up ground among 135-pound contenders.

ESPN 8d ago

A posteriori existence for the Keller-Segel model via a finite volume - finite element scheme

arXiv:2509.17710v2 Announce Type: replace Abstract: We derive two forms of conditional a posteriori error estimates for a finite volume scheme approximating the parabolic-elliptic Keller-Segel system. The estimates control the error in the $L^\infty(0,T, L^2(\Omega))$- and $L^2(0,T;H^1(\Omega))$-norm and exhibit linear convergence in the mesh size, as observed in numerical experiments. Crucially, we show that, as long as the condition of the error estimate is satisfied, a weak solution exists.

arXiv CS 6d ago

Approximation and learning of anisotropic and mixed smooth functions by deep ReLU neural networks

Announce Type: cross Abstract: This paper studies how efficiently deep ReLU neural networks can approximate and learn smooth functions. When the error is measured in $L^p([0,1]^d)$ norm and the approximator is a network with width $W$ and depth $L$, recent works have proven the supper approximation rate $\mathcal{O}((WL)^{-2s/d})$ for Besov space $\mathcal{B}^s_{q,r}([0,1]^d)$ under the Sobolev embedding condition $s/d>1/q-1/p$. In order to overcome the curse of dimensionality in this rate,...

arXiv CS 9d ago

$p$-adic Bi-Filtrations for Topological Machine Learning on Genomic Sequences

arXiv:2606.06117v1 Announce Type: cross Abstract: We introduce pVR, a topological machine learning framework for alignment-free genomic sequence classification that combines $p$-adic numbers with topological data analysis. Each DNA sequence is encoded along two complementary axes: a $p$-adic distance on $k$-mer prefixes, which captures hierarchical positional structure, and a compositional $L_1$ distance on $k$-mer frequencies, which captures local sequence content. The two distances jointly...

arXiv CS 5d ago

Cheap Reward Hacking Detection

arXiv:2606.08893v1 Announce Type: new Abstract: A small transformer encoder is trained to map Terminal-Wrench trajectories onto a unit sphere where embedding distance approximates the $L_1$ distance between reward and metadata signals. A linear probe on top of that embedding detects reward hacking on the cleaned test split with AUC $0.9467$ and TPR@5%FPR $0.8296$, matching the TW sanitized LLM-as-judge AUC ($0.9510$ on the cleaned split) and exceeding its TPR@5%FPR ($0.7130$ vs $0.8296$) on...

arXiv CS 1d ago

Implicit Regularization for Multi-label Feature Selection

arXiv:2411.11436v2 Announce Type: replace Abstract: In this paper, we address the problem of feature selection in the context of multi-label learning, by using a new estimator based on implicit regularization and label embedding. Unlike the sparse feature selection methods that use a penalized estimator with explicit regularization terms such as $l_{2,1}$-norm, MCP or SCAD, we propose a simple alternative method via Hadamard product parameterization. In order to guide the feature selection...

arXiv CS 8d ago