Home › Knowledge Base › Competitive Programming Solutions

Competitive Programming Solutions

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

arXiv:2602.20213v2 Announce Type: replace Abstract: The evaluation of Large Language Models (LLMs) for code generation relies heavily on the quality and robustness of test cases. However, existing benchmarks often lack coverage for subtle corner cases, allowing incorrect solutions to pass. To bridge this gap, we propose CodeHacker, an automated agent framework dedicated to generating targeted adversarial test cases that expose latent vulnerabilities in program submissions.

arXiv CS 7d ago

Where Do Large Language Models Fail on Competitive Programming? A Taxonomy of Failures by Algorithm Type and Difficulty Rating

arXiv:2606.05228v1 Announce Type: new Abstract: Large language models (LLMs) demonstrate increasing proficiency on competitive programming benchmarks, yet technical reports predominantly publish aggregate pass rates, obscuring domain-specific vulnerabilities. We present a systematic empirical study of LLM failure patterns using a balanced taxonomy of 315 Codeforces problems across seven algorithm categories and three difficulty tiers. We evaluate GPT-4o and Claude Sonnet 4.6 under strict...

arXiv CS 5d ago

CodeGolf Bench: A Multi-Language Benchmark for Evaluating Concise Code Generation Capabilities of Large Language Models

Announce Type: new Abstract: This paper introduces Code Bench, a benchmark capable of evaluating Large Language Models (LLMs) concise code generation abilities in 60 programming languages. Based on code golf, a recreational programming competition focused on minimal character or byte solutions, the benchmark provides a distinctive measure of LLMs ability to produce efficient, concise code. Unlike existing benchmarks limited by fixed problem sets and language coverage, CodeGolf Bench...

arXiv CS 9d ago

Cast a Wider Net: Coordinated Pass@K Policy Optimization for Code Reasoning

arXiv:2605.27000v2 Announce Type: replace Abstract: Repeated sampling with a verifier is the standard way to allocate test-time compute for code generation, with pass@$K$ as the canonical metric. Yet the standard policy class draws $K$ independent samples from a single answer distribution, so attempts often collapse onto near-duplicate reasoning paths and waste the budget on redundant rollouts. This failure is costly in competitive programming, where many problems admit multiple distinct...

arXiv CS 8d ago

Human-Like Neural Nets by Catapulting

Human-like Neural Nets by Catapulting Speculative proposal to create artificial neural nets with human-like performance by high-learning-rate/regularization training of overparameterized NNs to trigger catapulting/grokking. Over-parameterization as a route to true generalization would resolve many outstanding mysteries of artificial versus natural intelligence. There are many mysteries about deep learning and human intelligence, but we could describe the biggest anomaly this way: why are...

Hacker News 3d ago

Decomposable Neuro Symbolic Regression

Announce Type: replace Abstract: Symbolic regression (SR) models complex systems by discovering mathematical expressions that capture underlying relationships in observed data. However, most SR methods prioritize minimizing prediction error over identifying the governing equations, often producing overly complex or inaccurate expressions. To address this, we present a decomposable SR method that generates interpretable multivariate expressions leveraging transformer models, genetic...

arXiv CS 1d ago

Planning with Uncertainty: Symmetries, Policy Inference, and Solution Compression

arXiv:2403.19883v2 Announce Type: replace Abstract: Fully-observable non-deterministic (FOND) planning is at the core of artificial intelligence planning with uncertainty. It models uncertainty through actions with non-deterministic effects.

arXiv CS 7d ago

The need for a socialist planned economy (2021)

This article is a transcript of the presentation given by Vincent R. Beaudoin at Fightback’s Marxist Winter School 2021. When the Soviet Union collapsed in 1991, Francis Fukuyama told us that this was evidence of the failure of the planned economy and the success of the capitalist market economy, and that it represented the end of history. In October 2018, however, he changed his mind.

Hacker News 10d ago

What still needs answering in every QB room? 32 li...

No matter how many answers the NFL offseason provides, questions always remain. Especially about quarterbacks. Sure, your team might be all set at QB, but there might be questions around your quarterback or about his long-term situation.

ESPN 1d ago

Jan. 6 rally organizer getting another $1.2 million from taxpayers to promote ‘Trump Accounts’

Jan. 6 rally organizer getting another $1.2 million from taxpayers to promote ‘Trump Accounts’ First in The Independent: A Virginia firm that organized Donald Trump’s January 6, 2021, ‘Stop the Steal’ rally is set to receive a further $1.2 million in taxpayer funds to help promote ‘Trump Accounts’ - Bookmark A Virginia firm that organized and staged Donald Trump’s January 6, 2021, “Stop the Steal” rally, after which a violent mob of MAGA disciples stormed the U.S. Capitol building, is set to...

The Independent World 2d ago