Home › Knowledge Base › Learning Modular

Learning Modular

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

BRo-JEPA: Learning Modular Arithmetic in Latent Space

arXiv:2606.01372v1 Announce Type: new Abstract: Can neural networks learn abstract algebraic rules, or do they merely memorize training patterns? We investigate this using MNIST digits as states and modular arithmetic operations as actions in a JEPA-style latent world model.

arXiv CS 8d ago

Agent-R1: A Unified and Modular Framework for Agentic Reinforcement Learning

arXiv:2511.14460v2 Announce Type: replace Abstract: Large language models (LLMs) have rapidly evolved from single-turn text generators into the foundation of increasingly capable agents. As these agents take on more complex reasoning, decision making, tool use, and long-horizon tasks, reinforcement learning (RL) is becoming increasingly important for shaping their behavior. This shift is especially visible in agentic RL, where models must interact with tools and environments across multiple...

arXiv CS 8d ago

PE-MHL: Physics-Encoded Modular Hybrid Layers for Scalable Learning of Complex Systems

arXiv:2606.04290v1 Announce Type: new Abstract: Hybrid models that combine physics-based and data-driven components have shown strong potential for achieving accuracy and interpretability in control applications. While recent methods have made progress in incorporating physical consistency, challenges remain in scalability, robustness to noise, and control of model complexity. This paper proposes a Physics-Encoded Modular Hybrid Layer (PE-MHL) framework, in which a baseline physics-based...

arXiv CS 6d ago

ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning

Announce Type: replace Abstract: Reinforcement Learning from Human Feedback (RLHF) has become the standard for aligning Large Language Models (LLMs), yet its efficacy is bottlenecked by the high cost of acquiring preference data, especially in low-resource and expert domains. To address this, we introduce ACTIVEULTRAFEEDBACK, a modular active learning pipeline that leverages uncertainty estimates to dynamically identify the most informative responses for annotation. Our pipeline facilitates...

arXiv CS 8d ago

Admittance Sensitivity-Informed Modular GP for Scalable Topology-Adaptive Power-Flow Learning

arXiv:2606.03717v1 Announce Type: new Abstract: Data-driven approaches for learning power flow models suffer from weak generalization across varying network topologies and limited computational scalability. Existing methods typically rely on training over a large set of grid topologies, which becomes impractical for large networks. This paper proposes a scalable and computationally efficient framework for topology-adaptive learning of power flow solutions.

arXiv CS 7d ago

Multi-Agent Lipschitz Bandits

arXiv:2602.16965v2 Announce Type: replace Abstract: We study the decentralized multi-player stochastic bandit problem over a continuous, Lipschitz-structured action space where hard collisions yield zero reward. Our objective is to design a communication-free policy that maximizes collective reward, while separating coordination costs from learning costs. We propose a modular protocol that first solves the multi-agent coordination problem by identifying and seating players on distinct,...

arXiv CS 5d ago

WAM-Nav: Asymmetric Latent World-Action Modeling for Unified Visual Navigation

arXiv:2606.04907v1 Announce Type: new Abstract: Visual navigation requires generating smooth and collision-free trajectories under complex geometric and physical constraints. Existing reactive policies that directly map observations to actions lack anticipatory reasoning, limiting their ability to proactively avoid obstacles. While visual imagination offers predictive foresight, conventional modular approaches separate scene prediction from policy learning, often leading to error...

arXiv CS 6d ago

AlloGen: Conformation-Selective Binder Generation with Differential State Scoring

arXiv:2606.05474v1 Announce Type: cross Abstract: Protein binder design has largely optimized for affinity alone, leaving conformational selectivity unaddressed: for allosteric targets such as kinases, nuclear receptors, and GPCRs, a binder that engages both active and inactive states provides no functional specificity regardless of how tightly it binds. We introduce AlloGen, a modular framework that decouples backbone generation from a learned state-selectivity scorer $Q_\theta$, an...

arXiv CS 5d ago

Assign and Add: A Mechanistic Study of Compositional Arithmetic

arXiv:2605.31497v1 Announce Type: new Abstract: Large language models are able to compose skills in order to perform complex tasks, many of which might not have been seen during training. The details of how exactly this composition occurs remain elusive.

arXiv CS 9d ago

Orange Lab: Lowering Barriers to Data Mining through Embedded Interactive Workflows

arXiv:2606.09239v1 Announce Type: new Abstract: While visual programming of data analysis workflows has become an important vehicle for the democratization of data science, such systems remain largely confined to standalone applications and offer limited support for transitioning their visual analytics solutions into interactive web environments. As a result, data analysis pipelines are difficult to share, embed, and adapt into user-facing analytical tools. We present Orange Lab, a web-based...

arXiv CS 1d ago