Graph-GRPO

Related Articles from SNS

Graph-GRPO: Training Graph Flow Models with Reinforcement Learning

Abstract: Graph generation is a fundamental task with broad applications, such as drug discovery. Recently, discrete flow matching-based graph generation, also known as the graph flow model (GFM), has emerged thanks to its superior performance and flexible sampling. However, effectively aligning GFMs with complex human preferences or task-specific objectives remains a significant challenge.

arXiv CS 1d ago

Graph-GRPO: Dependency-Aware Credit Assignment for Generative E-commerce Search Relevance

arXiv:2605.31003v1. Abstract: Search relevance modeling is a core task in e-commerce search systems, assessing how well a user query matches candidate products. Rather than relying on a single holistic matching signal, relevance judgment often requires structured reasoning over query understanding, product understanding, and facet-level matching. With large language models (LLMs), this process is increasingly formulated as chain-of-thought (CoT) reasoning and optimized with...

arXiv CS 9d ago