arxiv:2603.29557

FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration

Published on Mar 31

· Submitted by

Mathsion Wong on Apr 1

IP Intelligence

Upvote

Authors:

Qiyao Wang ,

Abstract

FlowPIE presents a novel retrieval-generation framework for scientific idea generation that uses flow-guided Monte Carlo Tree Search and genetic algorithm principles to produce diverse, high-quality ideas through iterative evolution and cross-domain knowledge integration.

AI-generated summary

Scientific idea generation (SIG) is critical to AI-driven autonomous research, yet existing approaches are often constrained by a static retrieval-then-generation paradigm, leading to homogeneous and insufficiently divergent ideas. In this work, we propose FlowPIE, a tightly coupled retrieval-generation framework that treats literature exploration and idea generation as a co-evolving process. FlowPIE expands literature trajectories via a flow-guided Monte Carlo Tree Search (MCTS) inspired by GFlowNets, using the quality of current ideas assessed by an LLM-based generative reward model (GRM) as a supervised signal to guide adaptive retrieval and construct a diverse, high-quality initial population. Based on this population, FlowPIE models idea generation as a test-time idea evolution process, applying selection, crossover, and mutation with the isolation island paradigm and GRM-based fitness computation to incorporate cross-domain knowledge. It effectively mitigates the information cocoons arising from over-reliance on parametric knowledge and static literature. Extensive evaluations demonstrate that FlowPIE consistently produces ideas with higher novelty, feasibility and diversity compared to strong LLM-based and agent-based frameworks, while enabling reward scaling during test time.

View arXiv page View PDF Project page GitHub 10 Add to collection

Community

QiYao-Wang

Paper author Paper submitter 1 day ago

FlowPIE tackles scientific idea generation by combining flow-guided exploration with an evolutionary process, constructing a high-quality, diverse initial idea population and continuously improving it through selection, crossover, and cross-domain mutation. This framework overcomes reward plateaus in pure exploration and demonstrates strong test-time scaling, producing ideas with superior novelty, feasibility, and diversity compared to existing methods.