GRASP: Plan-Guided Graph Retrieval with Adaptive Fusion and Reranking on Semi-Structured Knowledge Bases

Researchers have developed GRASP, a three-stage retrieval framework that substantially improves how AI systems search semi-structured knowledge bases combining text and entity graphs. The approach integrates plan-guided graph traversal with dense retrieval and learned reranking, achieving a 19-point lift in Hit@1 accuracy across benchmark datasets. This work matters because semi-structured KBs power high-stakes applications from medical search to e-commerce discovery, and GRASP's modular design sidesteps the brittleness of end-to-end graph generators while outperforming existing hybrid methods. The result signals growing sophistication in retrieval-augmented systems that must reason over both unstructured text and structured relational data.

Modelwire context

Explainer

GRASP's actual novelty is architectural, not just empirical. The framework deliberately avoids training a single end-to-end model to generate graph traversals, instead composing three separate stages (planning, retrieval, reranking) that can each be tuned independently. This modularity is the opposite of the current trend toward unified neural pipelines.

This connects directly to the reasoning framework pattern we covered in PPC (the preplan paper from today). Both papers argue that explicit intermediate representations (problem diagnosis in PPC, plan-guided traversal in GRASP) outperform conflating understanding with execution. Where PPC surfaces problem type before solution strategy, GRASP surfaces traversal intent before dense retrieval. The difference is domain: PPC targets mathematical reasoning, GRASP targets knowledge base search. Both reject end-to-end black boxes in favor of interpretable stages.

If GRASP's 19-point Hit@1 gain holds when tested on out-of-distribution knowledge bases (different domains, graph structures, or entity types than STaRK), the modularity claim is validated. If performance degrades significantly on novel graph schemas, the gains may reflect overfitting to the benchmark's specific structure rather than a general retrieval principle.

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsGRASP · STaRK

Read full story at arXiv cs.CL →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.