View a PDF of the paper titled LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization, by Masafumi Enomoto and 4 other authors
Abstract:Open-Domain Multi-Document Summarization (ODMDS) is the task of generating summaries from large document collections in response to user queries. This task is crucial for efficiently addressing diverse information needs from users. Traditional retrieve-then-summarize approaches fall short for open-ended queries in ODMDS tasks. These queries often require broader context than initially retrieved passages provide, making it challenging to retrieve all relevant information in a single search. While iterative retrieval methods has been explored for multi-hop question answering (MQA), it’s impractical for ODMDS due to high latency from repeated LLM inference. Accordingly, we propose LightPAL, a lightweight passage retrieval method for ODMDS. LightPAL leverages an LLM to pre-construct a graph representing passage relationships, then employs random walk during retrieval, avoiding iterative LLM inference. Experiments demonstrate that LightPAL outperforms naive sparse and pre-trained dense retrievers in both retrieval and summarization metrics, while achieving higher efficiency compared to iterative MQA approaches.
Submission history
From: Masafumi Enomoto [view email]
[v1]
Tue, 18 Jun 2024 10:57:27 UTC (407 KB)
[v2]
Thu, 17 Oct 2024 08:09:37 UTC (691 KB)
Source link
lol