View a PDF of the paper titled RIRAG: Regulatory Information Retrieval and Answer Generation, by Tuba Gokhan and Kexin Wang and Iryna Gurevych and Ted Briscoe
Abstract:Regulatory documents, issued by governmental regulatory bodies, establish rules, guidelines, and standards that organizations must adhere to for legal compliance. These documents, characterized by their length, complexity and frequent updates, are challenging to interpret, requiring significant allocation of time and expertise on the part of organizations to ensure ongoing compliance. Regulatory Natural Language Processing (RegNLP) is a multidisciplinary field aimed at simplifying access to and interpretation of regulatory rules and obligations. We introduce a task of generating question-passages pairs, where questions are automatically created and paired with relevant regulatory passages, facilitating the development of regulatory question-answering systems. We create the ObliQA dataset, containing 27,869 questions derived from the collection of Abu Dhabi Global Markets (ADGM) financial regulation documents, design a baseline Regulatory Information Retrieval and Answer Generation (RIRAG) system and evaluate it with RePASs, a novel evaluation metric that tests whether generated answers accurately capture all relevant obligations while avoiding contradictions.
Submission history
From: Tuba Gokhan [view email]
[v1]
Mon, 9 Sep 2024 14:44:19 UTC (193 KB)
[v2]
Mon, 2 Dec 2024 18:13:28 UTC (1,944 KB)
Source link
lol