T3: A Novel Zero-shot Transfer Learning Framework Iteratively Training on an Assistant Task for a Target Task

stp2y, January 20, 2025


[Submitted on 26 Sep 2024 (v1), last revised 17 Jan 2025 (this version, v2)]

By Xindi Tong and 3 other authors


Abstract: Long text summarization, which is becoming essential for efficiently processing large volumes of information, remains challenging for Large Language Models (LLMs) such as the GPT and LLaMA families because open-source training datasets are scarce and the task demands careful handling of contextual details. To address this issue, we design a novel zero-shot transfer learning framework, abbreviated as T3, that iteratively trains a baseline LLM on an assistant task for the target task, where the former should have richer data resources and share structural or semantic similarity with the latter. In practice, T3 is applied to the long text summarization task with question answering as the assistant task, and its effectiveness is validated on the BBC summary, NarraSum, FairytaleQA, and NLQuAD datasets, with up to nearly 14% improvement in ROUGE, 35% improvement in BLEU, and 16% improvement in Factscore compared to three baseline LLMs, demonstrating its potential for more assistant-target task combinations.
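For readers unfamiliar with the ROUGE metric cited in the abstract, the following is a minimal sketch of ROUGE-1 F1 (unigram overlap between a generated summary and a reference). It is an illustration only: the function name `rouge1_f1` and the whitespace tokenization are assumptions made here, and the abstract does not state which ROUGE variant or implementation the authors used.

```python
from collections import Counter


def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between a candidate and a reference summary.

    Hypothetical helper for illustration; tokenization is a plain
    lowercase whitespace split, which real evaluation toolkits refine.
    """
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())

    # Clipped matches: each unigram counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((cand_counts & ref_counts).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    summary = "the cat"
    reference = "the cat sat on the mat"
    print(round(rouge1_f1(summary, reference), 3))
```

A short candidate that copies only reference words scores perfect precision but low recall, which is why the F1 form is commonly reported for summarization.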

Submission history

From: Liang Xu
[v1] Thu, 26 Sep 2024 08:44:38 UTC (209 KB)
[v2] Fri, 17 Jan 2025 04:26:44 UTC (217 KB)

