Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together, by Dilara Soylu and 2 other authors
Abstract: Natural Language Processing (NLP) systems are increasingly taking the form of sophisticated modular pipelines, e.g., Retrieval Augmented Generation (RAG), where each module may involve a distinct Language Model (LM) and an associated prompt template. These compound systems often lack intermediate labels or gradient flow to optimize each module, making their end-to-end optimization challenging. Here we seek strategies to optimize both the module-level LM weights and the associated prompt templates of such systems to maximize a downstream task metric. We propose, for the first time, combining the weight and prompt optimization strategies to optimize a modular LM pipeline by alternating between the two to get the same LM to teach itself. In experiments with multi-hop QA, mathematical reasoning, and feature-based classification using mistral-7b, llama-2-7b, and llama-3-8b, these BetterTogether strategies, which optimize the weights and prompts of a pipeline together, outperform directly optimizing weights alone and prompts alone by up to 60% and 6%, respectively, on average across LMs and tasks. The BetterTogether optimizer is released in DSPy at this http URL.
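A minimal sketch of the alternating recipe the abstract describes: the same LM pipeline is improved by alternating prompt optimization and weight fine-tuning, with each stage bootstrapping from the current best version of the program. The helpers `optimize_prompts`, `collect_successful_traces`, and `finetune_weights` below are hypothetical stand-ins for illustration, not the DSPy API.

```python
def better_together(program, trainset, metric, rounds=1):
    """Alternate prompt optimization and weight fine-tuning on one LM pipeline.

    This is a conceptual sketch; the named helper functions are assumptions,
    not library calls.
    """
    best = program
    for _ in range(rounds):
        # Step 1: search over prompt templates (e.g., bootstrapped few-shot
        # demonstrations) while the LM weights stay fixed.
        best = optimize_prompts(best, trainset, metric)

        # Step 2: run the prompt-optimized program on the training set, keep
        # the traces that score well under the metric, and fine-tune the LM
        # weights on them -- the LM "teaches itself".
        traces = collect_successful_traces(best, trainset, metric)
        best = finetune_weights(best, traces)

        # Step 3: re-optimize the prompts for the newly fine-tuned weights.
        best = optimize_prompts(best, trainset, metric)
    return best
```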
Submission history
From: Dilara Soylu
[v1] Mon, 15 Jul 2024 17:30:31 UTC (32 KB)
[v2] Mon, 7 Oct 2024 15:52:48 UTC (37 KB)