
Studies that actually look inside the black box are marked in green text.
Additional papers
Why explainability research for LLMs is needed
Limitations of traditional interpretation methods
- Because LLMs have grown so large (increased complexity), traditional interpretation methods such as gradient-based approaches and SHAP values demand too much computational power (see the sketch below).
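As a minimal illustration of what such a gradient-based method computes, the sketch below runs an input-times-gradient attribution on a small fine-tuned classifier. This is not the survey's own method; the checkpoint name and example sentence are assumptions for illustration only.

```python
# A minimal sketch of a gradient-based local attribution:
# input-times-gradient saliency for a small fine-tuned sentiment classifier.
# The checkpoint name and example sentence are illustrative assumptions.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "distilbert-base-uncased-finetuned-sst-2-english"  # assumed small checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).eval()

inputs = tokenizer("The movie was surprisingly good.", return_tensors="pt")
embeddings = model.get_input_embeddings()(inputs["input_ids"])
embeddings.retain_grad()  # keep the gradient on this non-leaf tensor

# One forward pass on the embeddings, then one backward pass from the predicted logit.
logits = model(inputs_embeds=embeddings, attention_mask=inputs["attention_mask"]).logits
predicted_class = logits.argmax(dim=-1).item()
logits[0, predicted_class].backward()

# Per-token saliency: |embedding * gradient| summed over the hidden dimension.
saliency = (embeddings * embeddings.grad).abs().sum(dim=-1).squeeze(0)
for token, score in zip(tokenizer.convert_ids_to_tokens(inputs["input_ids"][0]), saliency):
    print(f"{token:>15s}  {score.item():.4f}")
```

Even this single explanation costs a full forward plus backward pass through the whole network, and perturbation-based scores such as SHAP multiply that by the number of samples or coalitions evaluated, which is why these methods become impractical at LLM scale.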
Contributions of the paper
Section 2. Training Paradigms of LLMs
Section 3. Explanation for Traditional Fine-Tuning Paradigm
- Here, local explanation aims to provide an understanding of how a language model makes a prediction for a specific input instance, while global explanation aims to provide a broad understanding of how the LLM works overall. The survey then discusses how explanations can be used to debug and improve models (Section 3.3).
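To contrast the two notions: a local explanation scores a single prediction (as in the saliency sketch above), whereas a global explanation asks what the model has learned overall, for example via a probing classifier trained on hidden representations. The sketch below is a toy probing example; the model name, layer index, and the toy probing task are assumptions for illustration.

```python
# A toy sketch of a probing-style global explanation: train a linear probe on one
# layer's [CLS] representations and check whether it can recover a simple property.
# Model name, probed layer, and the toy task (sentence length) are assumptions.
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import LogisticRegression

model_name = "bert-base-uncased"   # assumed base model
layer_to_probe = 6                 # assumed intermediate layer
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, output_hidden_states=True).eval()

# Toy probing task: does the representation encode whether the sentence is "long"?
sentences = ["Short one.",
             "This sentence is quite a bit longer than the other one.",
             "Okay.",
             "Another fairly long example sentence for the probe to classify."]
labels = [0, 1, 0, 1]

features = []
with torch.no_grad():
    for s in sentences:
        enc = tokenizer(s, return_tensors="pt")
        hidden = model(**enc).hidden_states[layer_to_probe]  # (1, seq_len, dim)
        features.append(hidden[0, 0].numpy())                # [CLS] vector

# A linear probe: if it separates the classes, the layer encodes the property.
probe = LogisticRegression(max_iter=1000).fit(features, labels)
print("probe accuracy on the toy set:", probe.score(features, labels))
```

Unlike the per-instance saliency score, the probe says something about the representations across a whole dataset, which is the sense in which probing counts as a global explanation.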

3.1 Local Explanation
3.2 Global Explanation
3.3 Making Use of Explanations
Section 4. Explanation for Prompting Paradigm
- Reasons why traditional explanation methods are unsuitable for LLMs:
- the aggressive surge in model scale
- Additionally, computationally demanding explanation techniques quickly become infeasible at the scale of hundreds of billions of parameters or more.
- Further, the intricate inner workings and reasoning processes of prompting-based models are too complex to be effectively captured by simplified surrogate models.
4.1 Base Model Explanation