PALoRA: A Two-Stage Framework for Knowledge Injection in LLMs
There's this new approach called PALoRA that's trying to solve the issue of keeping both flexibility and stability in large language models, or LLMs. It helps these models integrate knowledge more effectively while still being able to reason well. The idea comes from the realization that important reasoning data is actually spread out in the weight matrices of multilayer perceptrons, rather than just being in the obvious areas. First, PALoRA trains an expert using Singular Value Fine-Tuning on a reasoning dataset. It then uses the derived scaling vector as a steady reference to find sensitive directions, which helps reduce conflicts when updating factual knowledge. You can check out the details in arXiv preprint 2605.24549.
Key facts
- PALoRA stands for Projection-Adaptive LoRA.
- It addresses the plasticity-stability dilemma in LLMs.
- Reasoning-critical information is distributed across the singular spectrum.
- PALoRA uses a two-stage framework for knowledge injection.
- Stage one trains an SVF expert on a reasoning dataset.
- The SVF expert's singular scaling vector serves as a frozen geometric probe.
- The method reduces interference with existing reasoning abilities.
- The preprint is available on arXiv with ID 2605.24549.
Entities
Institutions
- arXiv