Lightweight LLMs for Court View Generation
A study explores lightweight large language models (LLMs) under 2B parameters for Criminal Court View Generation (CVG), a Legal AI task that produces court views from case facts. The research addresses four questions: the impact of LLM architecture on CVG quality and charge prediction, the role of model size, comparison with Deep Neural Networks (DNNs), and whether generating court views first improves charge prediction over direct prediction. The authors developed CVGEvalKit, an evaluation framework with three public datasets for CVG and charge prediction. Comprehensive experiments were conducted on this framework, with models trained on a mix of data.
Key facts
- Lightweight LLMs (smaller than 2B parameters) are explored for CVG.
- CVG is a critical task in Legal AI.
- Four key questions are addressed regarding architecture, size, comparison with DNNs, and prediction order.
- CVGEvalKit is an evaluation framework with three public datasets.
- Models are trained on a mix of data.
- The study is published on arXiv with ID 2605.16770.
- The announcement type is cross.
- The research systematically explores lightweight LLMs for CVG.
Entities
Institutions
- arXiv