A Technical Review of Tools and Platforms for Training and Fine-Tuning Large Language Models
Efficient Fine-Tuning Techniques and Parameter-Efficient Fine-Tuning (PEFT) Frameworks
This group of papers focuses on lowering the compute barrier to fine-tuning large language models. It covers core algorithms (e.g., QLoRA), unified fine-tuning workflow tools (LLaMA-Factory, PEFT-Factory), and adapter-integration frameworks (LLM-Adapters), along with comparative experiments on the performance of LoRA/QLoRA across different models.
- QLoRA: Efficient Finetuning of Quantized LLMs(Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer, 2023, arXiv (Cornell University))
- LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models(Yaowei Zheng, Richong Zhang, Junhao Zhang, Yanhan Ye, Zheyan Luo, 2024, No journal)
- PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models(Robert Belanec, Ivan Srba, Mária Bieliková, 2025, ArXiv.org)
- LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models(Zhiqiang Hu, Lei Wang, Yihuai Lan, Wanyu Xu, Ee‐Peng Lim, Lidong Bing, Xing Xu, Soujanya Poria, Roy Lee, 2023, No journal)
- Analyzing LLAMA3 Performance on Classification Task Using LoRA and QLoRA Techniques(Rajvardhan Patil, Priyanka Khot, Venkat N. Gudivada, 2025, Applied Sciences)
- Metric-Based Comparison of Fine-Tuned LLaMA 2 and Mixtral Large Language Models for Instruction Tasks(Bohdan M. Pavlyshenko, Ivan Bulka, 2024, Electronics and Information Technologies)
Domain Adaptation and Task-Specific Capability Enhancement
These papers examine how fine-tuning or data construction can equip general-purpose large models with domain-specific or function-specific capabilities. Research directions include enhancing Chinese-language proficiency, using external tools (APIs), automating code review, aligning with recommendation systems, and vertical applications such as classical Chinese poetry generation.
- Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca(Yiming Cui, Ziqing Yang, Xin Yao, 2023, arXiv (Cornell University))
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs(Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun, 2023, arXiv (Cornell University))
- LLaMA-Reviewer: Advancing Code Review Automation with Large Language Models through Parameter-Efficient Fine-Tuning(Junyi Lu, Lei Yu, Xiaojia Li, Yang Li, Chun Zuo, 2023, No journal)
- TALLRec: An Effective and Efficient Tuning Framework to Align Large Language Model with Recommendation(Keqin Bao, Jizhi Zhang, Yang Zhang, Wenjie Wang, Fuli Feng, Xiangnan He, 2023, No journal)
- A Simple Approach of Chinese Poetry Generation Using Pre-trained LLMs(Yei-Zen Tang, Zhengchen Li, J. Y. Yu, Yang Liu, 2025, No journal)
Enterprise Deployment Strategies, Knowledge Injection, and Security Hardening
This group focuses on deploying large models in enterprise settings. It compares fine-tuning with retrieval-augmented generation (RAG) for knowledge injection, offers practical guidelines for fine-tuning on proprietary enterprise data, presents secure fine-tuning schemes under differential privacy, and surveys the broader challenges of LLMs in enterprise informatization.
- Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs(Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha, 2024, No journal)
- Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations(Mathav Raj J, Kushala VM, Harikrishna Warrier, Yogesh Kumar Gupta, 2024, arXiv (Cornell University))
- Hardening LLM Fine-Tuning: From Differentially Private Data Selection to Trustworthy Model Quantization(Zehang Deng, Ruoxi Sun, Minhui Xue, Wanlun Ma, Sheng Wen, Surya Nepal, Yang Xiang, 2025, IEEE Transactions on Information Forensics and Security)
- 大语言模型在企业信息化中的应用探讨 [Applications of Large Language Models in Enterprise Informatization](刘浩东, 2025, 电子商务评论)
Model Architecture Evolution, Alignment Techniques, and Multimodal Surveys
This group offers a broader technical perspective on the evolution of large-model architectures, covering the iteration experience of Chinese-developed model families such as ChatGLM, the development history of Transformer-based NLP models, and an in-depth survey of the shift from single-modality models to general-purpose multimodal foundation models.
- ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools(Team GLM: Aohan Zeng, Bin Xu, et al., 2024, arXiv (Cornell University))
- 基于Transformer的自然语言处理模型综述 [A Survey of Transformer-Based Natural Language Processing Models](赖鸣姝, 2023, 人工智能与机器人研究)
- Multimodal Foundation Models: From Specialists to General-Purpose Assistants(Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang, Linjie Li, Lijuan Wang, Jianfeng Gao, 2024, Foundations and Trends® in Computer Graphics and Vision)
Taken together, these references map the technical landscape of large language models, from underlying training algorithms and efficient fine-tuning toolchains, through domain-specific capability extension, to enterprise deployment strategies and security defenses. The research focus has shifted from simply scaling model parameters toward: 1. building integrated tooling platforms such as LLaMA-Factory and PEFT-Factory; 2. exploring parameter-efficient fine-tuning schemes such as LoRA and QLoRA to reduce resource consumption; 3. studying model alignment and generalization on specific tasks (tool use, code review, recommendation, etc.); and 4. weighing fine-tuning against RAG for knowledge updating while attending to data privacy and security.
A total of 18 related references.
Large Language Models (LLMs) have demonstrated remarkable performance across diverse domains, thereby prompting researchers to explore their potential for use in recommendation systems. Initial attempts have leveraged the exceptional capabilities of LLMs, such as rich knowledge and strong generalization through In-context Learning, which involves phrasing the recommendation task as prompts. Nevertheless, the performance of LLMs in recommendation tasks remains suboptimal due to a substantial disparity between the training tasks for LLMs and recommendation tasks, as well as inadequate recommendation data during pre-training. To bridge the gap, we consider building a Large Recommendation Language Model by tuning LLMs with recommendation data. To this end, we propose an efficient and effective Tuning framework for Aligning LLMs with Recommendations, namely TALLRec. We have demonstrated that the proposed TALLRec framework can significantly enhance the recommendation capabilities of LLMs in the movie and book domains, even with a limited dataset of fewer than 100 samples. Additionally, the proposed framework is highly efficient and can be executed on a single RTX 3090 with LLaMA-7B. Furthermore, the fine-tuned LLM exhibits robust cross-domain generalization. Our code and data are available at https://github.com/SAI990323/TALLRec.
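As a rough illustration of how a user interaction can be phrased as an instruction-tuning sample for this kind of recommendation tuning, the sketch below builds a binary "would the user enjoy this item" example; the field names and prompt wording are assumptions for illustration, not TALLRec's released format.

```python
# Illustrative only: field names and prompt wording are assumptions, not TALLRec's exact format.
def build_rec_sample(user_history, candidate_item, label):
    """Phrase one recommendation interaction as an instruction-tuning example."""
    liked = ", ".join(title for title, enjoyed in user_history if enjoyed)
    disliked = ", ".join(title for title, enjoyed in user_history if not enjoyed)
    prompt = (
        "Given the user's preferences, answer Yes or No.\n"
        f"Liked items: {liked or 'none'}\n"
        f"Disliked items: {disliked or 'none'}\n"
        f"Would the user enjoy \"{candidate_item}\"?"
    )
    return {"instruction": prompt, "output": "Yes" if label else "No"}

sample = build_rec_sample(
    user_history=[("The Matrix", True), ("Titanic", False)],
    candidate_item="Blade Runner",
    label=True,
)
print(sample["instruction"])
print(sample["output"])
```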
The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by finetuning open-access LLMs with task-specific data (e.g., ChatDoctor) or instruction data (e.g., Alpaca). Among the various fine-tuning methods, adapter-based parameter-efficient fine-tuning (PEFT) is undoubtedly one of the most attractive topics, as it only requires fine-tuning a few external parameters instead of the entire LLM while achieving comparable or even better performance. To enable further research on PEFT methods of LLMs, this paper presents LLM-Adapters, an easy-to-use framework that integrates various adapters into LLMs and can execute these adapter-based PEFT methods of LLMs for different tasks. The framework includes state-of-the-art open-access LLMs such as LLaMA, BLOOM, and GPT-J, as well as widely used adapters such as Series adapters, Parallel adapters, Prompt-based learning and Reparametrization-based methods. Moreover, we conduct extensive empirical studies on the impact of adapter types, placement locations, and hyper-parameters on the best design for each adapter-based method. We evaluate the effectiveness of the adapters on fourteen datasets from two different reasoning tasks, Arithmetic Reasoning and Commonsense Reasoning. The results demonstrate that using adapter-based PEFT in smaller-scale LLMs (7B) with few extra trainable parameters yields comparable, and in some cases superior, performance to powerful LLMs (175B) in zero-shot inference on simple math reasoning datasets.
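The LLM-Adapters codebase itself is not reproduced here, but the Hugging Face peft library exposes the same families of methods; the sketch below (model id and hyperparameter values are placeholders) shows how a reparametrization-based adapter (LoRA) or a prompt-based one (prefix tuning) is attached so that only the adapter weights train.

```python
# A minimal sketch using the Hugging Face `peft` library rather than the LLM-Adapters fork;
# the model id and hyperparameter values are placeholders.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, PrefixTuningConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")

# Reparametrization-based method: LoRA on BLOOM's fused attention projection.
lora_cfg = LoraConfig(task_type="CAUSAL_LM", r=8, lora_alpha=16,
                      target_modules=["query_key_value"])
# Prompt-based method: prefix tuning with 20 virtual tokens.
prefix_cfg = PrefixTuningConfig(task_type="CAUSAL_LM", num_virtual_tokens=20)

model = get_peft_model(base, lora_cfg)   # swap in prefix_cfg to compare adapter families
model.print_trainable_parameters()       # only the small adapter weights are trainable
```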
We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained on ten trillion tokens mostly in Chinese and English, along with a small set of corpus from 24 languages, and aligned primarily for Chinese and English usage. The high-quality alignment is achieved via a multi-stage post-training process, which involves supervised fine-tuning and learning from human feedback. Evaluations show that GLM-4 1) closely rivals or outperforms GPT-4 in terms of general metrics such as MMLU, GSM8K, MATH, BBH, GPQA, and HumanEval, 2) gets close to GPT-4-Turbo in instruction following as measured by IFEval, 3) matches GPT-4 Turbo (128K) and Claude 3 for long context tasks, and 4) outperforms GPT-4 in Chinese alignments as measured by AlignBench. The GLM-4 All Tools model is further aligned to understand user intent and autonomously decide when and which tool(s) to use -- including web browser, Python interpreter, text-to-image model, and user-defined functions -- to effectively complete complex tasks. In practical applications, it matches and even surpasses GPT-4 All Tools in tasks like accessing online information via web browsing and solving math problems using Python interpreter. Over the course, we have open-sourced a series of models, including ChatGLM-6B (three generations), GLM-4-9B (128K, 1M), GLM-4V-9B, WebGLM, and CodeGeeX, attracting over 10 million downloads on Hugging Face in the year 2023 alone. The open models can be accessed through https://github.com/THUDM and https://huggingface.co/THUDM.
The automation of code review activities, a long-standing pursuit in software engineering, has been primarily addressed by numerous domain-specific pre-trained models. Despite their success, these models frequently demand extensive resources for pre-training from scratch. In contrast, Large Language Models (LLMs) provide an intriguing alternative, given their remarkable capabilities when supplemented with domain-specific knowledge. However, their potential for automating code review tasks remains largely unexplored. In response to this research gap, we present LLaMA-Reviewer, an innovative framework that leverages the capabilities of LLaMA, a popular LLM, in the realm of code review. Mindful of resource constraints, this framework employs parameter-efficient fine-tuning (PEFT) methods, delivering high performance while using less than 1% of trainable parameters. An extensive evaluation of LLaMA-Reviewer is conducted on two diverse, publicly available datasets. Notably, even with the smallest LLaMA base model consisting of 6.7B parameters and a limited number of tuning epochs, LLaMA-Reviewer equals the performance of existing code-review-focused models. The ablation experiments provide insights into the influence of various fine-tuning process components, including input representation, instruction tuning, and different PEFT methods. To foster continuous progress in this field, the code and all PEFT-weight plugins have been made open-source.
Despite the advancements of open-source large language models (LLMs), e.g., LLaMA, they remain significantly limited in tool-use capabilities, i.e., using external tools (APIs) to fulfill human instructions. The reason is that current instruction tuning largely focuses on basic language tasks but ignores the tool-use domain. This is in contrast to the excellent tool-use capabilities of state-of-the-art (SOTA) closed-source LLMs, e.g., ChatGPT. To bridge this gap, we introduce ToolLLM, a general tool-use framework encompassing data construction, model training, and evaluation. We first present ToolBench, an instruction-tuning dataset for tool use, which is constructed automatically using ChatGPT. Specifically, the construction can be divided into three stages: (i) API collection: we collect 16,464 real-world RESTful APIs spanning 49 categories from RapidAPI Hub; (ii) instruction generation: we prompt ChatGPT to generate diverse instructions involving these APIs, covering both single-tool and multi-tool scenarios; (iii) solution path annotation: we use ChatGPT to search for a valid solution path (chain of API calls) for each instruction. To enhance the reasoning capabilities of LLMs, we develop a novel depth-first search-based decision tree algorithm. It enables LLMs to evaluate multiple reasoning traces and expand the search space. Moreover, to evaluate the tool-use capabilities of LLMs, we develop an automatic evaluator: ToolEval. Based on ToolBench, we fine-tune LLaMA to obtain an LLM ToolLLaMA, and equip it with a neural API retriever to recommend appropriate APIs for each instruction. Experiments show that ToolLLaMA demonstrates a remarkable ability to execute complex instructions and generalize to unseen APIs, and exhibits comparable performance to ChatGPT. Our ToolLLaMA also demonstrates strong zero-shot generalization ability in an out-of-distribution tool-use dataset: APIBench.
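A generic depth-first search over candidate API-call sequences conveys the control flow of such tree search; this is only loosely inspired by ToolLLM's DFSDT, with `propose_calls` and `is_solved` standing in for the LLM-driven expansion and termination checks, so it is a sketch rather than the released implementation.

```python
# Generic DFS over API-call chains; not ToolLLM's released implementation.
def dfs_solution_path(state, propose_calls, is_solved, max_depth=5, budget=None):
    """Return a chain of API calls that solves `state`, or None if none is found in budget."""
    if budget is None:
        budget = {"remaining": 20}          # cap on total node expansions
    if is_solved(state):
        return []
    if max_depth == 0 or budget["remaining"] <= 0:
        return None
    for call, next_state in propose_calls(state):   # candidate expansions, best-first
        budget["remaining"] -= 1
        tail = dfs_solution_path(next_state, propose_calls, is_solved,
                                 max_depth - 1, budget)
        if tail is not None:
            return [call] + tail            # prepend this call to the solution path
    return None                             # dead end: backtrack to the caller
```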
Critical infrastructures are increasingly integrating artificial intelligence (AI) technologies, including large language models (LLMs), into essential systems and services that are vital to societal functioning. Fine-tuning LLMs for specific domain tasks is crucial for their effective deployment in these contexts, but this process must carefully address both privacy and security concerns. Without proper safeguards, such integration can introduce additional risks, such as data leakage during training and diminished model trustworthiness due to the need for model compression to operate within limited bandwidth and computational capacity constraints. In this paper, we propose the Hardening LLM Fine-tuning framework (HARDLLM), which addresses these challenges through two key components: (i) we develop a differentially private data selection method that ensures privacy protection by training the model exclusively on sampled and synthesized public data, thereby preventing any direct use of private data and enhancing leakage resilience throughout the training process, and (ii) we introduce a trustworthiness-aware model quantization approach to improve LLM performance, such as reducing toxicity, enhancing adversarial robustness, and mitigating stereotypes, while maintaining negligible impact on model utility. Experimental results show that the proposed algorithm ensures differential privacy when the privacy budget is set at ϵ = 0.5, with only a 1% drop in accuracy, while other state-of-the-art methods experience an accuracy drop of at least 20% under the same privacy budget. Additionally, our quantization approach improves the trustworthiness of fine-tuned LLMs by an average of 3-4%, with only a negligible utility loss (approximately 1%) at a 50% compression rate.
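For intuition on the privacy side, a textbook exponential-mechanism sampler is sketched below: it picks public training examples with probability weighted by a utility score under a privacy budget ϵ. This is a generic sketch of differentially private selection, not HARDLLM's actual data-selection algorithm; the scores and parameters are placeholders.

```python
# Generic exponential-mechanism selection sketch (not the paper's method).
import numpy as np

def dp_select(scores, k, epsilon, sensitivity=1.0, seed=0):
    """Sample k indices with probability proportional to exp(epsilon * score / (2 * sensitivity))."""
    rng = np.random.default_rng(seed)
    scores = np.asarray(scores, dtype=float)
    logits = epsilon * scores / (2.0 * sensitivity)
    probs = np.exp(logits - logits.max())       # stabilized softmax weights
    probs /= probs.sum()
    return rng.choice(len(scores), size=k, replace=False, p=probs)

# Example: scores measure how well each public/synthetic sample matches the target domain.
print(dp_select(scores=[0.9, 0.1, 0.7, 0.4], k=2, epsilon=0.5))
```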
This monograph presents a comprehensive survey of the taxonomy and evolution of multimodal foundation models that demonstrate vision and vision-language capabilities, focusing on the transition from specialist models to general-purpose assistants. The research landscape encompasses five core topics, categorized into two classes. (i) We start with a survey of well-established research areas: multimodal foundation models pre-trained for specific purposes, including two topics – methods of learning vision backbones for visual understanding and text-to-image generation. (ii) Then, we present recent advances in exploratory, open research areas: multimodal foundation models that aim to play the role of general-purpose assistants, including three topics – unified vision models inspired by large language models (LLMs), end-to-end training of multimodal LLMs, and chaining multimodal tools with LLMs. The target audiences of the monograph are researchers, graduate students, and professionals in computer vision and vision-language multimodal communities who are eager to learn the basics and recent advances in multimodal foundation models.
We present QLoRA, an efficient finetuning approach that reduces memory usage enough to finetune a 65B parameter model on a single 48GB GPU while preserving full 16-bit finetuning task performance. QLoRA backpropagates gradients through a frozen, 4-bit quantized pretrained language model into Low Rank Adapters (LoRA). Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99.3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU. QLoRA introduces a number of innovations to save memory without sacrificing performance: (a) 4-bit NormalFloat (NF4), a new data type that is information-theoretically optimal for normally distributed weights, (b) double quantization to reduce the average memory footprint by quantizing the quantization constants, and (c) paged optimizers to manage memory spikes. We use QLoRA to finetune more than 1,000 models, providing a detailed analysis of instruction following and chatbot performance across 8 instruction datasets, multiple model types (LLaMA, T5), and model scales that would be infeasible to run with regular finetuning (e.g. 33B and 65B parameter models). Our results show that QLoRA finetuning on a small high-quality dataset leads to state-of-the-art results, even when using smaller models than the previous SoTA. We provide a detailed analysis of chatbot performance based on both human and GPT-4 evaluations showing that GPT-4 evaluations are a cheap and reasonable alternative to human evaluation. Furthermore, we find that current chatbot benchmarks are not trustworthy to accurately evaluate the performance levels of chatbots. A lemon-picked analysis demonstrates where Guanaco fails compared to ChatGPT. We release all of our models and code, including CUDA kernels for 4-bit training.
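A minimal QLoRA-style setup with transformers, peft, and bitsandbytes is sketched below; the model id, rank, and target modules are placeholders rather than the paper's exact Guanaco recipe.

```python
# Hedged sketch: 4-bit NF4 base model with LoRA adapters on top (QLoRA-style fine-tuning).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_cfg = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",            # 4-bit NormalFloat quantization
    bnb_4bit_use_double_quant=True,       # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b", quantization_config=bnb_cfg, device_map="auto"
)
model = prepare_model_for_kbit_training(model)   # cast norms/outputs for stable k-bit training

lora_cfg = LoraConfig(task_type="CAUSAL_LM", r=64, lora_alpha=16, lora_dropout=0.05,
                      target_modules=["q_proj", "k_proj", "v_proj", "o_proj"])
model = get_peft_model(model, lora_cfg)          # gradients flow only into the LoRA adapters
model.print_trainable_parameters()
```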
Large language models (LLMs), consisting of billions and trillions of parameters, have demonstrated exceptional ability in natural language understanding (NLU) and natural language generation (NLG) tasks. Increases in their numbers of parameters and model sizes have resulted in better performance and accuracy. However, models with such enormous numbers of parameters incur significant computational costs and resources, making them challenging to fine-tune and adapt to a specific downstream task. Several parameter-efficient fine-tuning (PEFT) techniques have been proposed to address this issue. This study demonstrates the improvement obtained over the base LLaMA3-8B model using two prominent PEFT techniques: LoRA and QLoRA. We use the sequence classification task of sentiment analysis to conduct the experiments. Additionally, we analyze the effects of hyperparameter adjustments (r and α) on the model's performance. We examine the tradeoff between efficiency and memory savings obtained using the quantized LoRA (QLoRA) technique. We also investigate and compare how the performance of the LoRA and QLoRA techniques changes when the adapters are applied only to the attention layers (query, key, value, and output projection) versus to all the linear layers during fine-tuning. We report the findings of our work along with limitations and future directions.
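Assuming the two placements compared in the study map onto standard peft configurations, the contrast looks roughly like the sketch below; the r and lora_alpha values are illustrative rather than the paper's grid, and the "all-linear" shorthand requires a recent peft release.

```python
# Illustrative LoRA placements for a sequence-classification head; values are placeholders.
from peft import LoraConfig

attn_only = LoraConfig(
    task_type="SEQ_CLS", r=16, lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # attention projections only
)
all_linear = LoraConfig(
    task_type="SEQ_CLS", r=16, lora_alpha=32,
    target_modules="all-linear",                              # every linear layer in the model
)
```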
No abstract
Large Language Models (LLMs), such as ChatGPT and GPT-4, have dramatically transformed natural language processing research and shown promising strides towards Artificial General Intelligence (AGI). Nonetheless, the high costs associated with training and deploying LLMs present substantial obstacles to transparent, accessible academic research. While several large language models, such as LLaMA, have been open-sourced by the community, these predominantly focus on English corpora, limiting their usefulness for other languages. In this paper, we propose a method to augment LLaMA with capabilities for understanding and generating Chinese text and its ability to follow instructions. We achieve this by extending LLaMA's existing vocabulary with an additional 20,000 Chinese tokens, thereby improving its encoding efficiency and semantic understanding of Chinese. We further incorporate secondary pre-training using Chinese data and fine-tune the model with Chinese instruction datasets, significantly enhancing the model's ability to comprehend and execute instructions. Our experimental results indicate that the newly proposed model markedly enhances the original LLaMA's proficiency in understanding and generating Chinese content. Additionally, the results on the C-Eval dataset yield competitive performance among the models with several times the size of ours. We have made our pre-trained models, training scripts, and other resources available through GitHub, fostering open research for our community. Chinese LLaMA series: \url{https://github.com/ymcui/Chinese-LLaMA-Alpaca} and Chinese Llama-2 series: \url{https://github.com/ymcui/Chinese-LLaMA-Alpaca-2}
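The vocabulary-extension step can be sketched with standard transformers calls: add new tokens, then resize the embedding (and tied LM-head) matrix so the new ids get trainable rows. The two sample tokens and the model id below are placeholders standing in for the roughly 20,000 Chinese tokens the paper merges in.

```python
# Hedged sketch of vocabulary extension before secondary pre-training on Chinese text.
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("huggyllama/llama-7b")
model = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b")

new_tokens = ["自然语言", "大语言模型"]        # stand-ins for a full Chinese subword vocabulary
num_added = tokenizer.add_tokens(new_tokens)

# New ids get randomly initialized embedding rows, learned during continued pre-training.
model.resize_token_embeddings(len(tokenizer))
print(f"added {num_added} tokens, new vocab size = {len(tokenizer)}")
```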
The paper considers a comprehensive analysis and comparative study of two advanced Large Language Models (LLMs), namely LLaMA 2 and Mixtral, with a specific focus on their performance in executing instructional tasks. These models were fine-tuned using techniques such as LoRA and QLoRA, which were applied to extensive instruction datasets. The fine-tuning process was further enhanced by the implementation of Parameter-Efficient Fine-Tuning (PEFT) on NVIDIA A100 Tensor Core GPU instances, ensuring optimal performance. Both LLaMA 2 and Mixtral models were fine-tuned using the Hugging Face and PyTorch platforms, ensuring that similar parameters were maintained to facilitate a fair comparison. Inference was then run on data not used in the initial training phase. This approach was adopted to test the models' ability to generalize and adapt to new, unseen data, thereby providing a more robust evaluation of their performance. An evaluation framework was established using the RAGAS library. The framework was designed to provide precise and reliable metrics, offering a comprehensive assessment of the models' performance. While the LLaMA 2 model demonstrates a faster rate of fine-tuning, it is susceptible to overfitting. On the other hand, Mixtral, despite requiring more time for training, outperforms it in evaluations, making it a more dependable tool for instructional tasks. Keywords: LLMs, PEFT, LoRA, QLoRA, Mixtral, LLaMA, LLM fine-tuning
There is a compelling necessity from enterprises for fine-tuning LLMs (Large Language Models) to get them trained on proprietary domain knowledge. The challenge is to imbibe the LLMs with domain-specific knowledge using the most optimal resources and cost and in the best possible time. Many enterprises rely on RAG (Retrieval Augmented Generation), which does not need LLMs to be fine-tuned, but they are limited by the quality of vector databases and their retrieval capabilities rather than the intrinsic capabilities of the LLMs themselves. In our current work we focus on fine-tuning LLaMA, an open source LLM, using proprietary documents and code from an enterprise repository and use the fine-tuned models to evaluate the quality of responses. As part of this work, we aim to guide beginners on how to start fine-tuning an LLM for documentation and code by making educated guesses on the size of GPU required and the options available for formatting the data. We also propose preprocessing recipes for both documentation and code to prepare datasets in different formats. The proposed methods of data preparation for document datasets are forming paragraph chunks, forming question and answer pairs and forming keyword and paragraph chunk pairs. For code datasets we propose forming summary and function pairs. Further, we qualitatively evaluate the results of the models for domain specific queries. Finally, we also propose practical guidelines and recommendations for fine-tuning LLMs.
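Two of the proposed preparation recipes, paragraph chunks and keyword/paragraph-chunk pairs, can be sketched in a few lines; the chunk-size limit and the frequency-based keyword heuristic below are assumptions for illustration, not the paper's exact recipe.

```python
# Hedged sketch of document preprocessing into chunks and keyword/chunk pairs.
import re
from collections import Counter

def paragraph_chunks(text, max_words=200):
    """Greedily group paragraphs into chunks of roughly max_words words."""
    chunks, current = [], []
    for para in re.split(r"\n\s*\n", text.strip()):
        words = para.split()
        if current and len(current) + len(words) > max_words:
            chunks.append(" ".join(current))
            current = []
        current.extend(words)
    if current:
        chunks.append(" ".join(current))
    return chunks

def keyword_chunk_pairs(chunks, top_k=5):
    """Pair each chunk with its most frequent long words as crude keywords."""
    pairs = []
    for chunk in chunks:
        words = [w.lower() for w in re.findall(r"[A-Za-z]{5,}", chunk)]
        keywords = [w for w, _ in Counter(words).most_common(top_k)]
        pairs.append({"keywords": keywords, "chunk": chunk})
    return pairs
```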
Large language models (LLMs) encapsulate a vast amount of factual information within their pre-trained weights, as evidenced by their ability to answer diverse questions across different domains. However, this knowledge is inherently limited, relying heavily on the characteristics of the training data. Consequently, using external datasets to incorporate new information or refine the capabilities of LLMs on previously seen information poses a significant challenge. In this study, we compare two common approaches: unsupervised fine-tuning and retrieval-augmented generation (RAG). We evaluate both approaches on a variety of knowledge-intensive tasks across different topics. Our findings reveal that while unsupervised fine-tuning offers some improvement, RAG consistently outperforms it, both for existing knowledge encountered during training and entirely new knowledge. Moreover, we find that LLMs struggle to learn new factual information through unsupervised fine-tuning, and that exposing them to numerous variations of the same fact during training could alleviate this problem.
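The RAG baseline in such comparisons boils down to retrieve-then-prompt; a minimal sketch follows, where `embed` is a hypothetical sentence-embedding function (any encoder would do) and the prompt wording is an assumption.

```python
# Minimal retrieve-then-prompt sketch of a RAG baseline; `embed` is hypothetical.
import numpy as np

def retrieve(question, chunks, embed, top_k=3):
    """Return the top_k chunks most similar to the question by cosine similarity."""
    q = embed([question])[0]
    c = embed(chunks)
    sims = c @ q / (np.linalg.norm(c, axis=1) * np.linalg.norm(q) + 1e-8)
    best = np.argsort(sims)[::-1][:top_k]
    return [chunks[i] for i in best]

def rag_prompt(question, chunks, embed):
    context = "\n\n".join(retrieve(question, chunks, embed))
    return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {question}"
```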
Parameter-Efficient Fine-Tuning (PEFT) methods address the increasing size of Large Language Models (LLMs). Currently, many newly introduced PEFT methods are challenging to replicate, deploy, or compare with one another. To address this, we introduce PEFT-Factory, a unified framework for efficient fine-tuning of LLMs using both off-the-shelf and custom PEFT methods. While its modular design supports extensibility, it natively provides a representative set of 19 PEFT methods, 27 classification and text generation datasets addressing 12 tasks, and both standard and PEFT-specific evaluation metrics. As a result, PEFT-Factory provides a ready-to-use, controlled, and stable environment, improving replicability and benchmarking of PEFT methods. PEFT-Factory is a downstream framework that originates from the popular LLaMA-Factory, and is publicly available at https://github.com/kinit-sk/PEFT-Factory.
This paper explores a simple approach to Chinese poetry generation using pre-trained large language models (LLMs), specifically the Qwen1.5 model series. Leveraging the capabilities of these advanced models, the authors further pre-trained and fine-tuned them on customized poetry datasets using LLaMA Factory. The resulting model, Xuejiu-Poem, aims to produce authentic and aesthetically pleasing traditional Chinese poems. Results demonstrate that specialized training can surpass the performance of larger models, such as GPT-4o, in specific poetry generation tasks. The generated poems exhibit strong adherence to traditional formats and stylistic conventions of classical Chinese poetry, underscoring the potential of LLMs in creative applications. This study provides valuable insights into the application of LLMs to literary tasks and suggests promising avenues for future research in classical literature generation.
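A minimal generation sketch against an off-the-shelf Qwen1.5 chat checkpoint is shown below; the model size, prompt, and sampling settings are placeholders, and the paper's fine-tuned Xuejiu-Poem weights would be substituted for the base model.

```python
# Hedged sketch of prompting a Qwen1.5 chat model for a classical Chinese poem.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-1.8B-Chat"   # small stand-in; swap in a fine-tuned poetry checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

messages = [{"role": "user", "content": "请以《秋夜》为题写一首七言绝句。"}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```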
With the rapid development of artificial intelligence, large language models have shown great application potential and value in enterprise informatization. This paper examines a range of application scenarios for LLMs in enterprise informatization, including intelligent customer service, internal knowledge management, business process automation, and market analysis and marketing optimization. It analyzes the challenges these applications face at the model, implementation, and societal levels, and proposes corresponding countermeasures. As LLM technology matures further and its application ecosystem continues to improve, its role in enterprise informatization will become increasingly important.
Natural language processing is a branch of deep learning in computer science that aims to enable computers to understand, parse, or generate human language (including text, audio, and more). This paper surveys the many types of models in natural language processing (NLP) derived from the Transformer architecture. In recent years, with the rapid development of deep learning, the performance of NLP models has improved dramatically and more NLP tasks have been solved more effectively, progress that is largely due to the continued development of neural network models. The paper covers the most popular families of Transformer-based NLP models, including the BERT (Bidirectional Encoder Representations from Transformers), GPT (Generative Pre-trained Transformer), and T5 series, describing how each family has evolved and how they differ from and relate to one another in model structure and design philosophy. It concludes with an outlook on future directions for the NLP field.