医疗垂域大模型技术与系统

方越; 丁泓馨; 章锦阳; 蒋新科; 王亚沙

doi:10.11991/cccf.202605003

医疗垂域大模型技术与系统

Medical Domain-Specific Large Language Model Technologies and Systems

摘要

摘要: 近年来大语言模型发展迅速，通用大模型规模和能力快速提升，但是由于对医疗专业知识和医疗场景中必备的能力优化不充分，导致通用大模型直接应用于医疗场景时效果不佳。在此背景下，基于通用大模型精调的医疗垂域大模型成为可能的解决方案。本文首先从训练数据质控、医学知识优化、场景能力提升和复杂任务调度等4个方面，阐述了医疗垂域大模型需要解决的技术问题和国内外研究现状，然后给出了北京大学“小北健康”医疗垂域大模型在以上4个方面的技术方案，最后介绍了“小北健康”医疗垂域大模型的整体性能评测和应用情况，实验结果表明，“小北健康”模型在CMB医学评测基准上的平均准确率达到87.95%，显著优于现有代表性模型，验证了所提出技术体系的有效性与实用价值。

Abstract: In recent years, large language models have advanced rapidly, with general-purpose models achieving substantial improvements in both scale and capability. However, due to insufficient optimization for medical expertise and the essential competencies required in clinical scenarios, general-purpose large language models often perform poorly when directly applied to the medical domain. Against this backdrop, medical domain-specific large language models fine-tuned from general-purpose foundation models have emerged as a promising solution. This article first discusses the key technical challenges and reviews the related research progress in China and abroad from four perspectives: training data quality control, medical knowledge optimization, scenario-specific capability enhancement, and complex task orchestration. It then presents the technical solutions adopted by Peking University’s “Xiaobei Health” medical domain-specific large language model in these four aspects. Finally, the article introduces the overall performance evaluation and practical applications of the “Xiaobei Health” model. Experimental results on the Chinese medical benchmark (CMB) show that the “Xiaobei Health” model achieves an average accuracy of 87.95%, significantly outperforming existing representative models, demonstrating the effectiveness and practical value of the proposed technical framework.

参考文献(15)

施引文献

资源附件(0)