DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities
Li, Wenqiang; Yu, Lina; Wu, Min; Liu, Jingyi; Hao, Meilan; Li, Yanjie Source: 2023 International Conference on High Performance Big Data and Intelligent Systems, HDIS 2023, p 186-193, 2023, 2023 International Conference on High Performance Big Data and Intelligent Systems, HDIS 2023;
Abstract:
Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answering (Q&A) tasks.
©2023 IEEE. (25 refs.)