A benchmark dataset for evaluating large language models in Traditional Chinese Medicine (TCM). This benchmark integrates real-world clinical case records, the latest national TCM examination questions, and classical TCM literature.