This repository contains the implementation code for DxEvolve.
To ensure correct data loading, please place the codebase and the dataset directories in the same parent folder:
.
├── DxEvolve # Source code directory
└── MIMIC-Clinical-Decision-Making-Dataset # Dataset directory
Important: The code uses relative paths to access clinical data. Maintaining this parallel structure is mandatory.
The experiments are conducted using the MIMIC-Clinical-Decision-Making-Dataset.
- Access & Authorization: Please refer to the MIMIC-Clinical-Decision-Making-Dataset GitHub repository for detailed instructions on obtaining authorization and downloading the data.
- Python: 3.11
- Environment Manager: Conda (Recommended)
- Create the environment:
conda create -n dxevolve python=3.11
conda activate dxevolve- Install requirements:
pip install -r requirements.txtRun the main process using run.py.
python run.py \
--model "$MODEL_NAME" \
--test_num 400-
--model: The identifier for the model. Available options are:DeepSeek-V3.2GLM-4.7Qwen3-30B-A3B-Instruct-2507Qwen3-235B-A22B-Instruct-2507
-
--test_num:400: Runs evaluation on a standard sample of 400 cases.-1: Doctor Labeled Subset.
- Logs: Results, including agent trajectories and intermediate steps, are saved as
.pklfiles in thenew_logs/directory. - Evaluation: Use the evaluation scripts to parse these
.pklfiles for metrics.