## Code Implementation

Our implementation is built upon the **SWIFT framework (v3.3.0.post1)**. The following is a list of core codes for our proposed Multimodal Medi-
cal Reasoning-with-Retrieval framework, Med-RwR.

### Core Scripts

*   `train.sh`: The bash script for launching the multimodal training stage. Detailed configurations can be found here.
*   `infer.sh`: The bash script for inference.

### Key Modules

*   `swift/trainers/rlhf_trainer/grpo_trainer_retrieve.py`: Implementation for **GRPO training**.
*   `swift/llm/infer/infer_engine/pt_engine_retrieve_train.py`: Implementation for **rollout generation** during training.
*   `examples/train/grpo/plugin/plugin.py`: Design of **Reward function**.
*   `swift/llm/infer/infer_engine/pt_engine_retrieve_infer.py`: Primary engine for **inference process**.
*   `swift/llm/infer/infer_engine/pt_engine_retrieve_infer_img.py`: Implementation for the proposed **Confidence-Driven Image Re-retrieval (CDIR)** mechanism.