Adapting Qwen2.5-14B to Brazilian SUS clinical guidelines. Includes 2 open benchmarks (HealthBench-BR, PCDT-QA) and 8 model checkpoints from the paper's ablations.
Updated May 7, 2026 - Python
Data Preparation for Large Language Models: a curated companion to our JCST 2026 survey. Covers Pre-training, Continual Pre-training, and Post-training (SFT/RLHF/RLAIF) across collection, filtering, dedup, generation, and evaluation.