Pareto Improvement
COPSD moves up and to the right: it improves on both axes at once.
Safer distillation. Less safety tax.
Cross-SFT first calibrates the teacher. Constitution-conditioned on-policy safe distillation (OPSD) then distills safer behavior into the student without collapsing expressiveness.
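To make "up and right" concrete, here is a minimal sketch of a Pareto-dominance check between two models. It assumes each model is scored on two higher-is-better axes; the axis names `safety` and `helpfulness`, the `Point` type, and the numbers are illustrative assumptions, not values or metrics from the paper.

```python
from typing import NamedTuple


class Point(NamedTuple):
    """One model's position on a two-axis plot (both axes assumed higher-is-better)."""
    safety: float       # hypothetical axis name
    helpfulness: float  # hypothetical axis name


def pareto_dominates(a: Point, b: Point) -> bool:
    """Return True if `a` is no worse than `b` on both axes and strictly better on at least one."""
    no_worse = a.safety >= b.safety and a.helpfulness >= b.helpfulness
    strictly_better = a.safety > b.safety or a.helpfulness > b.helpfulness
    return no_worse and strictly_better


# Made-up illustrative scores, not results from the paper.
baseline = Point(safety=0.70, helpfulness=0.60)
candidate = Point(safety=0.80, helpfulness=0.65)
tradeoff = Point(safety=0.90, helpfulness=0.50)

print(pareto_dominates(candidate, baseline))  # candidate is up and to the right
print(pareto_dominates(tradeoff, baseline))   # safer but less helpful: a trade-off
```

A model that gains safety only by giving up helpfulness, like `tradeoff` above, does not dominate the baseline. That loss is the "safety tax" a Pareto improvement avoids.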
@article{wen2026copsd,
  title   = {Constitutional On-Policy Safe Distillation},
  author  = {Ming Wen and Yuxuan Liu and Kun Yang and Yunhao Feng and Zhuoer Xu and Yuhao Sun and Shiwen Cui and Xiang Zheng and Xingjun Ma and Yu-Gang Jiang},
  journal = {arXiv preprint arXiv:2606.03089},
  year    = {2026},
  url     = {https://arxiv.org/abs/2606.03089}
}