Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation
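AI summary: Proposes an AI-driven workflow in which a detailed constitutional definition is written for each category and interpreted by frontier LLMs, producing gold labels more consistently and accurately than human annotators and reducing cross-model disagreement by up to 57x across three content moderation categories.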
Comments: Under review at ACL Rolling Review (ARR), May 2026 cycle. Also available at https://doi.org/10.5281/zenodo.20125267