This paper proposes a principled research investigation on exploiting the rich agreement information among multiple raters for improving the calibrated performance.
Wei Ji, Shuang Yu, Junde Wu, Kai Ma, Cheng Bian, Qi Bi, Jingjing Li, Hanruo Liu, Li Cheng, Yefeng Zheng