Tailoring Self-Rationalizers with Multi-Reward Distillation
Ramnath, Sahana, Joshi, Brihi, Hallinan, Skyler, Lu, Ximing, Li, Liunian Harold, Chan, Aaron, Hessel, Jack, Choi, Yejin, Ren, Xiang
Published in arXiv.org (22.05.2024)
Get full text
Published in arXiv.org (22.05.2024)
Paper
SYSTEM UND VERFAHREN ZUM BEWERTEN EINES FAHRSTILS
Klesing, Joachim J, Llaneras, Robert E, Longuemare, Pierre C, Li, Harold, Ryne, Patrik M, Greb, Michelle, Rezaeian, Ayyoub, Story, Michael R
Year of Publication 06.06.2019
Get full text
Year of Publication 06.06.2019
Patent
On the Paradox of Learning to Reason from Data
Zhang, Honghua, Li, Liunian Harold, Meng, Tao, Kai-Wei, Chang, Van den Broeck, Guy
Published in arXiv.org (24.05.2022)
Get full text
Published in arXiv.org (24.05.2022)
Paper