Electronic device and operating method for obtaining a sentence corresponding to context information
CHOI YOON JUNG, WELLECK SEAN, KULIKOV LLIA, KIM JAE DEOK, PANG YUANZHE, CHO KYUNG HYUN
Year of Publication 09.08.2021
Get full text
Year of Publication 09.08.2021
Patent
Artificial Intelligence Security Competition (AISC)
Dong, Yinpeng, Chen, Peng, Deng, Senyou, L, Lianji, Sun, Yi, Zhao, Hanyu, Li, Jiaxing, Tan, Yunteng, Liu, Xinyu, Dong, Yangyi, Xu, Enhui, Xu, Jincai, Xu, Shu, Fu, Xuelin, Sun, Changfeng, Han, Haoliang, Zhang, Xuchong, Chen, Shen, Sun, Zhimin, Cao, Junyi, Yao, Taiping, Ding, Shouhong, Wu, Yu, Lin, Jian, Wu, Tianpeng, Wang, Ye, Fu, Yu, Feng, Lin, Gao, Kangkang, Liu, Zeyu, Pang, Yuanzhe, Duan, Chengqi, Zhou, Huipeng, Wang, Yajie, Zhao, Yuhang, Wu, Shangbo, Lyu, Haoran, Lin, Zhiyu, Gao, Yifei, Li, Shuang, Wang, Haonan, Sang, Jitao, Ma, Chen, Zheng, Junhao, Li, Yijia, Shen, Chao, Lin, Chenhao, Cui, Zhichao, Liu, Guoshuai, Shi, Huafeng, Hu, Kun, Zhang, Mengxin
Year of Publication 06.12.2022
Year of Publication 06.12.2022
Get full text
Journal Article
Iterative Reasoning Preference Optimization
Pang, Richard Yuanzhe, Yuan, Weizhe, Cho, Kyunghyun, He, He, Sukhbaatar, Sainbayar, Weston, Jason
Year of Publication 30.04.2024
Year of Publication 30.04.2024
Get full text
Journal Article
Artificial Intelligence Security Competition (AISC)
Dong, Yinpeng, Chen, Peng, Deng, Senyou, Lianji, L, Sun, Yi, Zhao, Hanyu, Li, Jiaxing, Tan, Yunteng, Liu, Xinyu, Dong, Yangyi, Xu, Enhui, Xu, Jincai, Xu, Shu, Fu, Xuelin, Sun, Changfeng, Han, Haoliang, Zhang, Xuchong, Chen, Shen, Sun, Zhimin, Cao, Junyi, Yao, Taiping, Ding, Shouhong, Wu, Yu, Lin, Jian, Wu, Tianpeng, Wang, Ye, Fu, Yu, Lin, Feng, Gao, Kangkang, Liu, Zeyu, Pang, Yuanzhe, Duan, Chengqi, Zhou, Huipeng, Wang, Yajie, Zhao, Yuhang, Wu, Shangbo, Lyu, Haoran, Lin, Zhiyu, Gao, Yifei, Li, Shuang, Wang, Haonan, Jitao Sang, Chen, Ma, Zheng, Junhao, Li, Yijia, Shen, Chao, Lin, Chenhao, Cui, Zhichao, Liu, Guoshuai, Shi, Huafeng, Hu, Kun, Zhang, Mengxin
Published in arXiv.org (07.12.2022)
Get full text
Published in arXiv.org (07.12.2022)
Paper
Self-Rewarding Language Models
Yuan, Weizhe, Pang, Richard Yuanzhe, Cho, Kyunghyun, Li, Xian, Sukhbaatar, Sainbayar, Xu, Jing, Weston, Jason
Year of Publication 18.01.2024
Year of Publication 18.01.2024
Get full text
Journal Article
Leveraging Implicit Feedback from Deployment Data in Dialogue
Pang, Richard Yuanzhe, Roller, Stephen, Cho, Kyunghyun, He, He, Weston, Jason
Year of Publication 26.07.2023
Year of Publication 26.07.2023
Get full text
Journal Article
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples
Saparov, Abulhair, Pang, Richard Yuanzhe, Padmakumar, Vishakh, Joshi, Nitish, Kazemi, Seyed Mehran, Kim, Najoung, He, He
Year of Publication 24.05.2023
Year of Publication 24.05.2023
Get full text
Journal Article
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Rein, David, Hou, Betty Li, Stickland, Asa Cooper, Petty, Jackson, Pang, Richard Yuanzhe, Dirani, Julien, Michael, Julian, Bowman, Samuel R
Year of Publication 20.11.2023
Year of Publication 20.11.2023
Get full text
Journal Article
Reward Gaming in Conditional Text Generation
Pang, Richard Yuanzhe, Padmakumar, Vishakh, Sellam, Thibault, Parikh, Ankur P, He, He
Year of Publication 16.11.2022
Year of Publication 16.11.2022
Get full text
Journal Article
Self-Taught Evaluators
Wang, Tianlu, Kulikov, Ilia, Golovneva, Olga, Yu, Ping, Yuan, Weizhe, Dwivedi-Yu, Jane, Pang, Richard Yuanzhe, Fazel-Zarandi, Maryam, Weston, Jason, Li, Xian
Year of Publication 05.08.2024
Year of Publication 05.08.2024
Get full text
Journal Article
SQuALITY: Building a Long-Document Summarization Dataset the Hard Way
Wang, Alex, Pang, Richard Yuanzhe, Chen, Angelica, Phang, Jason, Bowman, Samuel R
Year of Publication 23.05.2022
Year of Publication 23.05.2022
Get full text
Journal Article