Evaluating Language-Model Agents on Realistic Autonomous Tasks
Kinniment, Megan, Sato, Lucas Jun Koba, Du, Haoxing, Goodrich, Brian, Hasin, Max, Chan, Lawrence, Miles, Luke Harold, Lin, Tao R, Wijk, Hjalmar, Burget, Joel, Ho, Aaron, Barnes, Elizabeth, Christiano, Paul
Year of Publication 18.12.2023
Journal Article
Evaluating Language-Model Agents on Realistic Autonomous Tasks
Kinniment, Megan, Sato, Lucas Jun Koba, Du, Haoxing, Goodrich, Brian, Hasin, Max, Chan, Lawrence, Miles, Luke Harold, Lin, Tao R, Wijk, Hjalmar, Burget, Joel, Ho, Aaron, Barnes, Elizabeth, Christiano, Paul
Published in arXiv.org (04.01.2024)
Paper