LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Abdulhai, Marwa, White, Isadora, Snell, Charlie, Sun, Charles, Hong, Joey, Zhai, Yuexiang, Xu, Kelvin, Levine, Sergey
Year of Publication 29.11.2023
Year of Publication 29.11.2023
Get full text
Journal Article