Offline RL for Natural Language Generation with Implicit Language Q Learning
Snell, Charlie, Kostrikov, Ilya, Su, Yi, Yang, Mengjiao, Levine, Sergey
Year of Publication 05.06.2022
Journal Article
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Abdulhai, Marwa, White, Isadora, Snell, Charlie, Sun, Charles, Hong, Joey, Zhai, Yuexiang, Xu, Kelvin, Levine, Sergey
Year of Publication 29.11.2023
Journal Article
The False Promise of Imitating Proprietary LLMs
Gudibande, Arnav, Wallace, Eric, Snell, Charlie, Geng, Xinyang, Liu, Hao, Abbeel, Pieter, Levine, Sergey, Song, Dawn
Year of Publication 25.05.2023
Journal Article
Offline RL for Natural Language Generation with Implicit Language Q Learning
Snell, Charlie, Kostrikov, Ilya, Su, Yi, Yang, Mengjiao, Levine, Sergey
Published in arXiv.org (01.05.2023)
Paper
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Snell, Charlie, Yang, Mengjiao, Fu, Justin, Su, Yi, Levine, Sergey
Published in arXiv.org (22.04.2022)
Paper
The False Promise of Imitating Proprietary LLMs
Gudibande, Arnav, Wallace, Eric, Snell, Charlie, Geng, Xinyang, Liu, Hao, Abbeel, Pieter, Levine, Sergey, Song, Dawn
Published in arXiv.org (25.05.2023)
Paper