Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data
Zhang, Jingyu, Marone, Marc, Li, Tianjian, Van Durme, Benjamin, Khashabi, Daniel
Year of Publication 04.04.2024
Year of Publication 04.04.2024
Get full text
Journal Article
Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Cheng, Jeffrey, Marone, Marc, Weller, Orion, Lawrie, Dawn, Khashabi, Daniel, Van Durme, Benjamin
Year of Publication 19.03.2024
Year of Publication 19.03.2024
Get full text
Journal Article
"According to ...": Prompting Language Models Improves Quoting from Pre-Training Data
Weller, Orion, Marone, Marc, Weir, Nathaniel, Lawrie, Dawn, Khashabi, Daniel, Van Durme, Benjamin
Year of Publication 22.05.2023
Year of Publication 22.05.2023
Get full text
Journal Article
Pretrained Models for Multilingual Federated Learning
Weller, Orion, Marone, Marc, Braverman, Vladimir, Lawrie, Dawn, Van Durme, Benjamin
Year of Publication 05.06.2022
Year of Publication 05.06.2022
Get full text
Journal Article
Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Cheng, Jeffrey, Marone, Marc, Weller, Orion, Lawrie, Dawn, Khashabi, Daniel, Benjamin Van Durme
Published in arXiv.org (17.09.2024)
Get full text
Published in arXiv.org (17.09.2024)
Paper
StarCoder 2 and The Stack v2: The Next Generation
Lozhkov, Anton, Li, Raymond, Allal, Loubna Ben, Cassano, Federico, Lamy-Poirier, Joel, Tazi, Nouamane, Tang, Ao, Pykhtar, Dmytro, Liu, Jiawei, Wei, Yuxiang, Liu, Tianyang, Tian, Max, Kocetkov, Denis, Zucker, Arthur, Belkada, Younes, Wang, Zijian, Liu, Qian, Abulkhanov, Dmitry, Paul, Indraneil, Li, Zhuang, Li, Wen-Ding, Risdal, Megan, Li, Jia, Zhu, Jian, Zhuo, Terry Yue, Zheltonozhskii, Evgenii, Dade, Nii Osae Osae, Yu, Wenhao, Krauß, Lucas, Jain, Naman, Su, Yixuan, He, Xuanli, Dey, Manan, Abati, Edoardo, Chai, Yekun, Muennighoff, Niklas, Tang, Xiangru, Oblokulov, Muhtasham, Akiki, Christopher, Marone, Marc, Mou, Chenghao, Mishra, Mayank, Gu, Alex, Hui, Binyuan, Dao, Tri, Zebaze, Armel, Dehaene, Olivier, Patry, Nicolas, Xu, Canwen, McAuley, Julian, Hu, Han, Scholak, Torsten, Paquet, Sebastien, Robinson, Jennifer, Anderson, Carolyn Jane, Chapados, Nicolas, Patwary, Mostofa, Tajbakhsh, Nima, Jernite, Yacine, Ferrandis, Carlos Muñoz, Zhang, Lingming, Hughes, Sean, Wolf, Thomas, Guha, Arjun, von Werra, Leandro, de Vries, Harm
Year of Publication 29.02.2024
Year of Publication 29.02.2024
Get full text
Journal Article
Pretrained Models for Multilingual Federated Learning
Weller, Orion, Marone, Marc, Braverman, Vladimir, Lawrie, Dawn, Benjamin Van Durme
Published in arXiv.org (06.06.2022)
Get full text
Published in arXiv.org (06.06.2022)
Paper
StarCoder: may the source be with you
Li, Raymond, Allal, Loubna Ben, Zi, Yangtian, Muennighoff, Niklas, Kocetkov, Denis, Mou, Chenghao, Marone, Marc, Akiki, Christopher, Li, Jia, Chim, Jenny, Liu, Qian, Zheltonozhskii, Evgenii, Zhuo, Terry Yue, Wang, Thomas, Dehaene, Olivier, Davaadorj, Mishig, Lamy-Poirier, Joel, Monteiro, João, Shliazhko, Oleh, Gontier, Nicolas, Meade, Nicholas, Zebaze, Armel, Yee, Ming-Ho, Umapathi, Logesh Kumar, Zhu, Jian, Lipkin, Benjamin, Oblokulov, Muhtasham, Wang, Zhiruo, Murthy, Rudra, Stillerman, Jason, Patel, Siva Sankalp, Abulkhanov, Dmitry, Zocca, Marco, Dey, Manan, Zhang, Zhihan, Fahmy, Nour, Bhattacharyya, Urvashi, Yu, Wenhao, Singh, Swayam, Luccioni, Sasha, Villegas, Paulo, Kunakov, Maxim, Zhdanov, Fedor, Romero, Manuel, Lee, Tony, Timor, Nadav, Ding, Jennifer, Schlesinger, Claire, Schoelkopf, Hailey, Ebert, Jan, Dao, Tri, Mishra, Mayank, Gu, Alex, Robinson, Jennifer, Anderson, Carolyn Jane, Dolan-Gavitt, Brendan, Contractor, Danish, Reddy, Siva, Fried, Daniel, Bahdanau, Dzmitry, Jernite, Yacine, Ferrandis, Carlos Muñoz, Hughes, Sean, Wolf, Thomas, Guha, Arjun, von Werra, Leandro, de Vries, Harm
Year of Publication 09.05.2023
Year of Publication 09.05.2023
Get full text
Journal Article
StarCoder 2 and The Stack v2: The Next Generation
Lozhkov, Anton, Li, Raymond, Loubna Ben Allal, Cassano, Federico, Lamy-Poirier, Joel, Tazi, Nouamane, Tang, Ao, Pykhtar, Dmytro, Liu, Jiawei, Wei, Yuxiang, Liu, Tianyang, Tian, Max, Kocetkov, Denis, Zucker, Arthur, Younes Belkada, Wang, Zijian, Liu, Qian, Abulkhanov, Dmitry, Indraneil, Paul, Zhuang, Li, Wen-Ding, Li, Risdal, Megan, Li, Jia, Zhu, Jian, Terry Yue Zhuo, Zheltonozhskii, Evgenii, Nii Osae Osae Dade, Yu, Wenhao, Krauß, Lucas, Jain, Naman, Su, Yixuan, He, Xuanli, Dey, Manan, Abati, Edoardo, Chai, Yekun, Muennighoff, Niklas, Tang, Xiangru, Oblokulov, Muhtasham, Akiki, Christopher, Marone, Marc, Mou, Chenghao, Mishra, Mayank, Gu, Alex, Binyuan Hui, Dao, Tri, Zebaze, Armel, Dehaene, Olivier, Patry, Nicolas, Xu, Canwen, McAuley, Julian, Hu, Han, Scholak, Torsten, Paquet, Sebastien, Robinson, Jennifer, Anderson, Carolyn Jane, Chapados, Nicolas, Patwary, Mostofa, Tajbakhsh, Nima, Jernite, Yacine, Carlos Muñoz Ferrandis, Zhang, Lingming, Hughes, Sean, Wolf, Thomas, Guha, Arjun, Leandro von Werra, de Vries, Harm
Published in arXiv.org (29.02.2024)
Get full text
Published in arXiv.org (29.02.2024)
Paper
StarCoder: may the source be with you
Li, Raymond, Loubna Ben Allal, Yangtian Zi, Muennighoff, Niklas, Kocetkov, Denis, Mou, Chenghao, Marone, Marc, Akiki, Christopher, Li, Jia, Chim, Jenny, Liu, Qian, Zheltonozhskii, Evgenii, Terry Yue Zhuo, Wang, Thomas, Dehaene, Olivier, Davaadorj, Mishig, Lamy-Poirier, Joel, Monteiro, João, Shliazhko, Oleh, Gontier, Nicolas, Meade, Nicholas, Zebaze, Armel, Ming-Ho, Yee, Umapathi, Logesh Kumar, Zhu, Jian, Lipkin, Benjamin, Oblokulov, Muhtasham, Wang, Zhiruo, Murthy, Rudra, Stillerman, Jason, Patel, Siva Sankalp, Abulkhanov, Dmitry, Zocca, Marco, Dey, Manan, Zhang, Zhihan, Fahmy, Nour, Bhattacharyya, Urvashi, Yu, Wenhao, Singh, Swayam, Luccioni, Sasha, Villegas, Paulo, Kunakov, Maxim, Zhdanov, Fedor, Romero, Manuel, Lee, Tony, Timor, Nadav, Ding, Jennifer, Schlesinger, Claire, Hailey Schoelkopf, Ebert, Jan, Dao, Tri, Mishra, Mayank, Gu, Alex, Robinson, Jennifer, Anderson, Carolyn Jane, Dolan-Gavitt, Brendan, Contractor, Danish, Reddy, Siva, Fried, Daniel, Bahdanau, Dzmitry, Jernite, Yacine, Carlos Muñoz Ferrandis, Hughes, Sean, Wolf, Thomas, Guha, Arjun, Leandro von Werra, de Vries, Harm
Published in arXiv.org (13.12.2023)
Get full text
Published in arXiv.org (13.12.2023)
Paper