Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Kreutzer, Julia, Caswell, Isaac, Wang, Lisa, Wahab, Ahsan, van Esch, Daan, Ulzii-Orshikh, Nasanbayar, Tapo, Allahsera, Subramani, Nishant, Sokolov, Artem, Sikasote, Claytone, Setyawan, Monang, Sarin, Supheakmungkol, Samb, Sokhar, Sagot, Benoît, Rivera, Clara, Rios, Annette, Papadimitriou, Isabel, Osei, Salomey, Suarez, Pedro Ortiz, Orife, Iroro, Ogueji, Kelechi, Rubungo, Andre Niyongabo, Nguyen, Toan Q., Müller, Mathias, Müller, André, Muhammad, Shamsuddeen Hassan, Muhammad, Nanda, Mnyakeni, Ayanda, Mirzakhalov, Jamshidbek, Matangira, Tapiwanashe, Leong, Colin, Lawson, Nze, Kudugunta, Sneha, Jernite, Yacine, Jenny, Mathias, Firat, Orhan, Dossou, Bonaventure F. P., Dlamini, Sakhile, de Silva, Nisansa, Çabuk Ballı, Sakine, Biderman, Stella, Battisti, Alessia, Baruwa, Ahmed, Bapna, Ankur, Baljekar, Pallavi, Azime, Israel Abebe, Awokoya, Ayodele, Ataman, Duygu, Ahia, Orevaoghene, Ahia, Oghenefego, Agrawal, Sweta, Adeyemi, Mofetoluwa
Published in Transactions of the Association for Computational Linguistics (31.01.2022)
Published in Transactions of the Association for Computational Linguistics (31.01.2022)
Get full text
Journal Article
Light bulbs have energy ratings — so why can’t AI chatbots?
Luccioni, Sasha, Gamazaychikov, Boris, Hooker, Sara, Pierrard, Régis, Strubell, Emma, Jernite, Yacine, Wu, Carole-Jean
Published in Nature (London) (22.08.2024)
Published in Nature (London) (22.08.2024)
Get full text
Journal Article
Improving documentation of presenting problems in the emergency department using a domain-specific ontology and machine learning-driven user interfaces
Greenbaum, Nathaniel R., Jernite, Yacine, Halpern, Yoni, Calder, Shelley, Nathanson, Larry A., Sontag, David A., Horng, Steven
Published in International journal of medical informatics (Shannon, Ireland) (01.12.2019)
Published in International journal of medical informatics (Shannon, Ireland) (01.12.2019)
Get full text
Journal Article
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Akiki, Christopher, Pistilli, Giada, Mieskes, Margot, Gallé, Matthias, Wolf, Thomas, Ilić, Suzana, Jernite, Yacine
Published in Psychofenia (09.12.2022)
Get full text
Published in Psychofenia (09.12.2022)
Conference Proceeding
Documenting Geographically and Contextually Diverse Language Data Sources
McMillan-Major, Angelina, De Toni, Francesco, Alyafeai, Zaid, Biderman, Stella, Chen, Kimbo, Dupont, Gérard, Elsahar, Hady, Emezue, Chris, Aji, Alham Fikri, Ilić, Suzana, Khamis, Nurulaqilla, Leong, Colin, Masoud, Maraim, Soroa, Aitor, Ortiz Suarez, Pedro, Van Strien, Daniel, Talat, Zeerak, Jernite, Yacine
Published in Northern European Journal of Language Technology (12.09.2024)
Published in Northern European Journal of Language Technology (12.09.2024)
Get full text
Journal Article
Stable Bias: Analyzing Societal Representations in Diffusion Models
Luccioni, Alexandra Sasha, Akiki, Christopher, Mitchell, Margaret, Jernite, Yacine
Year of Publication 20.03.2023
Year of Publication 20.03.2023
Get full text
Journal Article
CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models
Pistilli, Giada, Leidinger, Alina, Jernite, Yacine, Kasirzadeh, Atoosa, Luccioni, Alexandra Sasha, Mitchell, Margaret
Year of Publication 22.05.2024
Year of Publication 22.05.2024
Get full text
Journal Article
Stronger Together: on the Articulation of Ethical Charters, Legal Tools, and Technical Documentation in ML
Pistilli, Giada, Carlos Munoz Ferrandis, Jernite, Yacine, Mitchell, Margaret
Published in arXiv.org (09.05.2023)
Published in arXiv.org (09.05.2023)
Get full text
Paper
Journal Article
Towards Openness Beyond Open Access: User Journeys through 3 Open AI Collaboratives
Ding, Jennifer, Akiki, Christopher, Jernite, Yacine, Steele, Anne Lee, Popo, Temi
Year of Publication 20.01.2023
Year of Publication 20.01.2023
Get full text
Journal Article
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Akiki, Christopher, Pistilli, Giada, Mieskes, Margot, Gallé, Matthias, Wolf, Thomas, Ilić, Suzana, Jernite, Yacine
Year of Publication 09.12.2022
Year of Publication 09.12.2022
Get full text
Journal Article
Training Transformers Together
Borzunov, Alexander, Ryabinin, Max, Dettmers, Tim, Lhoest, Quentin, Saulnier, Lucile, Diskin, Michael, Jernite, Yacine, Wolf, Thomas
Year of Publication 07.07.2022
Year of Publication 07.07.2022
Get full text
Journal Article
On the Standardization of Behavioral Use Clauses and Their Adoption for Responsible Licensing of AI
McDuff, Daniel, Korjakow, Tim, Cambo, Scott, Benjamin, Jesse Josua, Lee, Jenny, Jernite, Yacine, Ferrandis, Carlos Muñoz, Gokaslan, Aaron, Tarkowski, Alek, Lindley, Joseph, Cooper, A. Feder, Contractor, Danish
Year of Publication 07.02.2024
Year of Publication 07.02.2024
Get full text
Journal Article
The ROOTS Search Tool: Data Transparency for LLMs
Piktus, Aleksandra, Akiki, Christopher, Villegas, Paulo, Laurençon, Hugo, Dupont, Gérard, Luccioni, Alexandra Sasha, Jernite, Yacine, Rogers, Anna
Year of Publication 27.02.2023
Year of Publication 27.02.2023
Get full text
Journal Article