LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs
Schuhmann, Christoph, Vencu, Richard, Beaumont, Romain, Kaczmarczyk, Robert, Mullis, Clayton, Katta, Aarush, Coombes, Theo, Jitsev, Jenia, Komatsuzaki, Aran
Published in arXiv.org (03.11.2021)
Published in arXiv.org (03.11.2021)
Get full text
Paper
Journal Article
LAION-5B: An open large-scale dataset for training next generation image-text models
Schuhmann, Christoph, Beaumont, Romain, Vencu, Richard, Cade, Gordon, Wightman, Ross, Cherti, Mehdi, Coombes, Theo, Katta, Aarush, Mullis, Clayton, Wortsman, Mitchell, Schramowski, Patrick, Srivatsa Kundurthy, Crowson, Katherine, Schmidt, Ludwig, Kaczmarczyk, Robert, Jitsev, Jenia
Published in arXiv.org (16.10.2022)
Published in arXiv.org (16.10.2022)
Get full text
Paper
Journal Article
DataComp: In search of the next generation of multimodal datasets
Gadre, Samir Yitzhak, Ilharco, Gabriel, Fang, Alex, Hayase, Jonathan, Smyrnis, Georgios, Nguyen, Thao, Ryan, Marten, Wortsman, Mitchell, Ghosh, Dhruba, Zhang, Jieyu, Orgad, Eyal, Entezari, Rahim, Daras, Giannis, Pratt, Sarah, Ramanujan, Vivek, Bitton, Yonatan, Marathe, Kalyani, Mussmann, Stephen, Vencu, Richard, Cherti, Mehdi, Krishna, Ranjay, Pang Wei Koh, Saukh, Olga, Ratner, Alexander, Song, Shuran, Hannaneh Hajishirzi, Farhadi, Ali, Beaumont, Romain, Oh, Sewoong, Dimakis, Alex, Jitsev, Jenia, Carmon, Yair, Shankar, Vaishaal, Schmidt, Ludwig
Published in arXiv.org (20.10.2023)
Published in arXiv.org (20.10.2023)
Get full text
Paper
Journal Article