Improving Large-scale Language Models and Resources for Filipino
Paper
Journal Article
Establishing Baselines for Text Classification in Low-Resource Languages
Paper
Journal Article
Multilingual Large Language Models Are Not (Yet) Code-Switchers
Zhang, Ruochen, Cahyawijaya, Samuel, Cruz, Jan Christian Blaise, Genta Indra Winata, Aji, Alham Fikri
Published in arXiv.org (23.10.2023)
Paper
Journal Article
Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings
Velasco, Dan John, Alba, Axel, Pelagio, Trisha Gail, Ramirez, Bryce Anthony, Chua, Unisse, Samson, Briane Paul, Cruz, Jan Christian Blaise, Cheng, Charibeth
Published in arXiv.org (19.10.2023)
Paper
Journal Article
Simplifying Paragraph-level Question Generation via Transformer Language Models
Lopez, Luis Enrico, Cruz, Diane Kathryn, Cruz, Jan Christian Blaise, Cheng, Charibeth
Published in arXiv.org (13.08.2021)
Paper
Journal Article
Evaluating Language Model Finetuning Techniques for Low-resource Languages
Paper
Journal Article
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets
Cruz, Jan Christian Blaise, Resabal, Jose Kristian, Lin, James, Velasco, Dan John, Cheng, Charibeth
Published in arXiv.org (13.08.2021)
Paper
Journal Article
Using Synthetic Data for Conversational Response Generation in Low-resource Settings
Tan, Gabriel Louis, Adrian Paule Ty, Ng, Schuyler, Denzel Adrian Co, Cruz, Jan Christian Blaise, Cheng, Charibeth
Published in arXiv.org (06.04.2022)
Paper
Journal Article
Localization of Fake News Detection via Multitask Transfer Learning
Cruz, Jan Christian Blaise, Tan, Julianne Agatha, Cheng, Charibeth
Published in arXiv.org (15.05.2020)
Paper
Journal Article
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
Yong, Zheng-Xin, Zhang, Ruochen, Forde, Jessica Zosa, Wang, Skyler, Subramonian, Arjun, Holy Lovenia, Cahyawijaya, Samuel, Genta Indra Winata, Sutawika, Lintang, Cruz, Jan Christian Blaise, Yin Lin Tan, Phan, Long, Garcia, Rowena, Solorio, Thamar, Aji, Alham Fikri
Published in arXiv.org (12.09.2023)
Paper
Journal Article
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Miranda, Lester James V, Santoso, Jennifer, Aco, Elyanah, Akhdan Fadhilah, Mansurov, Jonibek, Imperial, Joseph Marvin, Kampman, Onno P, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Hudi, Frederikus, Railey Montalan, Ryan, Ignatius, Joanito Agili Lopo, Nixon, William, Karlsson, Börje F, Jaya, James, Diandaru, Ryandito, Gao, Yuze, Amadeus, Patrick, Wang, Bin, Cruz, Jan Christian Blaise, Whitehouse, Chenxi, Ivan Halim Parmonangan, Khelli, Maria, Zhang, Wenyu, Susanto, Lucky, Reynard Adha Ryanda, Hermawan, Sonny Lazuardi, Velasco, Dan John, Muhammad Dehan Al Kautsar, Hendria, Willy Fitra, Moslem, Yasmin, Flynn, Noah, Muhammad Farid Adilazuarda, Li, Haochen, Lee, Johanes, Damanhuri, R, Sun, Shuo, Qorib, Muhammad Reza, Djanibekov, Amirbek, Wei Qi Leong, Do, Quyet V, Muennighoff, Niklas, Pansuwan, Tanrada, Putra, Ilham Firdausi, Xu, Yan, Ngee Chia Tai, Purwarianti, Ayu, Ruder, Sebastian, Tjhi, William, Limkonchotiwat, Peerat, Aji, Alham Fikri, Keh, Sedrick, Genta Indra Winata, Zhang, Ruochen, Koto, Fajri, Yong, Zheng-Xin, Cahyawijaya, Samuel
Published in arXiv.org (08.07.2024)
Paper
Journal Article
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Romero, David, Lyu, Chenyang, Wibowo, Haryo Akbarianto, Lynn, Teresa, Hamed, Injy, Aditya Nanda Kishore, Mandal, Aishik, Dragonetti, Alina, Abzaliev, Artem, Tonja, Atnafu Lambebo, Bontu Fufa Balcha, Whitehouse, Chenxi, Salamea, Christian, Velasco, Dan John, Adelani, David Ifeoluwa, David Le Meur, Villa-Cueva, Emilio, Koto, Fajri, Farooqui, Fauzan, Belcavello, Frederico, Batnasan, Ganzorig, Vallejo, Gisela, Caulfield, Grainne, Ivetta, Guido, Song, Haiyue, Ademtew, Henok Biadglign, Maina, Hernán, Holy Lovenia, Israel Abebe Azime, Cruz, Jan Christian Blaise, Gala, Jay, Geng, Jiahui, Jesus-German Ortiz-Barajas, Baek, Jinheon, Dunstan, Jocelyn, Laura Alonso Alemany, Kumaranage Ravindu Yasas Nagasinghe, Benotti, Luciana, D'Haro, Luis Fernando, Viridiano, Marcelo, Estecha-Garitagoitia, Marcos, Maria Camila Buitrago Cabrera, Rodríguez-Cantelar, Mario, Jouitteau, Mélanie, Mihaylov, Mihail, Mohamed Fazli Mohamed Imam, Muhammad Farid Adilazuarda, Gochoo, Munkhjargal, Otgonbold, Munkh-Erdene, Etori, Naome, Niyomugisha, Olivier, Paula Mónica Silva, Chitale, Pranjal, Dabre, Raj, Rendi Chevi, Zhang, Ruochen, Diandaru, Ryandito, Cahyawijaya, Samuel, Góngora, Santiago, Jeong, Soyeong, Purkayastha, Sukannya, Kuribayashi, Tatsuki, Jayakumar, Thanmay, Torrent, Tiago Timponi, Toqeer Ehsan, Araujo, Vladimir, Kementchedjhieva, Yova, Burzo, Zara, Zheng Wei Lim, Yong, Zheng Xin, Ignat, Oana, Nwatu, Joan, Mihalcea, Rada, Solorio, Thamar, Aji, Alham Fikri
Published in arXiv.org (10.06.2024)
Paper
Journal Article