Building Text and Speech Benchmark Datasets and Models for Low‐Resourced East African Languages: Experiences and Lessons
Nakatumba‐Nabende, Joyce, Babirye, Claire, Nabende, Peter, Tusubira, Jeremy Francis, Mukiibi, Jonathan, Wairagala, Eric Peter, Mutebi, Chodrine, Bateesa, Tobius Saul, Nahabwe, Alvin, Tusiime, Hewitt, Katumba, Andrew
Published in Applied AI Letters (01.04.2024)
Journal Article
MasakhaNER: Named entity recognition for African languages
Adelani, David Ifeoluwa, Abbott, Jade, Neubig, Graham, d'Souza, Daniel, Kreutzer, Julia, Lignos, Constantine, Palen-Michel, Chester, Buzaaba, Happy, Rijhwani, Shruti, Ruder, Sebastian, Mayhew, Stephen, Abebe Azime, Israel, Muhammad, Shamsuddeen H, Chinenye Emezue, Chris, Nakatumba-Nabende, Joyce, Ogayo, Perez, Aremu, Anuoluwapo, Gitau, Catherine, Mbaye, Derguene, Alabi, Jesujoba, Yimam, Seid Muhie, Rabiu Gwadabe, Tajuddeen, Ezeani, Ignatius, Niyongabo, Rubungo Andre, Mukiibi, Jonathan, Otiende, Verrah, Orife, Iroro, David, Davis, Ngom, Samba, Adewumi, Tosin, Rayson, Paul, Adeyemi, Mofetoluwa, Muriuki, Gerald, Anebi, Emmanuel, Chukwuneke, Chiamaka, Odu, Nkiruka, Wairagala, Eric Peter, Oyerinde, Samuel, Siro, Clemencia, Saul Bateesa, Tobius, Oloyede, Temilola, Wambui, Yvonne, Akinode, Victor, Nabagereka, Deborah, Katusiime, Maurice, Awokoya, Ayodele, Mboup, Mouhamadane, Gebreyohannes, Dibora, Tilaye, Henok, Nwaike, Kelechi, Wolde, Degaga, Faye, Abdoulaye, Sibanda, Blessing, Ahia, Orevaoghene, Dossou, Bonaventure F P, Ogueji, Kelechi, Thierno, Ibrahima, Diallo, Abdoulaye, Akinfaderin, Adewale, Marengereke, Tendai, Osei, Salomey
Published in Transactions of the Association for Computational Linguistics (14.06.2021)
Journal Article
The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition
Mukiibi, Jonathan, Katumba, Andrew, Nakatumba-Nabende, Joyce, Hussein, Ali, Meyer, Josh
Year of Publication 20.06.2022
Journal Article
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
Adelani, David Ifeoluwa, Ojo, Jessica, Azime, Israel Abebe, Zhuang, Jian Yun, Alabi, Jesujoba O, He, Xuanli, Ochieng, Millicent, Hooker, Sara, Bukula, Andiswa, Lee, En-Shiun Annie, Chukwuneke, Chiamaka, Buzaaba, Happy, Sibanda, Blessing, Kalipe, Godson, Mukiibi, Jonathan, Kabongo, Salomon, Yuehgoh, Foutse, Setaka, Mmasibidi, Ndolela, Lolwethu, Odu, Nkiruka, Mabuya, Rooweither, Muhammad, Shamsuddeen Hassan, Osei, Salomey, Samb, Sokhar, Guge, Tadesse Kebede, Stenetorp, Pontus
Year of Publication 05.06.2024
Journal Article
BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Meyer, Josh, Adelani, David Ifeoluwa, Casanova, Edresson, Öktem, Alp, Whitenack, Daniel, Weber, Julian, Kabongo, Salomon, Salesky, Elizabeth, Orife, Iroro, Leong, Colin, Ogayo, Perez, Emezue, Chris, Mukiibi, Jonathan, Osei, Salomey, Agbolo, Apelete, Akinode, Victor, Opoku, Bernard, Olanrewaju, Samuel, Alabi, Jesujoba, Muhammad, Shamsuddeen
Year of Publication 07.07.2022
Journal Article
Keyword Spotter Model for Crop Pest and Disease Monitoring from Community Radio Data
Akera, Benjamin, Nakatumba-Nabende, Joyce, Mukiibi, Jonathan, Hussein, Ali, Baleeta, Nathan, Ssendiwala, Daniel, Nalwooga, Samiiha
Year of Publication 05.10.2019
Journal Article
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages
Dione, Cheikh M. Bamba, Adelani, David, Nabende, Peter, Alabi, Jesujoba, Sindane, Thapelo, Buzaaba, Happy, Muhammad, Shamsuddeen Hassan, Emezue, Chris Chinenye, Ogayo, Perez, Aremu, Anuoluwapo, Gitau, Catherine, Mbaye, Derguene, Mukiibi, Jonathan, Sibanda, Blessing, Dossou, Bonaventure F. P, Bukula, Andiswa, Mabuya, Rooweither, Tapo, Allahsera Auguste, Munkoh-Buabeng, Edwin, Koagne, victoire Memdjokam, Kabore, Fatoumata Ouoba, Taylor, Amelia, Kalipe, Godson, Macucwa, Tebogo, Marivate, Vukosi, Gwadabe, Tajuddeen, Elvis, Mboning Tchiaze, Onyenwe, Ikechukwu, Atindogbe, Gratien, Adelani, Tolulope, Akinade, Idris, Samuel, Olanrewaju, Nahimana, Marien, Musabeyezu, Théogène, Niyomutabazi, Emile, Chimhenga, Ester, Gotosa, Kudzai, Mizha, Patrick, Agbolo, Apelete, Traore, Seydou, Uchechukwu, Chinedu, Yusuf, Aliyu, Abdullahi, Muhammad, Klakow, Dietrich
Year of Publication 23.05.2023
Journal Article
MasakhaNEWS: News Topic Classification for African languages
Adelani, David Ifeoluwa, Masiak, Marek, Azime, Israel Abebe, Alabi, Jesujoba, Tonja, Atnafu Lambebo, Mwase, Christine, Ogundepo, Odunayo, Dossou, Bonaventure F. P, Oladipo, Akintunde, Nixdorf, Doreen, Emezue, Chris Chinenye, al-azzawi, sana, Sibanda, Blessing, David, Davis, Ndolela, Lolwethu, Mukiibi, Jonathan, Ajayi, Tunde, Moteu, Tatiana, Odhiambo, Brian, Owodunni, Abraham, Obiefuna, Nnaemeka, Mohamed, Muhidin, Muhammad, Shamsuddeen Hassan, Ababu, Teshome Mulugeta, Salahudeen, Saheed Abdullahi, Yigezu, Mesay Gemeda, Gwadabe, Tajuddeen, Abdulmumin, Idris, Taye, Mahlet, Awoyomi, Oluwabusayo, Shode, Iyanuoluwa, Adelani, Tolulope, Abdulganiyu, Habiba, Omotayo, Abdul-Hakeem, Adeeko, Adetola, Afolabi, Abeeb, Aremu, Anuoluwapo, Samuel, Olanrewaju, Siro, Clemencia, Kimotho, Wangari, Ogbu, Onyekachi, Mbonu, Chinedu, Chukwuneke, Chiamaka, Fanijo, Samuel, Ojo, Jessica, Awosan, Oyinkansola, Kebede, Tadesse, Sakayo, Toadoum Sari, Nyatsine, Pamela, Sidume, Freedmore, Yousuf, Oreen, Oduwole, Mardiyyah, Tshinu, Tshinu, Kimanuka, Ussen, Diko, Thina, Nxakama, Siyanda, Nigusse, Sinodos, Johar, Abdulmejid, Mohamed, Shafie, Hassan, Fuad Mire, Mehamed, Moges Ahmed, Ngabire, Evrard, Jules, Jules, Ssenkungu, Ivan, Stenetorp, Pontus
Year of Publication 19.04.2023
Journal Article
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
Adelani, David Ifeoluwa, Neubig, Graham, Ruder, Sebastian, Rijhwani, Shruti, Beukman, Michael, Palen-Michel, Chester, Lignos, Constantine, Alabi, Jesujoba O, Muhammad, Shamsuddeen H, Nabende, Peter, Dione, Cheikh M. Bamba, Bukula, Andiswa, Mabuya, Rooweither, Dossou, Bonaventure F. P, Sibanda, Blessing, Buzaaba, Happy, Mukiibi, Jonathan, Kalipe, Godson, Mbaye, Derguene, Taylor, Amelia, Kabore, Fatoumata, Emezue, Chris Chinenye, Aremu, Anuoluwapo, Ogayo, Perez, Gitau, Catherine, Munkoh-Buabeng, Edwin, Koagne, Victoire M, Tapo, Allahsera Auguste, Macucwa, Tebogo, Marivate, Vukosi, Mboning, Elvis, Gwadabe, Tajuddeen, Adewumi, Tosin, Ahia, Orevaoghene, Nakatumba-Nabende, Joyce, Mokono, Neo L, Ezeani, Ignatius, Chukwuneke, Chiamaka, Adeyemi, Mofetoluwa, Hacheme, Gilles Q, Abdulmumin, Idris, Ogundepo, Odunayo, Yousuf, Oreen, Ngoli, Tatiana Moteu, Klakow, Dietrich
Year of Publication 22.10.2022
Journal Article
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation
Adelani, David Ifeoluwa, Alabi, Jesujoba Oluwadara, Fan, Angela, Kreutzer, Julia, Shen, Xiaoyu, Reid, Machel, Ruiter, Dana, Klakow, Dietrich, Nabende, Peter, Chang, Ernie, Gwadabe, Tajuddeen, Sackey, Freshia, Dossou, Bonaventure F. P, Emezue, Chris Chinenye, Leong, Colin, Beukman, Michael, Muhammad, Shamsuddeen Hassan, Jarso, Guyo Dub, Yousuf, Oreen, Rubungo, Andre Niyongabo, Hacheme, Gilles, Wairagala, Eric Peter, Nasir, Muhammad Umair, Ajibade, Benjamin Ayoade, Ajayi, Tunde Oluwaseyi, Gitau, Yvonne Wambui, Abbott, Jade, Ahmed, Mohamed, Ochieng, Millicent, Aremu, Anuoluwapo, Ogayo, Perez, Mukiibi, Jonathan, Kabore, Fatoumata Ouoba, Kalipe, Godson Koffi, Mbaye, Derguene, Tapo, Allahsera Auguste, Koagne, Victoire Memdjokam, Munkoh-Buabeng, Edwin, Wagner, Valencia, Abdulmumin, Idris, Awokoya, Ayodele, Buzaaba, Happy, Sibanda, Blessing, Bukula, Andiswa, Manthalu, Sam
Year of Publication 04.05.2022
Journal Article
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
Adelani, David Ifeoluwa, Ojo, Jessica, Israel Abebe Azime, Jian Yun Zhuang, Alabi, Jesujoba O, He, Xuanli, Ochieng, Millicent, Hooker, Sara, Bukula, Andiswa, En-Shiun Annie Lee, Chukwuneke, Chiamaka, Buzaaba, Happy, Sibanda, Blessing, Godson Kalipe, Mukiibi, Jonathan, Kabongo, Salomon, Yuehgoh, Foutse, Setaka, Mmasibidi, Ndolela, Lolwethu, Odu, Nkiruka, Mabuya, Rooweither, Shamsuddeen Hassan Muhammad, Osei, Salomey, Samb, Sokhar, Tadesse Kebede Guge, Stenetorp, Pontus
Published in arXiv.org (05.06.2024)
Paper
MasakhaNER: Named Entity Recognition for African Languages
Adelani, David Ifeoluwa, Abbott, Jade, Neubig, Graham, D'souza, Daniel, Kreutzer, Julia, Lignos, Constantine, Palen-Michel, Chester, Buzaaba, Happy, Rijhwani, Shruti, Ruder, Sebastian, Mayhew, Stephen, Azime, Israel Abebe, Muhammad, Shamsuddeen, Emezue, Chris Chinenye, Nakatumba-Nabende, Joyce, Ogayo, Perez, Aremu, Anuoluwapo, Gitau, Catherine, Mbaye, Derguene, Alabi, Jesujoba, Yimam, Seid Muhie, Gwadabe, Tajuddeen, Ezeani, Ignatius, Niyongabo, Rubungo Andre, Mukiibi, Jonathan, Otiende, Verrah, Orife, Iroro, David, Davis, Ngom, Samba, Adewumi, Tosin, Rayson, Paul, Adeyemi, Mofetoluwa, Muriuki, Gerald, Anebi, Emmanuel, Chukwuneke, Chiamaka, Odu, Nkiruka, Wairagala, Eric Peter, Oyerinde, Samuel, Siro, Clemencia, Bateesa, Tobius Saul, Oloyede, Temilola, Wambui, Yvonne, Akinode, Victor, Nabagereka, Deborah, Katusiime, Maurice, Awokoya, Ayodele, MBOUP, Mouhamadane, Gebreyohannes, Dibora, Tilaye, Henok, Nwaike, Kelechi, Wolde, Degaga, Faye, Abdoulaye, Sibanda, Blessing, Ahia, Orevaoghene, Dossou, Bonaventure F. P, Ogueji, Kelechi, DIOP, Thierno Ibrahima, Diallo, Abdoulaye, Akinfaderin, Adewale, Marengereke, Tendai, Osei, Salomey
Year of Publication 22.03.2021
Journal Article
BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Meyer, Josh, Adelani, David Ifeoluwa, Casanova, Edresson, Öktem, Alp, Whitenack, Daniel, Weber, Julian, Kabongo, Salomon, Salesky, Elizabeth, Orife, Iroro, Leong, Colin, Perez Ogayo, Emezue, Chris, Mukiibi, Jonathan, Osei, Salomey, Agbolo, Apelete, Akinode, Victor, Opoku, Bernard, Olanrewaju, Samuel, Alabi, Jesujoba, Shamsuddeen Muhammad
Published in arXiv.org (07.07.2022)
Paper
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages
Cheikh M Bamba Dione, Adelani, David, Nabende, Peter, Alabi, Jesujoba, Sindane, Thapelo, Buzaaba, Happy, Shamsuddeen Hassan Muhammad, Emezue, Chris Chinenye, Perez Ogayo, Aremu, Anuoluwapo, Gitau, Catherine, Mbaye, Derguene, Mukiibi, Jonathan, Sibanda, Blessing, Bonaventure F P Dossou, Bukula, Andiswa, Mabuya, Rooweither, Allahsera Auguste Tapo, Munkoh-Buabeng, Edwin, Koagne, victoire Memdjokam, Fatoumata Ouoba Kabore, Taylor, Amelia, Godson Kalipe, Macucwa, Tebogo, Marivate, Vukosi, Gwadabe, Tajuddeen, Elvis, Mboning Tchiaze, Onyenwe, Ikechukwu, Atindogbe, Gratien, Adelani, Tolulope, Akinade, Idris, Olanrewaju, Samuel, Marien Nahimana, Musabeyezu, Théogène, Niyomutabazi, Emile, Chimhenga, Ester, Gotosa, Kudzai, Mizha, Patrick, Agbolo, Apelete, Traore, Seydou, Chinedu Uchechukwu, Aliyu Yusuf, Abdullahi, Muhammad, Klakow, Dietrich
Published in arXiv.org (23.05.2023)
Paper
MasakhaNEWS: News Topic Classification for African languages
Adelani, David Ifeoluwa, Masiak, Marek, Israel Abebe Azime, Alabi, Jesujoba, Tonja, Atnafu Lambebo, Mwase, Christine, Ogundepo, Odunayo, Bonaventure F P Dossou, Akintunde Oladipo, Nixdorf, Doreen, Emezue, Chris Chinenye, Al-Azzawi, Sana, Sibanda, Blessing, Davis, David, Ndolela, Lolwethu, Mukiibi, Jonathan, Ajayi, Tunde, Moteu, Tatiana, Odhiambo, Brian, Owodunni, Abraham, Obiefuna, Nnaemeka, Muhidin Mohamed, Shamsuddeen Hassan Muhammad, Ababu, Teshome Mulugeta, Saheed Abdullahi Salahudeen, Mesay Gemeda Yigezu, Gwadabe, Tajuddeen, Idris Abdulmumin, Taye, Mahlet, Awoyomi, Oluwabusayo, Shode, Iyanuoluwa, Adelani, Tolulope, Abdulganiyu, Habiba, Abdul-Hakeem Omotayo, Adeeko, Adetola, Afolabi, Abeeb, Aremu, Anuoluwapo, Olanrewaju, Samuel, Siro, Clemencia, Wangari Kimotho, Ogbu, Onyekachi, Mbonu, Chinedu, Chukwuneke, Chiamaka, Fanijo, Samuel, Ojo, Jessica, Awosan, Oyinkansola, Kebede, Tadesse, Toadoum, Sari Sakayo, Nyatsine, Pamela, Sidume, Freedmore, Yousuf, Oreen, Oduwole, Mardiyyah, Tshinu, Tshinu, Kimanuka, Ussen, Diko, Thina, Nxakama, Siyanda, Nigusse, Sinodos, Johar, Abdulmejid, Shafie, Mohamed, Fuad Mire Hassan, Moges Ahmed Mehamed, Evrard Ngabire, Jules, Jules, Ssenkungu, Ivan, Stenetorp, Pontus
Published in arXiv.org (20.09.2023)
Paper
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
Adelani, David Ifeoluwa, Neubig, Graham, Ruder, Sebastian, Rijhwani, Shruti, Beukman, Michael, Palen-Michel, Chester, Lignos, Constantine, Alabi, Jesujoba O, Muhammad, Shamsuddeen H, Nabende, Peter, Cheikh M Bamba Dione, Bukula, Andiswa, Mabuya, Rooweither, Bonaventure F P Dossou, Sibanda, Blessing, Buzaaba, Happy, Mukiibi, Jonathan, Godson Kalipe, Mbaye, Derguene, Taylor, Amelia, Kabore, Fatoumata, Emezue, Chris Chinenye, Aremu, Anuoluwapo, Perez Ogayo, Gitau, Catherine, Munkoh-Buabeng, Edwin, Koagne, Victoire M, Allahsera Auguste Tapo, Macucwa, Tebogo, Marivate, Vukosi, Mboning, Elvis, Gwadabe, Tajuddeen, Adewumi, Tosin, Ahia, Orevaoghene, Joyce Nakatumba-Nabende, Mokono, Neo L, Ezeani, Ignatius, Chukwuneke, Chiamaka, Adeyemi, Mofetoluwa, Hacheme, Gilles Q, Idris Abdulmumin, Ogundepo, Odunayo, Yousuf, Oreen, Ngoli, Tatiana Moteu, Klakow, Dietrich
Published in arXiv.org (15.11.2022)
Paper
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation
Adelani, David Ifeoluwa, Alabi, Jesujoba Oluwadara, Fan, Angela, Kreutzer, Julia, Shen, Xiaoyu, Reid, Machel, Ruiter, Dana, Klakow, Dietrich, Nabende, Peter, Chang, Ernie, Gwadabe, Tajuddeen, Sackey, Freshia, Bonaventure F P Dossou, Emezue, Chris Chinenye, Leong, Colin, Beukman, Michael, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Yousuf, Oreen, Andre Niyongabo Rubungo, Hacheme, Gilles, Wairagala, Eric Peter, Nasir, Muhammad Umair, Ajibade, Benjamin Ayoade, Ajayi, Tunde Oluwaseyi, Gitau, Yvonne Wambui, Abbott, Jade, Ahmed, Mohamed, Ochieng, Millicent, Aremu, Anuoluwapo, Perez Ogayo, Mukiibi, Jonathan, Fatoumata Ouoba Kabore, Godson Koffi Kalipe, Mbaye, Derguene, Allahsera Auguste Tapo, Koagne, Victoire Memdjokam, Munkoh-Buabeng, Edwin, Wagner, Valencia, Idris Abdulmumin, Awokoya, Ayodele, Buzaaba, Happy, Sibanda, Blessing, Bukula, Andiswa, Manthalu, Sam
Published in arXiv.org (22.08.2022)
Paper
MasakhaNER: Named Entity Recognition for African Languages
Adelani, David Ifeoluwa, Abbott, Jade, Neubig, Graham, D'souza, Daniel, Kreutzer, Julia, Lignos, Constantine, Palen-Michel, Chester, Buzaaba, Happy, Rijhwani, Shruti, Ruder, Sebastian, Mayhew, Stephen, Israel Abebe Azime, Shamsuddeen Muhammad, Emezue, Chris Chinenye, Joyce Nakatumba-Nabende, Perez Ogayo, Aremu, Anuoluwapo, Gitau, Catherine, Mbaye, Derguene, Alabi, Jesujoba, Yimam, Seid Muhie, Gwadabe, Tajuddeen, Ezeani, Ignatius, Rubungo Andre Niyongabo, Mukiibi, Jonathan, Otiende, Verrah, Orife, Iroro, Davis, David, Ngom, Samba, Adewumi, Tosin, Rayson, Paul, Adeyemi, Mofetoluwa, Muriuki, Gerald, Anebi, Emmanuel, Chukwuneke, Chiamaka, Odu, Nkiruka, Wairagala, Eric Peter, Oyerinde, Samuel, Siro, Clemencia, Tobius Saul Bateesa, Oloyede, Temilola, Wambui, Yvonne, Akinode, Victor, Nabagereka, Deborah, Katusiime, Maurice, Awokoya, Ayodele, MBOUP, Mouhamadane, Gebreyohannes, Dibora, Tilaye, Henok, Nwaike, Kelechi, Degaga Wolde, Abdoulaye Faye, Sibanda, Blessing, Ahia, Orevaoghene, Bonaventure F P Dossou, Ogueji, Kelechi, Thierno Ibrahima DIOP, Diallo, Abdoulaye, Akinfaderin, Adewale, Marengereke, Tendai, Osei, Salomey
Published in arXiv.org (05.07.2021)
Paper