A unified framework for dataset shift diagnostics
Maia Polo, Felipe, Izbicki, Rafael, Lacerda, Evanildo Gomes, Ibieta-Jimenez, Juan Pablo, Vicente, Renato
Published in Information sciences (01.11.2023)
Journal Article
Predicting Legal Proceedings Status: Approaches Based on Sequential Text Data
Paper
Journal Article
A unified framework for dataset shift diagnostics
Maia Polo, Felipe, Izbicki, Rafael, Lacerda, Evanildo Gomes, Ibieta-Jimenez, Juan Pablo, Vicente, Renato
Published in arXiv.org (12.09.2023)
Paper
Journal Article
A statistical framework for weak-to-strong generalization
Somerstep, Seamus, Polo, Felipe Maia, Banerjee, Moulinath, Ritov, Ya'acov, Yurochkin, Mikhail, Sun, Yuekai
Year of Publication 25.05.2024
Journal Article
tinyBenchmarks: evaluating LLMs with fewer examples
Polo, Felipe Maia, Weber, Lucas, Choshen, Leshem, Sun, Yuekai, Xu, Gongjun, Yurochkin, Mikhail
Year of Publication 22.02.2024
Journal Article
Weak Supervision Performance Evaluation via Partial Identification
Polo, Felipe Maia, Maity, Subha, Yurochkin, Mikhail, Banerjee, Moulinath, Sun, Yuekai
Year of Publication 07.12.2023
Journal Article
Fusing Models with Complementary Expertise
Wang, Hongyi, Polo, Felipe Maia, Sun, Yuekai, Kundu, Souvik, Xing, Eric, Yurochkin, Mikhail
Year of Publication 02.10.2023
Journal Article
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Shabtay, Nimrod, Polo, Felipe Maia, Doveh, Sivan, Lin, Wei, Mirza, M. Jehanzeb, Choshen, Leshem, Yurochkin, Mikhail, Sun, Yuekai, Arbelle, Assaf, Karlinsky, Leonid, Giryes, Raja
Year of Publication 14.10.2024
Journal Article
Efficient multi-prompt evaluation of LLMs
Polo, Felipe Maia, Xu, Ronald, Weber, Lucas, Silva, Mírian, Bhardwaj, Onkar, Choshen, Leshem, de Oliveira, Allysson Flavio Melo, Sun, Yuekai, Yurochkin, Mikhail
Year of Publication 27.05.2024
Journal Article
Weak Supervision Performance Evaluation via Partial Identification
Polo, Felipe Maia, Maity, Subha, Yurochkin, Mikhail, Banerjee, Moulinath, Sun, Yuekai
Published in arXiv.org (31.10.2024)
Paper