A noise audit of human-labeled benchmarks for machine commonsense reasoning
Kejriwal, Mayank, Santos, Henrique, Shen, Ke, Mulvehill, Alice M, McGuinness, Deborah L
Published in Scientific reports (14.04.2024)
Published in Scientific reports (14.04.2024)
Get full text
Journal Article
TG-CSR: A human-labeled dataset grounded in nine formal commonsense categories
Santos, Henrique, Mulvehill, Alice M., Shen, Ke, Kejriwal, Mayank, McGuinness, Deborah L.
Published in Data in brief (01.12.2023)
Published in Data in brief (01.12.2023)
Get full text
Journal Article
Designing a strong test for measuring true common-sense reasoning
Kejriwal, Mayank, Santos, Henrique, Mulvehill, Alice M., McGuinness, Deborah L.
Published in Nature machine intelligence (01.04.2022)
Published in Nature machine intelligence (01.04.2022)
Get full text
Journal Article
A Theoretically Grounded Question Answering Data Set for Evaluating Machine Common Sense
Santos, Henrique, Shen, Ke, Mulvehill, Alice M., Kejriwal, Mayank, McGuinness, Deborah L.
Published in Data intelligence (01.12.2024)
Published in Data intelligence (01.12.2024)
Get full text
Journal Article
Operational assessment: so how are we doing?
Welshans, James S, Owen, Charles, Mulvehill, Alice M, Hickey, Calvin W, Farrell, Robert J., Jr
Published in Air & space power journal (22.12.2016)
Get full text
Published in Air & space power journal (22.12.2016)
Journal Article
A Theoretically Grounded Benchmark for Evaluating Machine Commonsense
Santos, Henrique, Shen, Ke, Mulvehill, Alice M, Razeghi, Yasaman, McGuinness, Deborah L, Kejriwal, Mayank
Year of Publication 23.03.2022
Year of Publication 23.03.2022
Get full text
Journal Article