TORCHLITE: New, Open Analytical Tools and Infrastructure for a Mega‐Scale Digital Library
Lamba, Manika, Walsh, John, Dubnicek, Ryan, Christie, Jennifer, Downie, J. Stephen, Swatscheno, Janet, Kudeki, Deren, Layne‐Worthey, Glen
Published in Proceedings of the Association for Information Science and Technology (01.10.2024)
Published in Proceedings of the Association for Information Science and Technology (01.10.2024)
Get full text
Journal Article
Uncovering Black Fantastic: Piloting A Word Feature Analysis and Machine Learning Approach for Genre Classification
Parulian, Nikolaus Nova, Dubnicek, Ryan, Worthey, Glen, Evans, Daniel J., Walsh, John A., Downie, J. Stephen
Published in Proceedings of the Association for Information Science and Technology (01.10.2022)
Published in Proceedings of the Association for Information Science and Technology (01.10.2022)
Get full text
Journal Article
Tuning Out the Noise: Benchmarking Entity Extraction for Digitized Native American Literature
Parulian, Nikolaus Nova, Dubnicek, Ryan, Evans, Daniel J., Hu, Yuerong, Layne‐Worthey, Glen, Downie, J. Stephen, Heaton, Raina, Lu, Kun, Orr, Raymond I., Magni, Isabella, Walsh, John A.
Published in Proceedings of the Association for Information Science and Technology (01.10.2023)
Published in Proceedings of the Association for Information Science and Technology (01.10.2023)
Get full text
Journal Article
A Prototype Gutenberg-HathiTrust Sentence-level Parallel Corpus for OCR Error Analysis: Pilot Investigations
Jiang, Ming, Dubnicek, Ryan C, Worthey, Glen, Underwood, Ted, Downie, J. Stephen
Published in 2022 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (20.06.2022)
Published in 2022 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (20.06.2022)
Get full text
Conference Proceeding
Evaluating BERT's Encoding of Intrinsic Semantic Features of OCR'd Digital Library Collections
Jiang, Ming, Hu, Yuerong, Worthey, Glen, Dubnicek, Ryan C, Underwood, Ted, Downie, J Stephen
Published in 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (01.09.2021)
Published in 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (01.09.2021)
Get full text
Conference Proceeding
Introduction to and Hands-On Use Cases with HathiTrust Research Center's Extracted Features 2.0 Dataset
Dubnicek, Ryan, Kudeki, Deren
Published in 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (01.09.2021)
Published in 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (01.09.2021)
Get full text
Conference Proceeding
Text Mining with HathiTrust
Koehl, Eleanor Dickson, Dubnicek, Ryan
Published in 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (01.06.2019)
Published in 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (01.06.2019)
Get full text
Conference Proceeding