AUTOMATICALLY GENERATING TRAINING DATA
Computer-readable media, computer systems, and computing devices facilitate generating binary classifier and entity extractor training data. Seed URLs are selected and URL patterns within the seed URLs are identified. Matching URLs in a data structure are identified and corresponding queries and the...
Saved in:
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English |
Published |
22.12.2011
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Computer-readable media, computer systems, and computing devices facilitate generating binary classifier and entity extractor training data. Seed URLs are selected and URL patterns within the seed URLs are identified. Matching URLs in a data structure are identified and corresponding queries and their associated weights are added to a potential training data set from which training data is selected. |
---|---|
Bibliography: | Application Number: US20100818377 |