AUTOMATED TOOL FOR QUESTION GENERATION

Computerized methods are disclosed for automated question generation from source documents through natural language processing, for applications including training and testing. Interleaved selection and transformation phases employ combined systematic-syntactic analysis to progressively refine natur...

Full description

Saved in:

Bibliographic Details
Main Authors	VEDEN, AARON J, GOETSCHALCKX, ROBBY JOZEF MARIA, ROBSON, ROBERT O, ROBSON, ELLIOT NICHOLAS, KELSEY, ELAINE, RAY, RONALD EDWARD
Format	Patent
Language	English French
Published	21.02.2023
Subjects	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Computerized methods are disclosed for automated question generation from source documents through natural language processing, for applications including training and testing. Interleaved selection and transformation phases employ combined systematic-syntactic analysis to progressively refine natural input text into a high density of text fragments having high content value. Non-local semantic content and attributes such as emphasis attributes can be attached to the text fragments. The text fragments are reverse parsed by matching against a precomputed library of combined semantic-syntactic patterns. Once the patterns of each fragment are determined, transformation of fragments into question-answer pairs is performed using question selectors and answer selectors tailored to each pattern. Methods for constructing distractors, both internal and external, are also disclosed. The ecosystem of machine learning components, ontology resources, and process improvement are also described. L'invention concerne des procédés informatisés destinés à la génération automatisée de questions à partir de documents sources par le biais d'un traitement de langage naturel, pour diverses applications telles que l'apprentissage et les tests. Des phases entrelacées de sélection et de transformation utilisent une analyse systématique-syntaxique combinée pour affiner progressivement un texte entré en langage naturel en une haute densité de fragments de texte ayant une valeur de contenu élevée. Un contenu sémantique non local et des attributs tels que des attributs d'accentuation peuvent être associés aux fragments de texte. Les fragments de texte sont analysés à rebours par une mise en correspondance avec une bibliothèque précalculée de motifs sémantiques-syntaxiques combinés. Une fois que les motifs de chaque fragment sont déterminés, la transformation de fragments en paires de questions-réponses est effectuée à l'aide de sélecteurs de questions et de sélecteurs de réponses adaptés à chaque motif. L'invention concerne également des procédés de construction de distracteurs, à la fois internes et externes. L'écosystème de composants d'apprentissage automatique, de ressources d'ontologie et d'amélioration de processus sont également décrits.
Bibliography:	Application Number: CA20183055379