METHOD AND APPARATUS FOR DOCUMENT SUMMARIZATION

A method and device for document summarization are disclosed. The document summarization device according to one embodiment may comprise: an encoding portion which receives document data consisting of one or more sentences and converts the data into tokens defined in predetermined units to generate...

Full description

Saved in:
Bibliographic Details
Main Authors CHOI HYUN JIN, HWANG BONG KYU, KIM JU DONG, LEE HYUN JAE, YUN JAE WOONG
Format Patent
LanguageEnglish
Korean
Published 09.05.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method and device for document summarization are disclosed. The document summarization device according to one embodiment may comprise: an encoding portion which receives document data consisting of one or more sentences and converts the data into tokens defined in predetermined units to generate a feature vector; an extraction summarization portion which receives the feature vector, calculates a probability value that each sentence corresponds to the summarization for each one or more sentences constituting the document data, and generates an attention vector for a weight for each token based on the probability value; and a decoding portion which receives the feature vector and the attention vector and generates abstract summarization data. 문서 요약을 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 요약 장치는 하나 이상의 문장으로 구성된 문서 데이터를 입력 받아 소정 단위로 정의된 토큰으로 변환하여 특징 벡터(feature vector)를 생성하는 인코딩부; 특징 벡터를 입력 받아 문서 데이터를 구성하는 하나 이상의 문장 별로 각각의 문장이 요약에 해당할 확률값을 계산하며, 확률값에 기초하여 토큰 별 가중치에 대한 주의 벡터(attention vector)를 생성하는 추출 요약부; 및 특징 벡터 및 주의 벡터를 입력 받아 추상 요약 데이터를 생성하는 디코딩부를 포함할 수 있다.
Bibliography:Application Number: KR20210146956