APPARATUS AND METHOD FOR EXTRACTING INDEX

Bibliographic Details
Main Authors AHN, HEE JEONG; CHOI, GUN HEE; KIM, SEO HEE; YOU, EUN SOON; HONG, MIN HA; KIM, SEUNG HUN
Format Patent
Language English, Korean
Published 13.09.2016
Abstract The present invention relates to an apparatus and a method for extracting a subject word, and more specifically to an apparatus and a method for extracting the subject word of a book or an electronic book, covering both literary and non-literary genres. According to the invention, the method includes the steps of: dividing the body content of the book or electronic book into a plurality of divided sections; performing morpheme analysis on each divided section to extract the words it contains and their frequencies; separating important sentences from general sentences in the divided sections; calculating an important-sentence weight for each important word, based on the important words contained in the important sentences and the frequencies of those words; and deriving the subject word of the book or electronic book from the word frequencies and the important-sentence weights.
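The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how important sentences are identified or how the weight is computed, so the sketch assumes that the first sentence of each section is the important one and that the weight is a constant `boost` multiplied by a word's frequency in important sentences. A simple regular-expression tokenizer stands in for morpheme analysis, and all function and parameter names are hypothetical.

```python
import re
from collections import Counter


def split_sections(body, n_sections):
    """Divide the body text into roughly equal runs of sentences."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", body) if s.strip()]
    size = max(1, -(-len(sentences) // n_sections))  # ceiling division
    return [sentences[i:i + size] for i in range(0, len(sentences), size)]


def tokenize(sentence):
    """Stand-in for morpheme analysis: lowercase alphabetic tokens only."""
    return re.findall(r"[a-z]+", sentence.lower())


def extract_subject_words(body, n_sections=2, boost=2.0, top_k=3,
                          stopwords=frozenset()):
    freq = Counter()            # word frequency over all sections
    important_freq = Counter()  # frequency inside important sentences only
    for section in split_sections(body, n_sections):
        for idx, sentence in enumerate(section):
            words = [w for w in tokenize(sentence) if w not in stopwords]
            freq.update(words)
            if idx == 0:  # ASSUMPTION: first sentence of a section is "important"
                important_freq.update(words)
    # ASSUMPTION: weight = boost * frequency in important sentences
    scores = {w: freq[w] + boost * important_freq[w] for w in freq}
    # Highest score first; ties broken alphabetically for a stable order
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_k]


text = ("Whales migrate far. The sea is cold. "
        "Whales sing songs. Ships cross the sea.")
result = extract_subject_words(text, n_sections=2, stopwords={"the", "is"})
print(result)
```

For Korean text, the `tokenize` stand-in would need to be replaced by a real morphological analyzer, and the important-sentence rule by whatever criterion the full patent specification defines.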
DocumentTitleAlternate 주제어 추출 장치 및 방법 (Apparatus and method for extracting a subject word)
ExternalDocumentID KR20160106984A
Notes Application Number: KR20150029807
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20160913&DB=EPODOC&CC=KR&NR=20160106984A
RelatedCompanies INDUSTRY-ACADEMIC COOPERATION FOUNDATION, DANKOOKUNIVERSITY
SubjectTerms CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS