Document key information extraction method and system based on keyword splitting technology

The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out ke...

Bibliographic Details
Main Authors ZHAO ZENGTAO, SHE JUN, LUO YONG, YU SHAOFENG, LIAO CHONGYANG
Format Patent
Language Chinese
English
Published 28.12.2021
Abstract The invention provides a method and system for extracting key information from documents based on a keyword splitting technique, and relates to the field of document key information extraction. The method comprises two steps: converting an obtained target document into an XML-format document, and extracting key information from the XML-format document using a keyword splitting detection technique. First, a target document is obtained and converted into an XML-format document. XML (Extensible Markup Language) is a markup language used to mark up electronic files so that they have structure. Converting the target document into XML therefore facilitates subsequent information extraction. Next, key information is extracted from the XML-format document using the keyword splitting detection technique. In this step, structured key field information can be extracted from continuous natural-language text. The problem t
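The two steps named in the abstract can be illustrated with a minimal Python sketch. The record does not disclose how the conversion or the keyword splitting detection is actually performed, so everything here is an assumption made for illustration: the function names `to_xml` and `extract_fields`, the `<document>`/`<p>` layout, the sample keywords, and the rule that a field's value is the text between one keyword and the next.

```python
import re
import xml.etree.ElementTree as ET


def to_xml(paragraphs):
    """Wrap plain-text paragraphs in a minimal XML document.

    Hypothetical stand-in for the conversion step; a real converter
    would start from the original file format.
    """
    root = ET.Element("document")
    for text in paragraphs:
        ET.SubElement(root, "p").text = text
    return ET.tostring(root, encoding="unicode")


def extract_fields(xml_text, keywords):
    """Split the document text at each keyword and map keyword -> following text."""
    if not keywords:
        return {}
    root = ET.fromstring(xml_text)
    # Flatten all element text into one continuous string.
    text = " ".join(t.strip() for t in root.itertext() if t.strip())
    # Try longer keywords first so a short keyword cannot shadow a longer one.
    ordered = sorted(keywords, key=len, reverse=True)
    pattern = "|".join(re.escape(k) for k in ordered)
    # With a capturing group, re.split keeps the keywords:
    # [prefix, keyword, value, keyword, value, ...]
    parts = re.split(f"({pattern})", text)
    fields = {}
    for keyword, value in zip(parts[1::2], parts[2::2]):
        # Keep the first occurrence; trim separators around the value.
        fields.setdefault(keyword, value.strip(" :;,"))
    return fields


xml_text = to_xml([
    "Contract No: A-17; Party A: Acme Grid",
    "Signing Date: 2021-09-08",
])
fields = extract_fields(xml_text, ["Contract No", "Party A", "Signing Date"])
```

In this sketch the keywords act as field labels, and each value is whatever text falls between one label and the next, which is one simple way to turn continuous text into structured fields.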
Author ZHAO ZENGTAO
YU SHAOFENG
LIAO CHONGYANG
SHE JUN
LUO YONG
Author_xml – fullname: ZHAO ZENGTAO
– fullname: SHE JUN
– fullname: LUO YONG
– fullname: YU SHAOFENG
– fullname: LIAO CHONGYANG
ContentType Patent
DBID EVB
DatabaseName esp@cenet
DatabaseTitleList
Database_xml – sequence: 1
  dbid: EVB
  name: esp@cenet
  url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Sciences
Physics
DocumentTitleAlternate 一种基于关键词拆分技术的文档关键信息提取方法和系统
ExternalDocumentID CN113850056A
GroupedDBID EVB
ID FETCH-epo_espacenet_CN113850056A3
IEDL.DBID EVB
IngestDate Fri Aug 30 05:40:47 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language Chinese
English
LinkModel DirectLink
MergedId FETCHMERGED-epo_espacenet_CN113850056A3
Notes Application Number: CN202111052073
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&CC=CN&NR=113850056A
ParticipantIDs epo_espacenet_CN113850056A
PublicationCentury 2000
PublicationDate 20211228
PublicationDateYYYYMMDD 2021-12-28
PublicationDate_xml – month: 12
  year: 2021
  text: 20211228
  day: 28
PublicationDecade 2020
PublicationYear 2021
RelatedCompanies INFORMATION COMMUNICATION BRANCH, SOUTHERN POWER GRID PEAKING FM POWER GENERATION CO., LTD
RelatedCompanies_xml – name: INFORMATION COMMUNICATION BRANCH, SOUTHERN POWER GRID PEAKING FM POWER GENERATION CO., LTD
Score 3.500715
Snippet The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document...
SourceID epo
SourceType Open Access Repository
SubjectTerms CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
Title Document key information extraction method and system based on keyword splitting technology
URI https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&locale=&CC=CN&NR=113850056A
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
linkProvider European Patent Office