Document key information extraction method and system based on keyword splitting technology
The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out ke...
Saved in:
Main Authors | , , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
28.12.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t |
---|---|
AbstractList | The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t |
Author | ZHAO ZENGTAO YU SHAOFENG LIAO CHONGYANG SHE JUN LUO YONG |
Author_xml | – fullname: ZHAO ZENGTAO – fullname: SHE JUN – fullname: LUO YONG – fullname: YU SHAOFENG – fullname: LIAO CHONGYANG |
BookMark | eNqNizsOwjAQRF1Awe8OywGQEkVBtCiAqKjoKCJjbxILe9eKF0FuT4Q4ANWM3ryZqwkx4UzdDmyeAUnggQM4argPWhwT4Ft6bb41oHRsQZOFNCTBAHed0MI4ja8X9yOP3ok4akHQdMSe22Gppo32CVe_XKj16XitzhuMXGOK2iCh1NUlz4tdmWXldl_843wAJx4-HQ |
ContentType | Patent |
DBID | EVB |
DatabaseName | esp@cenet |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Chemistry Sciences Physics |
DocumentTitleAlternate | 一种基于关键词拆分技术的文档关键信息提取方法和系统 |
ExternalDocumentID | CN113850056A |
GroupedDBID | EVB |
ID | FETCH-epo_espacenet_CN113850056A3 |
IEDL.DBID | EVB |
IngestDate | Fri Aug 30 05:40:47 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | Chinese English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-epo_espacenet_CN113850056A3 |
Notes | Application Number: CN202111052073 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&CC=CN&NR=113850056A |
ParticipantIDs | epo_espacenet_CN113850056A |
PublicationCentury | 2000 |
PublicationDate | 20211228 |
PublicationDateYYYYMMDD | 2021-12-28 |
PublicationDate_xml | – month: 12 year: 2021 text: 20211228 day: 28 |
PublicationDecade | 2020 |
PublicationYear | 2021 |
RelatedCompanies | INFORMATION COMMUNICATION BRANCH, SOUTHERN POWER GRID PEAKING FM POWER GENERATION CO., LTD |
RelatedCompanies_xml | – name: INFORMATION COMMUNICATION BRANCH, SOUTHERN POWER GRID PEAKING FM POWER GENERATION CO., LTD |
Score | 3.500715 |
Snippet | The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document... |
SourceID | epo |
SourceType | Open Access Repository |
SubjectTerms | CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS |
Title | Document key information extraction method and system based on keyword splitting technology |
URI | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&locale=&CC=CN&NR=113850056A |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3NS8MwFH_MqdObVmXODyJIb0WbrVt2KOLSliGsGzJl4GEsbYp66MbaIfjX-5J2mxe9vpCQPPjlfeS9XwBuu8qpthOMTlgkFam2tJgt7y27TamMOyJhmqd7ELb7L62niTOpwOe6F0bzhH5pckREVIR4z_V9vdgmsTxdW5ndiQ8UzR-CseuZZXSM0QylzPR6rj8aekNucu7y0AyfXdtuMkfxXj7uwC660R2FBv-1p7pSFr9NSnAEeyNcLc2PofL9bsABX_-8ZkBtUD54G7CvKzSjDIUlCrMTeEPLsFJZPYIIJCX1qdozwZt2WXQqkOJraDJLY1KwNRNlsGKCQzgLFYBydEB12TPJNwn2U7gJ_DHvW7jf6UY5Ux5uj9Y8g2o6T2UdSJSwGXUE-s8RxgAJ7bakIwRaf4bxXiLYOTT-Xqfx3-AFHCpFq5IOyi6hmi9X8goNcy6utUZ_AN0jk3g |
link.rule.ids | 230,309,783,888,25576,76876 |
linkProvider | European Patent Office |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3NT8IwFH9B_MCbTo3iV03MbouuMCiHxUgHQWWDGDQkHhY6uqiHQWDExL_e126AF72-pk37kl_fR9_7FeC6oZxqO8bohEVSkWpLi9ny1rJrlMpxXcRM83T7Qa3zUn0cOsMCfC57YTRP6JcmR0RERYj3VN_X03USy9O1lfMb8YGiyV174HpmHh1jNEMpM72m2-r3vB43OXd5YAbPrm1XmKN4L-83YBNd7LpCQ-u1qbpSpr9NSnsPtvq4WpLuQ-H73YASX_68ZsCOnz94G7CtKzSjOQpzFM4P4A0tw0Jl9QgikOTUp2rPBG_aWdapQLKvockoGZOMrZkogzUmOISzUAEoRwdUlz2TdJVgP4SrdmvAOxbuN1wpJ-TB-miVIygmk0QeA4liNqKOQP85whggpo2qdIRA688w3osFO4Hy3-uU_xu8hFJn4HfD7kPwdAq7SumqvIOyMyims4U8RyOdigut3R_0hJZr |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Document+key+information+extraction+method+and+system+based+on+keyword+splitting+technology&rft.inventor=ZHAO+ZENGTAO&rft.inventor=SHE+JUN&rft.inventor=LUO+YONG&rft.inventor=YU+SHAOFENG&rft.inventor=LIAO+CHONGYANG&rft.date=2021-12-28&rft.externalDBID=A&rft.externalDocID=CN113850056A |