"All You Can Eat" Ontology-Building Feeding Wikipedia to Cyc

In order to achieve genuine web intelligence, building some kind of large general machine-readable conceptual scheme (i.e. ontology) seems inescapable. Yet the past 20 years have shown that manual ontology-building is not practicable. The recent explosion of free user-supplied knowledge on the Web h...

Full description

Saved in:
Bibliographic Details
Published inProceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01 Vol. 1; pp. 341 - 348
Main Authors Sarjant, Samuel, Legg, Catherine, Robinson, Michael, Medelyan, Olena
Format Conference Proceeding
LanguageEnglish
Published Washington, DC, USA IEEE Computer Society 15.09.2009
IEEE
SeriesACM Conferences
Subjects
Online AccessGet full text
ISBN0769538010
9780769538013
DOI10.1109/WI-IAT.2009.60

Cover

Abstract In order to achieve genuine web intelligence, building some kind of large general machine-readable conceptual scheme (i.e. ontology) seems inescapable. Yet the past 20 years have shown that manual ontology-building is not practicable. The recent explosion of free user-supplied knowledge on the Web has led to great strides in automatic ontology-building, but quality-control is still a major issue. Ideally one should automatically build onto an already intelligent base. We suggest that the long-running Cyc project is able to assist here. We describe methods used to add 35K new concepts mined from Wikipedia to collections in ResearchCyc entirely automatically. Evaluation with 22 human subjects shows high precision both for the new concepts’ categorization, and their assignment as individuals or collections. Most importantly we show how Cyc itself can be leveraged for ontological quality control by ‘feeding’ it assertions one by one, enabling it to reject those that contradict its other knowledge.
AbstractList In order to achieve genuine web intelligence, building some kind of large general machine-readable conceptual scheme (i.e. ontology) seems inescapable. Yet the past 20 years have shown that manual ontology-building is not practicable. The recent explosion of free user-supplied knowledge on the Web has led to great strides in automatic ontology-building, but quality-control is still a major issue. Ideally one should automatically build onto an already intelligent base. We suggest that the long-running Cyc project is able to assist here. We describe methods used to add 35K new concepts mined from Wikipedia to collections in ResearchCyc entirely automatically. Evaluation with 22 human subjects shows high precision both for the new concepts’ categorization, and their assignment as individuals or collections. Most importantly we show how Cyc itself can be leveraged for ontological quality control by ‘feeding’ it assertions one by one, enabling it to reject those that contradict its other knowledge.
Author Legg, Catherine
Robinson, Michael
Medelyan, Olena
Sarjant, Samuel
Author_xml – sequence: 1
  givenname: Samuel
  surname: Sarjant
  fullname: Sarjant, Samuel
– sequence: 2
  givenname: Catherine
  surname: Legg
  fullname: Legg, Catherine
– sequence: 3
  givenname: Michael
  surname: Robinson
  fullname: Robinson, Michael
– sequence: 4
  givenname: Olena
  surname: Medelyan
  fullname: Medelyan, Olena
BookMark eNqNkLFOwzAURY0ACRqysrBEZWBKeLZjJx5DVCBSpS5FFZPl2M9VIE1Q0g79e1KVD2C6w7m6VzozctX1HRJyTyGhFNTzpoqrYp0wAJVIuCChynKasjQVnFN-SWaQSSV4DhRuSDiOXwBAKYNUyFvyOC_aNvrsD1Fpumhh9vNo1e37tt8e45dD07qm296Ra2_aEcO_DMjH62JdvsfL1VtVFsvYTN8QI3PCGsY4U2iZB8az1EtAaV1t0XPrKSqvlHDG5dZBRmWtLBcGhXfokAfk4bzbIKL-GZqdGY5asFxCmk80PlNjd7ru--9RU9AnA3pT6cmAPhnQEnQ9NNNdQJ7-1-e_ONtamg
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/WI-IAT.2009.60
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Xplore Digital Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore Digital Library
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9781424453313
1424453313
EndPage 348
ExternalDocumentID 5286048
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AARBI
ACM
ADPZR
ALMA_UNASSIGNED_HOLDINGS
APO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
GUFHI
IERZE
OCL
RIB
RIC
RIE
RIL
AAWTH
LHSKQ
ID FETCH-LOGICAL-a2000-e2d5ca22329ec2f02374f60e6cdbcef3cf1e9f995dad8cd0716b9c35ae5fdede3
IEDL.DBID RIE
ISBN 0769538010
9780769538013
IngestDate Wed Aug 27 01:35:30 EDT 2025
Wed Jan 31 06:43:04 EST 2024
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Keywords Wikipedia
Ontology
Cyc
Web Mining
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a2000-e2d5ca22329ec2f02374f60e6cdbcef3cf1e9f995dad8cd0716b9c35ae5fdede3
OpenAccessLink https://hdl.handle.net/10289/2752
PageCount 8
ParticipantIDs acm_books_10_1109_WI_IAT_2009_60
acm_books_10_1109_WI_IAT_2009_60_brief
ieee_primary_5286048
PublicationCentury 2000
PublicationDate 20090915
2009-Sept.
PublicationDateYYYYMMDD 2009-09-15
2009-09-01
PublicationDate_xml – month: 09
  year: 2009
  text: 20090915
  day: 15
PublicationDecade 2000
PublicationPlace Washington, DC, USA
PublicationPlace_xml – name: Washington, DC, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
PublicationTitleAbbrev WIIAT
PublicationYear 2009
Publisher IEEE Computer Society
IEEE
Publisher_xml – name: IEEE Computer Society
– name: IEEE
SSID ssj0001120456
Score 1.5165058
Snippet In order to achieve genuine web intelligence, building some kind of large general machine-readable conceptual scheme (i.e. ontology) seems inescapable. Yet the...
SourceID ieee
acm
SourceType Publisher
StartPage 341
SubjectTerms Computing methodologies -- Artificial intelligence -- Knowledge representation and reasoning
Computing methodologies -- Artificial intelligence -- Knowledge representation and reasoning -- Semantic networks
Computing methodologies -- Machine learning -- Machine learning approaches -- Rule learning
Conferences
Cyc
Hafnium
Helium
Information systems -- Information retrieval
Information systems -- Information retrieval -- Evaluation of retrieval results
Information systems -- Information systems applications -- Data mining
Intelligent agent
Ontologies
Ontology
Web Mining
Wikipedia
Subtitle Feeding Wikipedia to Cyc
Title "All You Can Eat" Ontology-Building
URI https://ieeexplore.ieee.org/document/5286048
Volume 1
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEB7anjz5qlhfBBFPpu6jGxvBQy0trVD1oNTbkscEirotdXvQX2-S3VoUQW-7S1jCMMPMJPN9H8CJFMzTRtFAoqGtFje0bdMAVcjaQmrXFjm88-iWDR5bN0_JUwXOvrAwiOiHz7DpHv1dvp6qhTsqO0-iNrMeV4WqdbMCq7U6TwkdsTorOnNuw9g2GiXBzvI9Lkkbw4Cfj4d02Hko-CodPWVVqNdvAis-v_TXYbTcWTFW8txc5LKpPn6QNv536xtQXyH5yP1XjtqECmZbsL6UciBlZG_D1XHn5YXY0CddkZGeyI_JXealbd_pdamcfUn6xW_IePI8mTnICcmnpPuu6vDY7z10B7RUVqDCQ8kx0okStjKIOKrI2Lx90TIsQKa0VGhiZULkhvNEC6duZMsQJrmKE4GJ0agx3oFaNs1wFwgXkRGRCqVpO3KwUNoaTjHNDRPmItZxA4g1bepahrfUdxwBT8fD1FrfSWDylAUNOP1rSSrnEzQN2HaWTWcFEUdaGnXv98_7sFZc_biBsAOo5fMFHtoKIpdH3nU-AXqgvcI
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEB60HvTkG-szFPHk1n2mjeBBi6VVqx4q9RbymECpbkW3B_31JrvbiiLobbOEEIYMM5PM930Ah1LQnDbK8yUaL46Z8Zo2DHgKaVNI7coih3fu3dLOQ3z1mDzOwfEMC4OIefMZ1t1n_pavx2rirspOkrBJ7YmbhwUb9-OkQGt93agEjlqdFrU5s45sS42SYmc6jkraxsBnJ4Ou1z3vF4yVjqByXqjnbxIreYRpL0NvureisWRUn2Syrj5-0Db-d_MrsPGF5SP3syi1CnOYrsHyVMyBlL69Dme186cnYp2ftERKLkVWI3dpLm777l2U2tmnpF0sQwbD0fDFgU5INiatd7UBD-3LfqvjldoKnsjB5BjqRAmbG4QMVWhs5G7EhvpIlZYKTaRMgMwwlmjh9I1sIkIlU1EiMDEaNUabUEnHKW4BYSI0IlSBNE1HDxZIm8UpqpmhwjQiHVWBWNNyVzS88bzm8BkfdLm1vhPBZJz6VTj6awqXr0M0VVh3luUvBRUHL426_fvvA1js9Hs3_KZ7e70DS8VDkGsP24VK9jrBPZtPZHI_P0afnmrBDw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+2009+IEEE%2FWIC%2FACM+International+Joint+Conference+on+Web+Intelligence+and+Intelligent+Agent+Technology+-+Volume+01&rft.atitle=%22All+You+Can+Eat%22+Ontology-Building&rft.au=Sarjant%2C+Samuel&rft.au=Legg%2C+Catherine&rft.au=Robinson%2C+Michael&rft.au=Medelyan%2C+Olena&rft.series=ACM+Conferences&rft.date=2009-09-15&rft.pub=IEEE+Computer+Society&rft.isbn=0769538010&rft.spage=341&rft.epage=348&rft_id=info:doi/10.1109%2FWI-IAT.2009.60
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780769538013/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780769538013/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780769538013/sc.gif&client=summon&freeimage=true