Cross-partition clustering: revealing corresponding themes across related datasets

This article studies the task of discovering correspondences across related domains based on real-world data collections. We address this task through a designated extension of distributional data-clustering methods. The method is empirically demonstrated on synthetic data as well as on texts addres...

Full description

Saved in:
Bibliographic Details
Published inJournal of experimental & theoretical artificial intelligence Vol. 23; no. 2; pp. 153 - 180
Main Authors Marx, Zvika, Dagan, Ido, Shamir, Eli
Format Journal Article
LanguageEnglish
Published Abingdon Taylor & Francis Group 01.06.2011
Taylor & Francis Ltd
Subjects
Online AccessGet full text

Cover

Loading…
Abstract This article studies the task of discovering correspondences across related domains based on real-world data collections. We address this task through a designated extension of distributional data-clustering methods. The method is empirically demonstrated on synthetic data as well as on texts addressing different religions, where the goal is to identify commonalities shared by all religions. This article generalises and demonstrates the empirical improvement relative to our previous studies on this subject, as well as to other comparable methods.
AbstractList This article studies the task of discovering correspondences across related domains based on real-world data collections. We address this task through a designated extension of distributional data-clustering methods. The method is empirically demonstrated on synthetic data as well as on texts addressing different religions, where the goal is to identify commonalities shared by all religions. This article generalises and demonstrates the empirical improvement relative to our previous studies on this subject, as well as to other comparable methods.
This article studies the task of discovering correspondences across related domains based on real-world data collections. We address this task through a designated extension of distributional data-clustering methods. The method is empirically demonstrated on synthetic data as well as on texts addressing different religions, where the goal is to identify commonalities shared by all religions. This article generalises and demonstrates the empirical improvement relative to our previous studies on this subject, as well as to other comparable methods. [PUBLICATION ABSTRACT]
Author Shamir, Eli
Marx, Zvika
Dagan, Ido
Author_xml – sequence: 1
  givenname: Zvika
  surname: Marx
  fullname: Marx, Zvika
  email: marxzv@gmail.com
  organization: eBay Inc
– sequence: 2
  givenname: Ido
  surname: Dagan
  fullname: Dagan, Ido
  organization: Department of Computer Science , Bar-Ilan University
– sequence: 3
  givenname: Eli
  surname: Shamir
  fullname: Shamir, Eli
  organization: School of Computer Science and Engineering, The Hebrew University of Jerusalem
BookMark eNp9kM1OxCAUhYkZE8fRN3DRuHHVEWingBtjJv4lk5gYTdwRClQ7oVCBaubtpVY3LtwAl3zn5NxzCGbWWQ3ACYJLBCk8h2yFKSpelhimr5JBVsE9MEdFhfMCEjYD8xHJR-YAHIawhRCiFUJz8Lj2LoS8Fz62sXU2k2YIUfvWvl5kXn9oYdIzk857HXpn1TjFN93pkAk5ahNlRNQqUyKKoGM4AvuNMEEf_9wL8Hxz_bS-yzcPt_frq00uC7SKOUF1SoAZLkhFlUIUEdpIJTCmmuKa1qykitFGqLSQkLBW6SSoVFIygqUuFuBs8u29ex90iLxrg9TGCKvdEDilrIQEVSSRp3_IrRu8TeE4TY4pBa4SVE7Q91ZeN7z3bSf8jiPIx5r5b818rJlPNSfZ5SRrbeN8Jz6dN4pHsTPON15Y2QZe_OvwBdVthuY
Cites_doi 10.1080/095281398146842
10.1016/S0031-3203(99)00076-X
10.1207/s15516709cog0702_3
10.3115/1072228.1072372
10.1111/j.0963-7214.2005.00350.x
10.1017/CBO9780511809071
10.1162/coli.2006.32.3.379
10.1089/106652799318274
10.3115/1118853.1118862
10.1109/ICDM.2004.10104
10.1007/s10994-005-0913-1
10.1016/0004-3702(89)90077-5
10.1017/S1351324902002838
10.1002/0471200611
10.1109/PROC.1982.12425
10.3115/981574.981598
ContentType Journal Article
Copyright Copyright Taylor & Francis Group, LLC 2011
Copyright Taylor & Francis Ltd. 2011
Copyright_xml – notice: Copyright Taylor & Francis Group, LLC 2011
– notice: Copyright Taylor & Francis Ltd. 2011
DBID AAYXX
CITATION
JQ2
7SC
8FD
F28
FR3
L7M
L~C
L~D
DOI 10.1080/0952813X.2010.490960
DatabaseName CrossRef
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Technology Research Database
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
ProQuest Computer Science Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts
Engineering Research Database
Advanced Technologies Database with Aerospace
ANTE: Abstracts in New Technology & Engineering
Computer and Information Systems Abstracts Professional
DatabaseTitleList
ProQuest Computer Science Collection
Technology Research Database
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
Religion
EISSN 1362-3079
EndPage 180
ExternalDocumentID 2373044691
10_1080_0952813X_2010_490960
490960
Genre Feature
GroupedDBID .4S
.7F
.DC
.QJ
0BK
0R~
29K
2DF
30N
4.4
5GY
5VS
8VB
AAENE
AAJMT
AALDU
AAMIU
AAPUL
AAQRR
ABCCY
ABDBF
ABFIM
ABHAV
ABIVO
ABJNI
ABLIJ
ABPAQ
ABPEM
ABTAI
ABXUL
ABXYU
ACGEJ
ACGFS
ACGOD
ACTIO
ACUHS
ADCVX
ADGTB
ADUMR
ADXPE
AEGXH
AEISY
AEMOZ
AENEX
AEOZL
AEPSL
AEYOC
AFKVX
AGDLA
AGMYJ
AHDZW
AHQJS
AIJEM
AJWEG
AKBVH
AKOOK
AKVCP
ALMA_UNASSIGNED_HOLDINGS
ALQZU
AQRUH
ARCSS
AVBZW
AWYRJ
BLEHA
CAG
CCCUG
COF
CS3
D-I
DGEBU
DKSSO
EAP
EBR
EBS
EBU
ECS
EDO
EJD
EMK
EPL
EST
ESX
E~A
E~B
F5P
GTTXZ
H13
HF~
HZ~
H~P
I-F
IPNFZ
J.P
K1G
KYCEM
M4Z
MK~
NA5
NX~
O9-
P2P
PQQKQ
QWB
RIG
RNANH
ROSJB
RTWRZ
S-T
SNACF
TBQAZ
TDBHL
TEN
TFL
TFT
TFW
TH9
TNC
TTHFI
TUROJ
TUS
TWF
UT5
UU3
ZGOLN
ZL0
~S~
AAGDL
AAHIA
AAYXX
ADMLS
ADYSH
AFRVT
AIYEW
AMPGV
CITATION
JQ2
TASJS
7SC
8FD
F28
FR3
L7M
L~C
L~D
ID FETCH-LOGICAL-c315t-71b5112923768dd18178fcda228e82b8b948d98fad490ac0bd0ac714dcc972ce3
ISSN 0952-813X
IngestDate Fri Jul 11 04:10:15 EDT 2025
Fri Jul 25 08:02:16 EDT 2025
Tue Jul 01 03:12:34 EDT 2025
Wed Dec 25 09:00:10 EST 2024
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c315t-71b5112923768dd18178fcda228e82b8b948d98fad490ac0bd0ac714dcc972ce3
Notes SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
PQID 871411226
PQPubID 53008
PageCount 28
ParticipantIDs proquest_miscellaneous_889407167
informaworld_taylorfrancis_310_1080_0952813X_2010_490960
proquest_journals_871411226
crossref_primary_10_1080_0952813X_2010_490960
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2011-06-00
PublicationDateYYYYMMDD 2011-06-01
PublicationDate_xml – month: 06
  year: 2011
  text: 2011-06-00
PublicationDecade 2010
PublicationPlace Abingdon
PublicationPlace_xml – name: Abingdon
PublicationTitle Journal of experimental & theoretical artificial intelligence
PublicationYear 2011
Publisher Taylor & Francis Group
Taylor & Francis Ltd
Publisher_xml – name: Taylor & Francis Group
– name: Taylor & Francis Ltd
References Marx Z (CIT0019) 2004
CIT0010
CIT0011
Cover TM (CIT0005) 1991
Hofstadter D (CIT0012) 1995
Marx Z (CIT0020) 2002; 3
Tiwari KN (CIT0026) 1992
CIT0014
CIT0013
CIT0016
Chalmers D (CIT0003) 1995
CIT0018
CIT0017
Li H (CIT0015) 2002; 8
CIT0021
CIT0001
CIT0023
Turney PD (CIT0028) 2005; 60
CIT0022
Chechik G (CIT0004) 2003
Gedeon T (CIT0009) 2003; 10
CIT0025
CIT0002
CIT0027
CIT0007
Smart N (CIT0024) 1996
CIT0029
CIT0006
CIT0008
References_xml – ident: CIT0008
  doi: 10.1080/095281398146842
– ident: CIT0023
  doi: 10.1016/S0031-3203(99)00076-X
– volume-title: Dimensions of the Sacred: An Anatomy of the Worlds Beliefs
  year: 1996
  ident: CIT0024
– ident: CIT0021
– ident: CIT0010
  doi: 10.1207/s15516709cog0702_3
– ident: CIT0016
  doi: 10.3115/1072228.1072372
– ident: CIT0029
– ident: CIT0025
– ident: CIT0013
  doi: 10.1111/j.0963-7214.2005.00350.x
– ident: CIT0017
  doi: 10.1017/CBO9780511809071
– ident: CIT0027
  doi: 10.1162/coli.2006.32.3.379
– ident: CIT0002
  doi: 10.1089/106652799318274
– start-page: 857
  volume-title: Advances in Neural Processing Information Systems 15 (NIPS 2002)
  year: 2003
  ident: CIT0004
– ident: CIT0006
  doi: 10.3115/1118853.1118862
– ident: CIT0018
– volume-title: Comparative Religion,
  year: 1992
  ident: CIT0026
– ident: CIT0011
  doi: 10.1109/ICDM.2004.10104
– volume: 60
  start-page: 251
  year: 2005
  ident: CIT0028
  publication-title: Machine Learning
  doi: 10.1007/s10994-005-0913-1
– start-page: 205
  volume-title: Fluid Concepts and Creative Analogies
  year: 1995
  ident: CIT0012
– ident: CIT0007
  doi: 10.1016/0004-3702(89)90077-5
– ident: CIT0001
– volume: 8
  start-page: 25
  year: 2002
  ident: CIT0015
  publication-title: Natural Language Engineering
  doi: 10.1017/S1351324902002838
– start-page: 169
  volume-title: Fluid Concepts and Creative Analogies
  year: 1995
  ident: CIT0003
– start-page: 489
  volume-title: Advances in Neural Information Processing Systems 16 (NIPS 2003)
  year: 2004
  ident: CIT0019
– volume: 10
  start-page: 33
  year: 2003
  ident: CIT0009
  publication-title: Canadian Applied Mathematics Quarterly
– volume-title: Elements of Information Theory,
  year: 1991
  ident: CIT0005
  doi: 10.1002/0471200611
– volume: 3
  start-page: 747
  year: 2002
  ident: CIT0020
  publication-title: Journal of Machine Learning Research
– ident: CIT0014
  doi: 10.1109/PROC.1982.12425
– ident: CIT0022
  doi: 10.3115/981574.981598
SSID ssj0001511
Score 1.8406483
Snippet This article studies the task of discovering correspondences across related domains based on real-world data collections. We address this task through a...
SourceID proquest
crossref
informaworld
SourceType Aggregation Database
Index Database
Publisher
StartPage 153
SubjectTerms analogy
Artificial intelligence
Cluster analysis
Collection
Commonality
data clustering
Empirical analysis
Expert systems
information theory
natural language processing
Objectives
Religion
structure mapping
Tasks
text mining
Texts
Title Cross-partition clustering: revealing corresponding themes across related datasets
URI https://www.tandfonline.com/doi/abs/10.1080/0952813X.2010.490960
https://www.proquest.com/docview/871411226
https://www.proquest.com/docview/889407167
Volume 23
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLdKd-HCN6IMkA_cUFDtOIm927SBChIc0CamXSJ_BaqNtmpTDvsz-It5znPSdKsmxsWqktSx835579l57_cIeSu04p7nPhlXwsACJXOJzrVOtFJacp_mKgsJzl--5pNT8fksOxsM_vSilta1eW-vduaV_I9U4RjINWTJ3kGyXadwAH6DfKEFCUP7TzI-CiYuWYQzjRjt5TrwHsQc5kDOpC8xqTaU4FjMMYMl8LT61TvdGEhMZgGvM4SKrjzyOu3wVrcqAQS49DMgwwAiE8W0R_G52e1eNibu_Pf0orMCx_oH7r1-cvNum-en_jVdxmiz_n4E68VNdRuLoGNZU-UXLAyq1ZCbBdpE9fUu5hlHfPGeEmVIHxztMcNKTzdUfYyNhLuFm2GQnlBhRbYxbe3n_GsWr4tDZC1BauylDL2U2Ms9ssdh6cGHZO9wcnz-vbPv4CMxZHDEebYJmYGxfcdothyeLTrcG-a_8WlOHpEHUbz0EJH1mAz87Al52Bb6oFHvPyXfrgGNboB2QDuY0S2YUYQZRZjRCDPawuwZOf344eRoksRiHIlNWVYnBTONbx6iqKRz4BgWsrJOcy695EYaJaRTstIOpq3t2DhoCyactarg1qfPyXA2n_kXhFphVOYz5rXPxVhLJVTlU-kdy0011mJEkvaJlQvkXClvk9SIyP5jLetmr6vCwjRlevtf91sRlPHFXpUShg1T5fmI0O4saN3wKU3P_HwNl8CgwTnPi5d3HOs-ub95aV6RYb1c-9fg1tbmTcTZX9AtoCw
linkProvider Library Specific Holdings
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lb9QwEB5BOZQL5VHUbQv4wDVt4jiO3RuqWi3Q7gG1Um-WH5MLaHfVZC_99XjspKJFcIBLLokTj8fz8GTmG4CPwmqOXGJRdsLFA0oTCiutLazWVnGspW6owPlyIefX4stNM2UT9mNaJZ2huwwUkXQ1CTcFo6eUuOPoFnBV1Tc5M0tocsOfwrNGy5aaGNTl4l4ZR4NWZbi9KPhxyFQ994e3PLBOD7BLf9PVyQCd74Cbpp7zTr4fbQZ35O8eoTr-F20v4cXonrJPeT-9gie4fA07U-sHNmqCN_DtlOZfrGnjEWuZ_7EhyIVoCE8YoUJZKnNnPvX-WK9S6QwjgFjsmU20s1RFg4FRjmqPQ78L1-dnV6fzYmzPUPi6aoairVzy1iivRoUQXYVWdT5YzhUq7pTTQgWtOhsiFdaXLsRrW4ngvW65x_otbC1XS9wD5oXTDTYVWpSitEoL3WGtMFTSdaUVMygmtph1RuEw1QRuOi6YoQUzecFmoH7lnRlS9KPLrUpM_fehBxOfzSjOvYmnShFJ5XIG7P5ulEP6uWKXuNrER-Kko7sm2_1___YH2J5fXV6Yi8-LrwfwPIevKeBzCFvD7QbfRf9ncO_TDv8JWmb7Lg
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lb9QwEB6VVkJcaCkgllLwgWtK4jiOzQ21XbUFVlXVSnuz_JhcinZXbPbCr8djJ1ULggNcckmceDyelzPzDcB7YTVHLrEoO-FigNKEwkprC6u1VRxrqRsqcP46k2c34mLezO9V8VNaJcXQXQaKSLqahHsVujEj7kP0Criq6nlOzBKavPBHsCMJO5yKOMrZnS6O9qzKaHtR7uOQsXjuD295YJweQJf-pqqT_Znugh1nntNObo82vTvyP34Bdfwf0vbg6eCcsk95Nz2DLVzsw-7Y-IENeuA5XB3T9IsVbTtiLPPfNgS4EM3gR0aYUJaK3JlPnT9Wy1Q4wwgeFtfMJtJZqqHBwChDdY39-gXcTE-vj8-KoTlD4euq6Yu2cslXo6waFUJ0FFrV-WA5V6i4U04LFbTqbIhUWF-6EK9tJYL3uuUe65ewvVgu8BUwL5xusKnQohSlVVroDmuFoZKuK62YQDFyxawyBoepRmjTYcEMLZjJCzYBdZ91pk9nH11uVGLqvw89GNlsBmFemxhTikgqlxNgd3ejFNKvFbvA5SY-EicdnTXZvv73b7-Dx5cnU_PlfPb5AJ7ks2s67XkD2_33DR5G56d3b9P-_glJDPnS
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Cross-partition+clustering%3A+revealing+corresponding+themes+across+related+datasets&rft.jtitle=Journal+of+experimental+%26+theoretical+artificial+intelligence&rft.au=Marx%2C+Zvika&rft.au=Dagan%2C+Ido&rft.au=Shamir%2C+Eli&rft.date=2011-06-01&rft.issn=0952-813X&rft.eissn=1362-3079&rft.volume=23&rft.issue=2&rft.spage=153&rft.epage=180&rft_id=info:doi/10.1080%2F0952813X.2010.490960&rft.externalDBID=n%2Fa&rft.externalDocID=10_1080_0952813X_2010_490960
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0952-813X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0952-813X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0952-813X&client=summon