Similarity Modeling on Heterogeneous Networks via Automatic Path Discovery

Bibliographic Details
Published in Machine Learning and Knowledge Discovery in Databases, Vol. 11052, pp. 37-54
Main Authors Yang, Carl; Liu, Mengxiong; He, Frank; Zhang, Xikun; Peng, Jian; Han, Jiawei
Format Book Chapter
Language English
Published Switzerland: Springer International Publishing AG, 2019
Series Lecture Notes in Computer Science
Abstract Heterogeneous networks are widely used to model real-world semi-structured data. The key challenge in learning over such networks is modeling node similarity with respect to both network structure and node content. To handle network structure, most existing work assumes a given or enumerable set of meta-paths and uses them to compute meta-path-based proximities or network embeddings. However, the expert knowledge needed to specify meta-paths is not always available, and the number of possible meta-paths grows exponentially with the path length considered, which makes path search very costly. Meanwhile, although network nodes often come with rich content, that content has rarely been used to improve similarity modeling. In this work, to model node similarity in content-rich heterogeneous networks, we propose to automatically discover useful paths between pairs of nodes using both structural and content information. To this end, we combine continuous reinforcement learning and deep content embedding in a novel semi-supervised joint learning framework. Specifically, the supervised reinforcement learning component explores useful paths between a small set of example pairs of similar nodes, while the unsupervised deep embedding component captures node content and enables inductive learning on the whole network. The two components are trained jointly in a closed loop so that each improves the other. Extensive experiments on three real-world heterogeneous networks demonstrate the advantages of our algorithm. Code related to this paper is available at: https://github.com/yangji9181/AutoPath.
Author email Yang, Carl: jiyang3@illinois.edu
Copyright Springer Nature Switzerland AG 2019
DOI 10.1007/978-3-030-10928-8_3
Discipline Computer Science
EISBN 9783030109288, 3030109283
EISSN 1611-3349
Editors Berlingerio, Michele; Ifrim, Georgiana; Gärtner, Thomas; Bonchi, Francesco; Hurley, Neil
ISBN 3030109275, 9783030109271
ISSN 0302-9743
Notes Electronic supplementary material: The online version of this chapter (https://doi.org/10.1007/978-3-030-10928-8_3) contains supplementary material, which is available to authorized users.
OCLC 1084467560
PublicationSeriesSubtitle Lecture Notes in Artificial Intelligence
PublicationSubtitle European Conference, ECML PKDD 2018, Dublin, Ireland, September 10-14, 2018, Proceedings, Part II
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Steffen, Bernhard
Pandu Rangan, C.
Kanade, Takeo
Kittler, Josef
Hutchison, David
Tygar, Doug
Subjects Deep embedding; Heterogeneous networks; Similarity modeling