利用MapReduce平台实现高效并行的频繁子图挖掘

TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent subgraph)。该算法基于Apriori思想,在扩展边生成新的子图时,使用已经挖掘出的k-1阶的频繁子图生成k阶的频繁子图。同时,检查是否存在待扩展生成的子图,设定生成的频繁子图表示规则,保证了频繁子图信息的唯一性。较同类算法相比,该算法在挖掘频繁子图时更具通用性,并且在扩展边时避免产生大量的复制图,从而使得算法的正确性得以保证,且运行效率显著提高。...

Full description

Saved in:
Bibliographic Details
Published in计算机科学与探索 no. 7; pp. 790 - 801
Main Authors 孙鹤立, 陈强, 刘玮, 黄健斌, 邹建华
Format Journal Article
LanguageChinese
Published 南京大学 计算机软件新技术国家重点实验室,南京 210023 2014
西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071
Subjects
Online AccessGet full text

Cover

Loading…
Abstract TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent subgraph)。该算法基于Apriori思想,在扩展边生成新的子图时,使用已经挖掘出的k-1阶的频繁子图生成k阶的频繁子图。同时,检查是否存在待扩展生成的子图,设定生成的频繁子图表示规则,保证了频繁子图信息的唯一性。较同类算法相比,该算法在挖掘频繁子图时更具通用性,并且在扩展边时避免产生大量的复制图,从而使得算法的正确性得以保证,且运行效率显著提高。
AbstractList TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent subgraph)。该算法基于Apriori思想,在扩展边生成新的子图时,使用已经挖掘出的k-1阶的频繁子图生成k阶的频繁子图。同时,检查是否存在待扩展生成的子图,设定生成的频繁子图表示规则,保证了频繁子图信息的唯一性。较同类算法相比,该算法在挖掘频繁子图时更具通用性,并且在扩展边时避免产生大量的复制图,从而使得算法的正确性得以保证,且运行效率显著提高。
Abstract_FL Frequent subgraph mining is an important problem in data mining domain and has been used widely. This paper proposes an efficient algorithm Cloud-GFSG (cloud-global frequent subgraph), by using MapReduce on Hadoop platform for mining frequent subgraphs. The algorithm is based on the principle of Apriori. It uses the discovered frequent subgraphs whose support is k-1 to generate the candidate frequent subgraphs whose support is k when it gener-ates new subgraphs by extending edge. Meanwhile, it checks whether there exists any subgraph which would be gener-ated and sets the frequent subgraph generation rules to ensure the uniqueness of the frequent subgraphs. Compared with the state-of-the-art algorithms, the proposed algorithm has more general function and can avoid generating replicate graphs while extending a new edge. Therefore, its correctness can be ensured and the efficiency had been improved significantly.
Author 陈强
孙鹤立
刘玮
黄健斌
邹建华
AuthorAffiliation 西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071; 南京大学 计算机软件新技术国家重点实验室,南京 210023
AuthorAffiliation_xml – name: 西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071; 南京大学 计算机软件新技术国家重点实验室,南京 210023
Author_FL CHEN Qiang
LIU Wei
ZOU Jianhua
HUANG Jianbin
SUN Heli
Author_FL_xml – sequence: 1
  fullname: SUN Heli
– sequence: 2
  fullname: CHEN Qiang
– sequence: 3
  fullname: LIU Wei
– sequence: 4
  fullname: HUANG Jianbin
– sequence: 5
  fullname: ZOU Jianhua
Author_xml – sequence: 1
  fullname: 孙鹤立
– sequence: 2
  fullname: 陈强
– sequence: 3
  fullname: 刘玮
– sequence: 4
  fullname: 黄健斌
– sequence: 5
  fullname: 邹建华
BookMark eNo9jz1LAzEAhjNUsNb-B1eHO5NcLrkDFyl-QUWQ7iWf0lNSMRZ1FBw6lKqgUhQEnRRRCoJwi_8md_ZfWFCcXniG5-GdAxXbtRqABQTDiLFkKQs7ztkQURYFKUFJiAiMIGYVUP1ns6DuXEfAmBCMGE2qYNn3X8rr5y1-sKNVT2qff_iLsX9_KIfjyeuouOn7_PP7cVDenU-ersr8zL9d-vuvYnBbDEfzYMbwfafrf1sDrbXVVmMjaG6vbzZWmoGkMA64FJrGhgmVGKUVglKIVCmTQoEJYUwqyhVRhnGDcMolw8oQgzESMeUyIVENLP5qj7k13O62s27v0E6D7cxleyenRw7D6VkGYRz9AORJYlY
ClassificationCodes TP311
ContentType Journal Article
Copyright Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
Copyright_xml – notice: Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
DBID 2B.
4A8
92I
93N
PSX
TCJ
DOI 10.3778/j.issn.1673-9418.1403027
DatabaseName Wanfang Data Journals - Hong Kong
WANFANG Data Centre
Wanfang Data Journals
万方数据期刊 - 香港版
China Online Journals (COJ)
China Online Journals (COJ)
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
DocumentTitle_FL Using MapReduce Platform to Achieve Efficient Parallel Mining of Frequent Subgraphs
EndPage 801
ExternalDocumentID jsjkxyts201407005
GrantInformation_xml – fundername: The National Natural Science Foundation of China under Grant Nos.61173093,61202182; the Natural Science Foundation of Shaanxi Province of China under Grant Nos.2013JM8019,2014JQ8359; the Postdoctoral Science Foundation of China under Grant No.2012M521776; the Fundamental Research Funds for the Central Universities of China under Grant Nos. K50510230001, BDY10
GroupedDBID 2B.
4A8
92I
93N
ALMA_UNASSIGNED_HOLDINGS
M~E
PSX
TCJ
ID FETCH-LOGICAL-c605-acbe65f7bd8fded10cbb9ddf90b24477cd6ad4df7af129ac72df4f221b56ac843
ISSN 1673-9418
IngestDate Thu May 29 04:00:16 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 7
Keywords frequent subgraph mining
Hadoop平台
Hadoop platform
频繁子图挖掘
MapReduce
Language Chinese
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c605-acbe65f7bd8fded10cbb9ddf90b24477cd6ad4df7af129ac72df4f221b56ac843
PageCount 12
ParticipantIDs wanfang_journals_jsjkxyts201407005
PublicationCentury 2000
PublicationDate 2014
PublicationDateYYYYMMDD 2014-01-01
PublicationDate_xml – year: 2014
  text: 2014
PublicationDecade 2010
PublicationTitle 计算机科学与探索
PublicationTitle_FL Journal of Frontiers of Computer Science & Technology
PublicationYear 2014
Publisher 南京大学 计算机软件新技术国家重点实验室,南京 210023
西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071
Publisher_xml – name: 南京大学 计算机软件新技术国家重点实验室,南京 210023
– name: 西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071
SSID ssib054421768
ssib002040941
ssib002423894
ssib051375751
ssib023646573
ssib036438069
ssib002040926
Score 1.9411446
Snippet TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent...
SourceID wanfang
SourceType Aggregation Database
StartPage 790
Title 利用MapReduce平台实现高效并行的频繁子图挖掘
URI https://d.wanfangdata.com.cn/periodical/jsjkxyts201407005
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Na9RAFA-1XryIouI3Is6ppCbZyXyAl2SbpQj1IBV6K5lMolRYxd2C9iAIHnooVUGlKAh6UkQpCIVe_G_Stf-F771kd9P6QfUSJi9vfjO_98i-N7MzE8e5IlMjvMJL3dC3xuU8MK4xuXW1TqVOea4traqcuyFmb_HrC-HCxKGtxqql5b6ZzlZ-u6_kf7wKMvAr7pL9B8-OQEEAZfAvXMHDcD2Qj1kSMqVYpFkimeYsUnPp_Zt4GGuOj2LN4hbpdFjsYSFKmE5QWSUk0SyKmVYsEUwTVF1LsARgfabahBwxxUk5YNpHCegonwBnmCZkHbM4QRyoogUVoC3VzH0JM0FYQMCeSGq3zeKIJJLACTMCBM5ihSAVFDSN7QLH0XLoYfsauwY9ijjBxEzFYxWNz2ti2FKzMoiRO1kjSpp14pgYh8gyCqmbyKw5ReKPJ0dJcYb4cCQTtalrHCkN-Uz9lb1iMSh1qH5M5qcW0UXQboTLUSpl1KmMPVP7E5WBocQeIJUATTF2NZglIjeSBE3UQUNUkr2dngrwoNxWI0wJ2XI1H0eu5eF66yoMyeoTrMOMppou2h8sW1IqCpYIOT2CnMYTHL3qwIZ9R5Ev9ZbuPnzU76GNIVTg0cGHAxieYUCce5yM0z6IDLo5bMV7vmf_NOTJoziA3zAQ4TiNhtuW8sQozQ79lsS_B0f38JPiy2qX67DX1SI9pHT1T4RoW163SLu3Gxnk_DHnaD30uxRV7_FxZ2LlzgnnWrn6afDy4-i9Lbe_lc82y6_vBuubu583dl6tlttbP96vDd483f3wYrD9pPzyvHz7fWft9c76xklnvpPMt2fd-osmbia80E0zk4uwkMaqwubW9zJjtLWF9gxk2VJmVqSW20KmBaThaSYDW_AiCHwTijRTvHXKmeze6-angWxuMxhLWAt1uW-EEX4GQyvBLR7QZ7IzzuWa7GL9g9Vb_MV9Zw-idM45guVq2vG8M9l_sJxfgES8by6S138CC6ej9w
linkProvider ISSN International Centre
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%E5%88%A9%E7%94%A8MapReduce%E5%B9%B3%E5%8F%B0%E5%AE%9E%E7%8E%B0%E9%AB%98%E6%95%88%E5%B9%B6%E8%A1%8C%E7%9A%84%E9%A2%91%E7%B9%81%E5%AD%90%E5%9B%BE%E6%8C%96%E6%8E%98&rft.jtitle=%E8%AE%A1%E7%AE%97%E6%9C%BA%E7%A7%91%E5%AD%A6%E4%B8%8E%E6%8E%A2%E7%B4%A2&rft.au=%E5%AD%99%E9%B9%A4%E7%AB%8B&rft.au=%E9%99%88%E5%BC%BA&rft.au=%E5%88%98%E7%8E%AE&rft.au=%E9%BB%84%E5%81%A5%E6%96%8C&rft.date=2014&rft.pub=%E5%8D%97%E4%BA%AC%E5%A4%A7%E5%AD%A6+%E8%AE%A1%E7%AE%97%E6%9C%BA%E8%BD%AF%E4%BB%B6%E6%96%B0%E6%8A%80%E6%9C%AF%E5%9B%BD%E5%AE%B6%E9%87%8D%E7%82%B9%E5%AE%9E%E9%AA%8C%E5%AE%A4%EF%BC%8C%E5%8D%97%E4%BA%AC+210023&rft.issn=1673-9418&rft.issue=7&rft.spage=790&rft.epage=801&rft_id=info:doi/10.3778%2Fj.issn.1673-9418.1403027&rft.externalDocID=jsjkxyts201407005
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fwww.wanfangdata.com.cn%2Fimages%2FPeriodicalImages%2Fjsjkxyts%2Fjsjkxyts.jpg