利用MapReduce平台实现高效并行的频繁子图挖掘
TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent subgraph)。该算法基于Apriori思想,在扩展边生成新的子图时,使用已经挖掘出的k-1阶的频繁子图生成k阶的频繁子图。同时,检查是否存在待扩展生成的子图,设定生成的频繁子图表示规则,保证了频繁子图信息的唯一性。较同类算法相比,该算法在挖掘频繁子图时更具通用性,并且在扩展边时避免产生大量的复制图,从而使得算法的正确性得以保证,且运行效率显著提高。...
Saved in:
Published in | 计算机科学与探索 no. 7; pp. 790 - 801 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | Chinese |
Published |
南京大学 计算机软件新技术国家重点实验室,南京 210023
2014
西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071 |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent subgraph)。该算法基于Apriori思想,在扩展边生成新的子图时,使用已经挖掘出的k-1阶的频繁子图生成k阶的频繁子图。同时,检查是否存在待扩展生成的子图,设定生成的频繁子图表示规则,保证了频繁子图信息的唯一性。较同类算法相比,该算法在挖掘频繁子图时更具通用性,并且在扩展边时避免产生大量的复制图,从而使得算法的正确性得以保证,且运行效率显著提高。 |
---|---|
AbstractList | TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent subgraph)。该算法基于Apriori思想,在扩展边生成新的子图时,使用已经挖掘出的k-1阶的频繁子图生成k阶的频繁子图。同时,检查是否存在待扩展生成的子图,设定生成的频繁子图表示规则,保证了频繁子图信息的唯一性。较同类算法相比,该算法在挖掘频繁子图时更具通用性,并且在扩展边时避免产生大量的复制图,从而使得算法的正确性得以保证,且运行效率显著提高。 |
Abstract_FL | Frequent subgraph mining is an important problem in data mining domain and has been used widely. This paper proposes an efficient algorithm Cloud-GFSG (cloud-global frequent subgraph), by using MapReduce on Hadoop platform for mining frequent subgraphs. The algorithm is based on the principle of Apriori. It uses the discovered frequent subgraphs whose support is k-1 to generate the candidate frequent subgraphs whose support is k when it gener-ates new subgraphs by extending edge. Meanwhile, it checks whether there exists any subgraph which would be gener-ated and sets the frequent subgraph generation rules to ensure the uniqueness of the frequent subgraphs. Compared with the state-of-the-art algorithms, the proposed algorithm has more general function and can avoid generating replicate graphs while extending a new edge. Therefore, its correctness can be ensured and the efficiency had been improved significantly. |
Author | 陈强 孙鹤立 刘玮 黄健斌 邹建华 |
AuthorAffiliation | 西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071; 南京大学 计算机软件新技术国家重点实验室,南京 210023 |
AuthorAffiliation_xml | – name: 西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071; 南京大学 计算机软件新技术国家重点实验室,南京 210023 |
Author_FL | CHEN Qiang LIU Wei ZOU Jianhua HUANG Jianbin SUN Heli |
Author_FL_xml | – sequence: 1 fullname: SUN Heli – sequence: 2 fullname: CHEN Qiang – sequence: 3 fullname: LIU Wei – sequence: 4 fullname: HUANG Jianbin – sequence: 5 fullname: ZOU Jianhua |
Author_xml | – sequence: 1 fullname: 孙鹤立 – sequence: 2 fullname: 陈强 – sequence: 3 fullname: 刘玮 – sequence: 4 fullname: 黄健斌 – sequence: 5 fullname: 邹建华 |
BookMark | eNo9jz1LAzEAhjNUsNb-B1eHO5NcLrkDFyl-QUWQ7iWf0lNSMRZ1FBw6lKqgUhQEnRRRCoJwi_8md_ZfWFCcXniG5-GdAxXbtRqABQTDiLFkKQs7ztkQURYFKUFJiAiMIGYVUP1ns6DuXEfAmBCMGE2qYNn3X8rr5y1-sKNVT2qff_iLsX9_KIfjyeuouOn7_PP7cVDenU-ersr8zL9d-vuvYnBbDEfzYMbwfafrf1sDrbXVVmMjaG6vbzZWmoGkMA64FJrGhgmVGKUVglKIVCmTQoEJYUwqyhVRhnGDcMolw8oQgzESMeUyIVENLP5qj7k13O62s27v0E6D7cxleyenRw7D6VkGYRz9AORJYlY |
ClassificationCodes | TP311 |
ContentType | Journal Article |
Copyright | Copyright © Wanfang Data Co. Ltd. All Rights Reserved. |
Copyright_xml | – notice: Copyright © Wanfang Data Co. Ltd. All Rights Reserved. |
DBID | 2B. 4A8 92I 93N PSX TCJ |
DOI | 10.3778/j.issn.1673-9418.1403027 |
DatabaseName | Wanfang Data Journals - Hong Kong WANFANG Data Centre Wanfang Data Journals 万方数据期刊 - 香港版 China Online Journals (COJ) China Online Journals (COJ) |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
DocumentTitle_FL | Using MapReduce Platform to Achieve Efficient Parallel Mining of Frequent Subgraphs |
EndPage | 801 |
ExternalDocumentID | jsjkxyts201407005 |
GrantInformation_xml | – fundername: The National Natural Science Foundation of China under Grant Nos.61173093,61202182; the Natural Science Foundation of Shaanxi Province of China under Grant Nos.2013JM8019,2014JQ8359; the Postdoctoral Science Foundation of China under Grant No.2012M521776; the Fundamental Research Funds for the Central Universities of China under Grant Nos. K50510230001, BDY10 |
GroupedDBID | 2B. 4A8 92I 93N ALMA_UNASSIGNED_HOLDINGS M~E PSX TCJ |
ID | FETCH-LOGICAL-c605-acbe65f7bd8fded10cbb9ddf90b24477cd6ad4df7af129ac72df4f221b56ac843 |
ISSN | 1673-9418 |
IngestDate | Thu May 29 04:00:16 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 7 |
Keywords | frequent subgraph mining Hadoop平台 Hadoop platform 频繁子图挖掘 MapReduce |
Language | Chinese |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c605-acbe65f7bd8fded10cbb9ddf90b24477cd6ad4df7af129ac72df4f221b56ac843 |
PageCount | 12 |
ParticipantIDs | wanfang_journals_jsjkxyts201407005 |
PublicationCentury | 2000 |
PublicationDate | 2014 |
PublicationDateYYYYMMDD | 2014-01-01 |
PublicationDate_xml | – year: 2014 text: 2014 |
PublicationDecade | 2010 |
PublicationTitle | 计算机科学与探索 |
PublicationTitle_FL | Journal of Frontiers of Computer Science & Technology |
PublicationYear | 2014 |
Publisher | 南京大学 计算机软件新技术国家重点实验室,南京 210023 西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071 |
Publisher_xml | – name: 南京大学 计算机软件新技术国家重点实验室,南京 210023 – name: 西安交通大学 电子与信息工程学院,西安,710049%西安电子科技大学 软件学院,西安,710071%北京邮电大学 信息与通信工程学院,北京,100876%西安电子科技大学 软件学院,西安 710071 |
SSID | ssib054421768 ssib002040941 ssib002423894 ssib051375751 ssib023646573 ssib036438069 ssib002040926 |
Score | 1.9411446 |
Snippet | TP311; 频繁子图挖掘是数据挖掘领域的一个重要问题,并且有着广泛的应用。在Hadoop平台上实现了一种基于MapReduce的高效频繁子图挖掘算法Cloud-GFSG(cloud-global frequent... |
SourceID | wanfang |
SourceType | Aggregation Database |
StartPage | 790 |
Title | 利用MapReduce平台实现高效并行的频繁子图挖掘 |
URI | https://d.wanfangdata.com.cn/periodical/jsjkxyts201407005 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Na9RAFA-1XryIouI3Is6ppCbZyXyAl2SbpQj1IBV6K5lMolRYxd2C9iAIHnooVUGlKAh6UkQpCIVe_G_Stf-F771kd9P6QfUSJi9vfjO_98i-N7MzE8e5IlMjvMJL3dC3xuU8MK4xuXW1TqVOea4traqcuyFmb_HrC-HCxKGtxqql5b6ZzlZ-u6_kf7wKMvAr7pL9B8-OQEEAZfAvXMHDcD2Qj1kSMqVYpFkimeYsUnPp_Zt4GGuOj2LN4hbpdFjsYSFKmE5QWSUk0SyKmVYsEUwTVF1LsARgfabahBwxxUk5YNpHCegonwBnmCZkHbM4QRyoogUVoC3VzH0JM0FYQMCeSGq3zeKIJJLACTMCBM5ihSAVFDSN7QLH0XLoYfsauwY9ijjBxEzFYxWNz2ti2FKzMoiRO1kjSpp14pgYh8gyCqmbyKw5ReKPJ0dJcYb4cCQTtalrHCkN-Uz9lb1iMSh1qH5M5qcW0UXQboTLUSpl1KmMPVP7E5WBocQeIJUATTF2NZglIjeSBE3UQUNUkr2dngrwoNxWI0wJ2XI1H0eu5eF66yoMyeoTrMOMppou2h8sW1IqCpYIOT2CnMYTHL3qwIZ9R5Ev9ZbuPnzU76GNIVTg0cGHAxieYUCce5yM0z6IDLo5bMV7vmf_NOTJoziA3zAQ4TiNhtuW8sQozQ79lsS_B0f38JPiy2qX67DX1SI9pHT1T4RoW163SLu3Gxnk_DHnaD30uxRV7_FxZ2LlzgnnWrn6afDy4-i9Lbe_lc82y6_vBuubu583dl6tlttbP96vDd483f3wYrD9pPzyvHz7fWft9c76xklnvpPMt2fd-osmbia80E0zk4uwkMaqwubW9zJjtLWF9gxk2VJmVqSW20KmBaThaSYDW_AiCHwTijRTvHXKmeze6-angWxuMxhLWAt1uW-EEX4GQyvBLR7QZ7IzzuWa7GL9g9Vb_MV9Zw-idM45guVq2vG8M9l_sJxfgES8by6S138CC6ej9w |
linkProvider | ISSN International Centre |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%E5%88%A9%E7%94%A8MapReduce%E5%B9%B3%E5%8F%B0%E5%AE%9E%E7%8E%B0%E9%AB%98%E6%95%88%E5%B9%B6%E8%A1%8C%E7%9A%84%E9%A2%91%E7%B9%81%E5%AD%90%E5%9B%BE%E6%8C%96%E6%8E%98&rft.jtitle=%E8%AE%A1%E7%AE%97%E6%9C%BA%E7%A7%91%E5%AD%A6%E4%B8%8E%E6%8E%A2%E7%B4%A2&rft.au=%E5%AD%99%E9%B9%A4%E7%AB%8B&rft.au=%E9%99%88%E5%BC%BA&rft.au=%E5%88%98%E7%8E%AE&rft.au=%E9%BB%84%E5%81%A5%E6%96%8C&rft.date=2014&rft.pub=%E5%8D%97%E4%BA%AC%E5%A4%A7%E5%AD%A6+%E8%AE%A1%E7%AE%97%E6%9C%BA%E8%BD%AF%E4%BB%B6%E6%96%B0%E6%8A%80%E6%9C%AF%E5%9B%BD%E5%AE%B6%E9%87%8D%E7%82%B9%E5%AE%9E%E9%AA%8C%E5%AE%A4%EF%BC%8C%E5%8D%97%E4%BA%AC+210023&rft.issn=1673-9418&rft.issue=7&rft.spage=790&rft.epage=801&rft_id=info:doi/10.3778%2Fj.issn.1673-9418.1403027&rft.externalDocID=jsjkxyts201407005 |
thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fwww.wanfangdata.com.cn%2Fimages%2FPeriodicalImages%2Fjsjkxyts%2Fjsjkxyts.jpg |