Papers' similarity based on the summarization merits
This paper proposes a research paper similarity system that measures the similarity of an input paper to other papers based on a summarized version of each paper. Currently, the system takes into account two types of summarization, based on different types of keywords,...
Published in | 2015 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC) pp. 137 - 142 |
---|---|
Main Authors | Alli, Mostafa; Alli, Vahid; Ling Feng |
Format | Conference Proceeding |
Language | English |
Published | IEEE 01.10.2015 |
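Subjects | Computational modeling; Computer science; Economics; Electronic mail; Information science; Mathematical model |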
Online Access | Get full text |
Abstract | This paper proposes a research paper similarity system that measures the similarity of an input paper to other papers based on a summarized version of each paper. Currently, the system takes into account two types of summarization, based on two types of keywords, i.e., normal keywords and stemmed keywords. In contrast to existing recommendation systems for research papers, which use citation and/or PageRank data, our system works independently of such data and depends on the textual content of the paper. Our experiment, conducted with Google Scholar, a citation-based paper recommendation system, as the baseline, shows that citation-based systems like Google Scholar are prone to ignoring more related but less cited papers, while systems based on the textual content of papers can be more successful at recommending papers that are more similar to the input paper. However, comparing the full text of papers is a time-consuming and costly process, while producing summarized versions of papers and comparing them can be both faster and reusable. In addition, we show that the ranked list that Google Scholar returns can be formulated and predicted from citation scores. Furthermore, we show statistically that normal keyword summarization can be the better choice of the two types of summarization. As future work, we will build a synonym-acronym dictionary for scholarly papers in the field of computer science and engineering, to add synonym-acronym comparison to the system. |
---|---|
Author | Alli, Vahid; Alli, Mostafa; Ling Feng |
Author_xml | – sequence: 1 givenname: Mostafa surname: Alli fullname: Alli, Mostafa email: allim10@mails.tsinghua.edu.cn organization: Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China – sequence: 2 givenname: Vahid surname: Alli fullname: Alli, Vahid email: allivahid@ymail.com organization: Dept. of Math., Islamic Azad Univ., Tehran, Iran – sequence: 3 surname: Ling Feng fullname: Ling Feng email: fengling@tsinghua.edu.cn organization: Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/BESC.2015.7365971 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings; IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume; IEEE Xplore All Conference Proceedings; IEEE Electronic Library Online; IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library Online url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Economics; Computer Science |
EISBN | 9781467387835 1467387835 |
EndPage | 142 |
ExternalDocumentID | 7365971 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i123t-6bc10abff1ca8529955273d97c5bff3b4be3fb4f4494448495442700214f4813 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:37:40 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i123t-6bc10abff1ca8529955273d97c5bff3b4be3fb4f4494448495442700214f4813 |
PageCount | 6 |
ParticipantIDs | ieee_primary_7365971 |
PublicationCentury | 2000 |
PublicationDate | 20151001 |
PublicationDateYYYYMMDD | 2015-10-01 |
PublicationDate_xml | – month: 10 year: 2015 text: 20151001 day: 01 |
PublicationDecade | 2010 |
PublicationTitle | 2015 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC) |
PublicationTitleAbbrev | BESC |
PublicationYear | 2015 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.6242855 |
Snippet | This paper proposes a Research paper Similarity system that measures the similarity of an input paper with other papers based on the summarized version of each... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 137 |
SubjectTerms | Computational modeling; Computer science; Economics; Electronic mail; Information science; Mathematical model |
Title | Papers' similarity based on the summarization merits |
URI | https://ieeexplore.ieee.org/document/7365971 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
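The abstract describes comparing papers through keyword-based summaries, using either normal or stemmed keywords. The following is a minimal Python sketch of that general idea, not the authors' implementation: this record does not state how keywords are selected, which stemmer is used, or which similarity measure is applied. The stop-word list, the toy `naive_stem` suffix stripper, the top-k frequency summary, and the Jaccard overlap are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the record does not say which one the authors used.
STOP_WORDS = {"a", "an", "the", "of", "and", "or", "in", "on",
              "for", "to", "is", "are", "with", "based"}


def naive_stem(word):
    """Toy single-pass suffix stripper standing in for a real stemmer (assumption)."""
    for suffix in ("ing", "es", "s"):
        # Strip the first matching suffix only if at least 3 letters remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def keyword_summary(text, top_k=10, stem=False):
    """Summarize a text as the set of its top_k most frequent non-stop words."""
    words = [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]
    if stem:
        words = [naive_stem(w) for w in words]
    return {w for w, _ in Counter(words).most_common(top_k)}


def jaccard(a, b):
    """Jaccard overlap of two keyword sets (assumed similarity measure)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def paper_similarity(text_a, text_b, top_k=10, stem=False):
    """Compare two papers through their keyword summaries rather than full text."""
    return jaccard(keyword_summary(text_a, top_k, stem),
                   keyword_summary(text_b, top_k, stem))


if __name__ == "__main__":
    a = "ranking papers"
    b = "paper rankings"
    print("normal keywords: ", paper_similarity(a, b, stem=False))
    print("stemmed keywords:", paper_similarity(a, b, stem=True))
```

Because each summary is a small keyword set, it can be computed once per paper and reused across comparisons, which matches the abstract's point that comparing summaries can be faster and more reusable than comparing full text. Which keyword type works better is an empirical question; the abstract reports that normal keywords were the better choice in the authors' experiments.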