Papers' similarity based on the summarization merits

This paper proposes a research paper similarity system that measures the similarity of an input paper to other papers based on a summarized version of each paper. Currently, the system considers two types of summarization, based on different types of keywords,...

Bibliographic Details
Published in 2015 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC), pp. 137-142
Main Authors Alli, Mostafa; Alli, Vahid; Ling Feng
Format Conference Proceeding
Language English
Published IEEE 01.10.2015
Abstract This paper proposes a research paper similarity system that measures the similarity of an input paper to other papers based on a summarized version of each paper. Currently, the system considers two types of summarization, based on two types of keywords, i.e., normal keywords and stemmed keywords. In contrast to existing recommendation systems for research papers, which use citation and/or PageRank data, our system works independently of such data and depends only on the textual content of the paper. Our experiment, conducted with Google Scholar, a citation-based paper recommendation system, as the baseline, shows that citation-based systems such as Google Scholar are prone to ignoring more related but less cited papers, while systems based on the textual content of papers can be more successful at recommending papers that are more similar to the input paper. However, comparing the full text of papers is a time-consuming and costly process, whereas producing summarized versions of papers and comparing them can be both faster and reusable. In addition, we show that the ranked list that Google Scholar returns can be formulated and predicted from citation scores. Furthermore, we show statistically that normal keyword summarization can be the better choice of the two types of summarization. As future work, we will build a synonym-acronym dictionary for scholarly papers in the computer science and engineering fields, in order to add synonym-acronym comparison to the system.
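The abstract does not say how the keyword summaries are built or compared, so the following is only a minimal sketch of the general idea, under stated assumptions. It reduces each text to its most frequent keywords, optionally stemmed, and compares two summaries with Jaccard overlap. The stop-word list, the toy suffix stripper `crude_stem`, the `top_k` cutoff, and the choice of Jaccard similarity are all illustrative assumptions and are not taken from the paper.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stop-word list, not the one used in the paper.
STOPWORDS = frozenset({"the", "and", "for", "that", "with", "are", "this", "from"})


def crude_stem(word):
    """Toy suffix stripper standing in for a real stemmer (hypothetical helper)."""
    for suffix in ("ization", "ations", "ation", "ing", "ies", "ed", "es", "s"):
        # Only strip when at least three characters of the word remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def keyword_summary(text, top_k=10, stem=False):
    """Summarize a text as the set of its top_k most frequent keywords."""
    tokens = [
        t for t in re.findall(r"[a-z]+", text.lower())
        if len(t) > 2 and t not in STOPWORDS
    ]
    if stem:
        tokens = [crude_stem(t) for t in tokens]
    return {word for word, _ in Counter(tokens).most_common(top_k)}


def jaccard(a, b):
    """Jaccard overlap of two keyword sets; defined as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def summary_similarity(text_a, text_b, top_k=10, stem=False):
    """Compare two texts through their keyword summaries only."""
    return jaccard(
        keyword_summary(text_a, top_k, stem),
        keyword_summary(text_b, top_k, stem),
    )
```

Because each summary is a small set, it could be computed once per paper and reused for every later comparison, which is the reusability argument the abstract makes against full-text comparison. Calling `summary_similarity(a, b, stem=False)` and `summary_similarity(a, b, stem=True)` on the same pair gives the two variants, normal and stemmed, side by side.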
Author Alli, Vahid
Alli, Mostafa
Ling Feng
Author_xml – sequence: 1
  givenname: Mostafa
  surname: Alli
  fullname: Alli, Mostafa
  email: allim10@mails.tsinghua.edu.cn
  organization: Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
– sequence: 2
  givenname: Vahid
  surname: Alli
  fullname: Alli, Vahid
  email: allivahid@ymail.com
  organization: Dept. of Math., Islamic Azad Univ., Tehran, Iran
– sequence: 3
  surname: Ling Feng
  fullname: Ling Feng
  email: fengling@tsinghua.edu.cn
  organization: Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/BESC.2015.7365971
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library Online
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library Online
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Economics
Computer Science
EISBN 9781467387835
1467387835
EndPage 142
ExternalDocumentID 7365971
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i123t-6bc10abff1ca8529955273d97c5bff3b4be3fb4f4494448495442700214f4813
IEDL.DBID RIE
IngestDate Thu Jun 29 18:37:40 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i123t-6bc10abff1ca8529955273d97c5bff3b4be3fb4f4494448495442700214f4813
PageCount 6
ParticipantIDs ieee_primary_7365971
PublicationCentury 2000
PublicationDate 20151001
PublicationDateYYYYMMDD 2015-10-01
PublicationDate_xml – month: 10
  year: 2015
  text: 20151001
  day: 01
PublicationDecade 2010
PublicationTitle 2015 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)
PublicationTitleAbbrev BESC
PublicationYear 2015
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.6242855
SourceID ieee
SourceType Publisher
StartPage 137
SubjectTerms Computational modeling
Computer science
Economics
Electronic mail
Google
Information science
Mathematical model
Title Papers' similarity based on the summarization merits
URI https://ieeexplore.ieee.org/document/7365971
hasFullText 1
inHoldings 1
isFullTextHit
isPrint