A Multi-Levels Geo-Location based Crawling Method for Social Media Platforms

Bibliographic Details
Published in 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS), pp. 494-498
Main Authors Alzubi, Shadi; Aqel, Darah; Mughaid, Alaa; Jararweh, Yaser
Format Conference Proceeding
Language English
Published IEEE 01.10.2019
DOI 10.1109/SNAMS.2019.8931856

Abstract The large size and dynamic nature of the Web highlight the need for continuous support and updating of Web-based information retrieval systems. Crawlers facilitate this process by following the hyperlinks in Web pages to automatically download a partial snapshot of the Web. While some systems rely on crawlers that exhaustively crawl the Web, others incorporate focus within their crawlers to harvest application- or topic-specific collections. This project studied web crawling and scraping at many different levels. It aggregates information from multiple sources into one central location. It specifies a program for downloading web pages: given an initial set of seed URLs, it recursively downloads every page that is linked from pages in the set and whose content satisfies a specific criterion. Social media, web applications, and mobile applications are employed together in the proposed system to manage search in the rapidly growing World Wide Web. Applying the proposed system results in a fast and convenient search engine that fulfills users' requests based on specific geolocations.
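
The seed-based crawl described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `focused_crawl`, `LinkExtractor`, `TOY_WEB`, and `mentions_jordan` are hypothetical, the "web" is an in-memory dictionary rather than live HTTP, and the criterion is a simple place-name match standing in for whatever geo-location test the paper uses. The sketch also assumes that links are followed only from pages that satisfy the criterion, which is one possible reading of the abstract.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkExtractor(HTMLParser):
    """Collects the href targets of <a> tags found in one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def focused_crawl(seeds, fetch, matches, max_pages=100):
    """Breadth-first crawl from `seeds`, keeping pages where matches(html) is true.

    `fetch(url)` returns the page's HTML or None (hypothetical interface).
    Assumption: only pages that satisfy the criterion are kept and expanded.
    """
    queue = deque(seeds)
    seen = set(seeds)
    kept = {}
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        fetched += 1
        if html is None or not matches(html):
            continue  # drop the page and do not follow its links
        kept[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and strip #fragments before de-duplicating.
            link, _fragment = urldefrag(urljoin(url, href))
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return kept


# Toy in-memory "web" (invented data, for illustration only).
TOY_WEB = {
    "http://example.test/": '<a href="/amman">Amman</a> <a href="/paris">Paris</a> events in Jordan',
    "http://example.test/amman": 'Cafes in Amman, Jordan <a href="/zarqa#top">Zarqa</a> <a href="/">home</a>',
    "http://example.test/paris": 'Museums in Paris <a href="/hidden">hidden</a>',
    "http://example.test/zarqa": "Markets in Zarqa, Jordan",
    "http://example.test/hidden": "Jordan travel tips",
}


def mentions_jordan(html):
    """Stand-in geo-location criterion: the page text names the target place."""
    return "jordan" in html.lower()


result = focused_crawl(["http://example.test/"], TOY_WEB.get, mentions_jordan)
print(sorted(result))
```

In this toy run the Paris page fails the criterion, so it is discarded and the page it links to is never reached, even though that page would have matched. A real crawler would replace `TOY_WEB.get` with an HTTP fetcher and would also need politeness controls such as rate limiting and robots.txt handling, which are omitted here.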
Authors and Affiliations
Alzubi, Shadi: Al Zaytoonah University of Jordan, Computer Science Department, Amman, Jordan
Aqel, Darah: Al Zaytoonah University of Jordan, Computer Science Department, Amman, Jordan
Mughaid, Alaa: The Hashemite University, Computer Science Department, Zarqa, Jordan
Jararweh, Yaser: Mathematics and Computer Science, Duquesne University, Pittsburgh, PA, USA, 15282
EISBN 172812946X
9781728129464
Subject Terms Cloud computing
Cluster computing
Crawlers
Crawling
Data set sampling
Facebook
Geo-Locations
Scraping
Search engines
Security
Social Media
Web pages
Title A Multi-Levels Geo-Location based Crawling Method for Social Media Platforms
URI https://ieeexplore.ieee.org/document/8931856