Network topology optimization for data aggregation with splitting

In this paper, we develop algorithms for the data aggregation problem which arises in the context of big-data applications that employ the MapReduce operation. For the case when source racks can send their data to the aggregator using multiple paths, we show that an aggregation tree topology that mi...

Bibliographic Details
Published in IEEE International Symposium on Signal Processing and Information Technology, pp. 000398 - 000403
Main Authors Das, Soham; Sahni, Sartaj
Format Conference Proceeding
Language English
Published IEEE 01.12.2014
Abstract In this paper, we develop algorithms for the data aggregation problem which arises in the context of big-data applications that employ the MapReduce operation. For the case when source racks can send their data to the aggregator using multiple paths, we show that an aggregation tree topology that minimizes aggregation time can be constructed in polynomial time. We consider also the problem of constructing aggregation trees that minimize total network traffic subject to the primary constraint that aggregation time is minimized. Heuristics for this problem are presented. Experiments show that allowing multiple paths reduces aggregation time by up to 99% relative to the aggregation trees constructed using the LPT rule [3]. This reduction in aggregation time, however, comes with up to 35% increase in total network traffic when racks have more than 2 optical links.
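The abstract compares against trees built with the LPT rule. As a rough illustration only, the sketch below applies the generic textbook LPT (longest processing time first) rule to a simplified model in which each source rack's data volume is a job and each aggregator link is an identical machine. This model, the function names `lpt_makespan` and `split_lower_bound`, and the sample volumes are assumptions made for illustration; they are not taken from the paper and do not reproduce its tree construction or its experimental results.

```python
import heapq


def lpt_makespan(sizes, links):
    """Generic LPT rule: place the largest item first, always on the least-loaded link.

    Hypothetical simplification: each rack's data goes entirely over one link.
    """
    loads = [0.0] * links
    heapq.heapify(loads)
    for size in sorted(sizes, reverse=True):
        lightest = heapq.heappop(loads)      # current least-loaded link
        heapq.heappush(loads, lightest + size)
    return max(loads)


def split_lower_bound(sizes, links):
    """Idealized finish time if every rack's data could be divided freely across links."""
    return sum(sizes) / links


volumes = [3, 3, 2, 2, 2]   # illustrative data volumes, not from the paper
print(lpt_makespan(volumes, 2))       # 7.0
print(split_lower_bound(volumes, 2))  # 6.0
```

In this toy instance the unsplit LPT assignment finishes at 7 while perfectly even splitting would finish at 6, which hints at why allowing multiple paths can shorten aggregation time. The even-split figure is only a lower bound under the stated assumptions; it ignores topology, per-rack link limits, and the extra traffic that splitting can cause.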
Author Das, Soham
Sahni, Sartaj
Author_xml – sequence: 1
  givenname: Soham
  surname: Das
  fullname: Das, Soham
  email: sdas@cise.ufl.edu
  organization: Dept. of Comput. & Inf. Sci. & Eng., Univ. of Florida, Gainesville, FL, USA
– sequence: 2
  givenname: Sartaj
  surname: Sahni
  fullname: Sahni, Sartaj
  email: sahni@cise.ufl.edu
  organization: Dept. of Comput. & Inf. Sci. & Eng., Univ. of Florida, Gainesville, FL, USA
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ISSPIT.2014.7300622
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
Discipline Engineering
EISBN 9781479918119
1479918113
9781479918126
1479918121
EndPage 000403
ExternalDocumentID 7300622
Genre orig-research
ISSN 2162-7843
IngestDate Wed Aug 27 02:05:48 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
PageCount 6
ParticipantIDs ieee_primary_7300622
PublicationCentury 2000
PublicationDate 20141201
PublicationDateYYYYMMDD 2014-12-01
PublicationDate_xml – month: 12
  year: 2014
  text: 20141201
  day: 01
PublicationDecade 2010
PublicationTitle IEEE International Symposium on Signal Processing and Information Technology
PublicationTitleAbbrev ISSPIT
PublicationYear 2014
Publisher IEEE
Publisher_xml – name: IEEE
SourceID ieee
SourceType Publisher
StartPage 000398
SubjectTerms Approximation algorithms
Big Data applications
Clustering algorithms
Complexity theory
Data Center Networks
Map-Reduce tasks
Network topology
Optical fiber communication
Optical switches
Software Defined networking
Topology
Title Network topology optimization for data aggregation with splitting
URI https://ieeexplore.ieee.org/document/7300622