QPOPSS: Query and Parallelism Optimized Space-Saving for finding frequent stream elements

The frequent elements problem, a key component in demanding stream-data analytics, involves selecting elements whose occurrence exceeds a user-specified threshold. Fast, memory-efficient ϵ-approximate synopsis algorithms select all frequent elements but may overestimate them depending on ϵ (user-def...

Full description

Saved in:
Bibliographic Details
Published inJournal of parallel and distributed computing Vol. 204; p. 105134
Main Authors Jarlow, Victor, Stylianopoulos, Charalampos, Papatriantafilou, Marina
Format Journal Article
LanguageEnglish
Published Elsevier Inc 01.10.2025
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The frequent elements problem, a key component in demanding stream-data analytics, involves selecting elements whose occurrence exceeds a user-specified threshold. Fast, memory-efficient ϵ-approximate synopsis algorithms select all frequent elements but may overestimate them depending on ϵ (user-defined parameter). Evolving applications demand performance only achievable by parallelization. However, algorithmic guarantees concerning concurrent updates and queries have been overlooked. We propose Query and Parallelism Optimized Space-Saving (QPOPSS ), providing concurrency guarantees. A cornerstone of the design is a new approach for the main data structure for the Space-Saving algorithm, enabling support of very fast queries. QPOPSS combines minimal overlap with concurrent updates, distributing work and using fine-grained thread synchronization to achieve high throughput, accuracy, and low memory use. Our analysis shows space and approximation bounds under various concurrency and data distribution conditions. Our empirical evaluation relative to representative state-of-the-art methods reveals that QPOPSS 's multithreaded throughput scales linearly while maintaining the highest accuracy, with orders of magnitude smaller memory footprint. •QPOPSS: a memory-efficient parallel algorithm for frequent element detection in data streams.•QOSS: a query-optimized variant of Space-Saving with improved top-k query performance.•Open-source implementation with comprehensive evaluation on real and synthetic datasets.•QPOPSS outperforms state-of-the-art in accuracy and memory efficiency under tight constraints.•Strong scalability and robustness on skewed data and large query workloads.
AbstractList The frequent elements problem, a key component in demanding stream-data analytics, involves selecting elements whose occurrence exceeds a user-specified threshold. Fast, memory-efficient ϵ-approximate synopsis algorithms select all frequent elements but may overestimate them depending on ϵ (user-defined parameter). Evolving applications demand performance only achievable by parallelization. However, algorithmic guarantees concerning concurrent updates and queries have been overlooked. We propose Query and Parallelism Optimized Space-Saving (QPOPSS ), providing concurrency guarantees. A cornerstone of the design is a new approach for the main data structure for the Space-Saving algorithm, enabling support of very fast queries. QPOPSS combines minimal overlap with concurrent updates, distributing work and using fine-grained thread synchronization to achieve high throughput, accuracy, and low memory use. Our analysis shows space and approximation bounds under various concurrency and data distribution conditions. Our empirical evaluation relative to representative state-of-the-art methods reveals that QPOPSS 's multithreaded throughput scales linearly while maintaining the highest accuracy, with orders of magnitude smaller memory footprint.
The frequent elements problem, a key component in demanding stream-data analytics, involves selecting elements whose occurrence exceeds a user-specified threshold. Fast, memory-efficient ϵ-approximate synopsis algorithms select all frequent elements but may overestimate them depending on ϵ (user-defined parameter). Evolving applications demand performance only achievable by parallelization. However, algorithmic guarantees concerning concurrent updates and queries have been overlooked. We propose Query and Parallelism Optimized Space-Saving (QPOPSS ), providing concurrency guarantees. A cornerstone of the design is a new approach for the main data structure for the Space-Saving algorithm, enabling support of very fast queries. QPOPSS combines minimal overlap with concurrent updates, distributing work and using fine-grained thread synchronization to achieve high throughput, accuracy, and low memory use. Our analysis shows space and approximation bounds under various concurrency and data distribution conditions. Our empirical evaluation relative to representative state-of-the-art methods reveals that QPOPSS 's multithreaded throughput scales linearly while maintaining the highest accuracy, with orders of magnitude smaller memory footprint. •QPOPSS: a memory-efficient parallel algorithm for frequent element detection in data streams.•QOSS: a query-optimized variant of Space-Saving with improved top-k query performance.•Open-source implementation with comprehensive evaluation on real and synthetic datasets.•QPOPSS outperforms state-of-the-art in accuracy and memory efficiency under tight constraints.•Strong scalability and robustness on skewed data and large query workloads.
ArticleNumber 105134
Author Stylianopoulos, Charalampos
Jarlow, Victor
Papatriantafilou, Marina
Author_xml – sequence: 1
  givenname: Victor
  orcidid: 0009-0001-6661-5783
  surname: Jarlow
  fullname: Jarlow, Victor
  email: victor.jarlow@ri.se, victor.jarlow@outlook.com
  organization: Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Gothenburg, Sweden
– sequence: 2
  givenname: Charalampos
  orcidid: 0000-0002-6845-9163
  surname: Stylianopoulos
  fullname: Stylianopoulos, Charalampos
  email: charalampos.stylianopoulos@gmail.com
  organization: Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Gothenburg, Sweden
– sequence: 3
  givenname: Marina
  orcidid: 0000-0001-9094-8871
  surname: Papatriantafilou
  fullname: Papatriantafilou, Marina
  email: ptrianta@chalmers.se
  organization: Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Gothenburg, Sweden
BackLink https://gup.ub.gu.se/publication/352301$$DView record from Swedish Publication Index
https://research.chalmers.se/publication/547212$$DView record from Swedish Publication Index
BookMark eNp9kctuFDEQRb0IEkngB1j5B3rwu23EBkU8IkWaiRoWrEoed3niUb-wu4PC19OTQexgVVVX955F3StyMYwDEvKGsw1n3Lw9bo5TGzaCCb0Kmkt1QS5ZrWRVS65fkqtSjoxxrmt7Sb7f77a7pnlH7xfMT9QPLd357LsOu1R6up3m1Kdf2NJm8gGrxj-m4UDjmGlMQ_u8Z_yx4DDTMmf0PcUO-_Usr8iL6LuCr__Ma_Lt08evN1-qu-3n25sPd1VQnM9VW1uLkjETa6Wj1Toax6JRMkYlpa5RSW72XIq9CIwFy61z0hnnEF10zMpr0py55SdOyx6mnHqfn2D0CTIW9Dk8QHjwXY-5QEHQpg7a1QoktgaUEHuwJnJA6Y3QVmph3Eqt_kk9LBOs0mE50Va7ZHz1i7M_5LGUjPFvgjM41QJHONUCp1rgXMsaen8O4fqfx4QZSkg4BGxTxjBDO6b_xX8DlfmY7w
Cites_doi 10.1109/JSSC.2005.864128
10.1016/S0304-3975(03)00400-6
10.14778/2367502.2367508
10.1145/3584699
10.1002/cpe.3182
10.1016/0167-6423(82)90012-0
10.1109/TNET.2021.3132385
10.1016/j.ins.2015.09.003
10.14778/1687627.1687653
10.1145/762471.762473
10.1145/3516431.3516433
10.1016/j.jalgor.2003.12.001
10.1145/1166074.1166084
10.14778/1454159.1454225
10.1016/j.datak.2008.11.001
10.1145/859716.859719
10.14778/3514061.3514068
10.1145/6617.6621
ContentType Journal Article
Copyright 2025 The Author(s)
Copyright_xml – notice: 2025 The Author(s)
DBID 6I.
AAFTH
AAYXX
CITATION
ADTPV
AOWAS
F1U
F1S
DOI 10.1016/j.jpdc.2025.105134
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
SwePub
SwePub Articles
SWEPUB Göteborgs universitet
SWEPUB Chalmers tekniska högskola
DatabaseTitle CrossRef
DatabaseTitleList


DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
ExternalDocumentID oai_research_chalmers_se_567c5974_3ed6_422b_86f1_e3a625835269
oai_gup_ub_gu_se_352301
10_1016_j_jpdc_2025_105134
S0743731525001017
GroupedDBID --K
--M
-~X
.~1
0R~
1B1
1~.
1~5
29L
4.4
457
4G.
5GY
5VS
6I.
7-5
71M
8P~
9JN
AAEDT
AAEDW
AAFTH
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AATTM
AAXKI
AAXUO
AAYFN
AAYWO
ABBOA
ABDPE
ABEFU
ABFNM
ABFSI
ABJNI
ABMAC
ABWVN
ABXDB
ACDAQ
ACGFS
ACNNM
ACRLP
ACRPL
ACVFH
ACZNC
ADBBV
ADCNI
ADEZE
ADFGL
ADHUB
ADJOM
ADMUD
ADNMO
ADTZH
ADVLN
AEBSH
AECPX
AEIPS
AEKER
AENEX
AEUPX
AFJKZ
AFPUW
AFTJW
AFXIZ
AGCQF
AGHFR
AGQPQ
AGRNS
AGUBO
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIGII
AIIUN
AIKHN
AITUG
AKBMS
AKRWK
AKYEP
ALMA_UNASSIGNED_HOLDINGS
AMRAJ
ANKPU
AOUOD
APXCP
ASPBG
AVWKF
AXJTR
AZFZN
BJAXD
BKOJK
BLXMC
BNPGV
CAG
COF
CS3
DM4
DU5
E.L
EBS
EFBJH
EFKBS
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-2
G-Q
GBLVA
GBOLZ
HLZ
HVGLF
HZ~
H~9
IHE
J1W
JJJVA
K-O
KOM
LG5
LG9
LY7
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
Q38
R2-
RIG
ROL
RPZ
SBC
SDF
SDG
SDP
SES
SET
SEW
SPC
SPCBC
SST
SSV
SSZ
T5K
TN5
TWZ
WUQ
XJT
XOL
XPP
ZMT
ZU3
ZY4
~G-
AAYXX
CITATION
ADTPV
AOWAS
F1U
F1S
ID FETCH-LOGICAL-c411t-d788e3006f745f855f690f643ff43357e4316b132b2c00c8189939699ee9f9083
IEDL.DBID .~1
ISSN 0743-7315
1096-0848
IngestDate Thu Aug 21 06:52:28 EDT 2025
Wed Aug 27 03:57:15 EDT 2025
Tue Aug 05 12:11:47 EDT 2025
Sat Aug 09 17:30:59 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Language English
License This is an open access article under the CC BY license.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c411t-d788e3006f745f855f690f643ff43357e4316b132b2c00c8189939699ee9f9083
ORCID 0000-0001-9094-8871
0000-0002-6845-9163
0009-0001-6661-5783
OpenAccessLink https://www.sciencedirect.com/science/article/pii/S0743731525001017
ParticipantIDs swepub_primary_oai_research_chalmers_se_567c5974_3ed6_422b_86f1_e3a625835269
swepub_primary_oai_gup_ub_gu_se_352301
crossref_primary_10_1016_j_jpdc_2025_105134
elsevier_sciencedirect_doi_10_1016_j_jpdc_2025_105134
PublicationCentury 2000
PublicationDate 2025-10-01
PublicationDateYYYYMMDD 2025-10-01
PublicationDate_xml – month: 10
  year: 2025
  text: 2025-10-01
  day: 01
PublicationDecade 2020
PublicationTitle Journal of parallel and distributed computing
PublicationYear 2025
Publisher Elsevier Inc
Publisher_xml – name: Elsevier Inc
References Hernandez, Xiao, Tudor (br0240) 2020
Ngo, Papatriantafilou (br0470) 2025; 18
Gong, Yang, Zhang, Li, Uhlig, Chen, Uden, Li (br0210) 2018
Estan, Varghese (br0190) 2003; 21
Demaine, López-Ortiz, Munro (br0180) 2002; vol. 2461
Cormode (br0120) 2022; 50
Ben-Basat, Einziger, Friedman, Kassner (br0060) 2016; 35
Cafaro, Pulimeno, Tempesta (br0090) 2016; 329
Rinberg, Keidar (br0370) 2020; 179
Anderson, Bevan, Lang, Liberty, Rhodes, Thaler (br0040) 2017
Afek, Bremler-Barr, Cohen, Feibish, Shagam (br0020) 2016
Manku, Motwani (br0310) 2012; 5
Misra, Gries (br0340) 1982; 2
Charikar, Chen, Farach-Colton (br0100) 2004; 312
Zhang, Sun, Zhang, Xu, Wu (br0440) 2014; 26
Al-Fares, Radhakrishnan, Raghavan, Huang, Vahdat (br0030) 2010; vol. 7(1)
Stich, Cordonnier, Jaggi (br0420) 2018; 31
Brualdi (br0080) 2010
Manerikar, Palpanas (br0300) 2009; 68
Rinberg, Keidar (br0380) 2023; 70
Stylianopoulos, Walulya, Almgren, Landsiedel, Papatriantafilou (br0430) 2020; 15
Hsu, Indyk, Katabi, Vakilian (br0250) 2019
Pagiamtzis, Sheikholeslami (br0360) 2006; 41
May, Duh, Van Durme, Lall (br0320) 2017
Github (br0260) 2025
Nikolakopoulos, Gidenstam, Papatriantafilou, Tsigas (br0350) 2015
Cormode (br0110) 2018
Cormode, Hadjieleftheriou (br0140) 2008; 1
Ben Basat, Einziger, Keslassy, Orda, Vargaftik, Waisbard (br0070) 2022; 30
Henzinger, Kirsch, Payer, Sezgin, Sokolova (br0230) jan 2013; 48
Zhao, Agrawal, El Abbadi, Metwally (br0450) 2022; 15
Cormode, Muthukrishnan (br0150) 2005; 55
Mandal, Jiang, Shrivastava, Sarkar (br0290) 2018; 32
Karp, Shenker, Papadimitriou (br0270) 2003; 28
Atkinson, Sack, Santoro, Strothotte (br0050) 1986; 29
Harrison, Cai, Gupta, Rexford (br0220) 2018
Das, Antony, Agrawal, El Abbadi (br0170) 2009; 2
Metwally, Agrawal, El Abbadi (br0330) 2006; 31
(br0390) 2003
Kumar, Sung, Xu, Wang (br0280) 2004
Roy, Khan, Alonso (br0400) 2016
Das, Antony, Agrawal, El Abbadi (br0160) 2009; 25
br0010
Garofalakis, Gehrke, Rastogi (br0200) 2016
Zipf (br0460) 1949
Cormode, Hadjieleftheriou (br0130) 2005
Sivaraman, Narayana, Rottenstreich, Muthukrishnan, Rexford (br0410) 2017
Zipf (10.1016/j.jpdc.2025.105134_br0460) 1949
Das (10.1016/j.jpdc.2025.105134_br0170) 2009; 2
Cormode (10.1016/j.jpdc.2025.105134_br0150) 2005; 55
Zhang (10.1016/j.jpdc.2025.105134_br0440) 2014; 26
Al-Fares (10.1016/j.jpdc.2025.105134_br0030) 2010; vol. 7(1)
Cormode (10.1016/j.jpdc.2025.105134_br0140) 2008; 1
Cormode (10.1016/j.jpdc.2025.105134_br0130)
Demaine (10.1016/j.jpdc.2025.105134_br0180) 2002; vol. 2461
Estan (10.1016/j.jpdc.2025.105134_br0190) 2003; 21
Hernandez (10.1016/j.jpdc.2025.105134_br0240) 2020
Rinberg (10.1016/j.jpdc.2025.105134_br0380) 2023; 70
Roy (10.1016/j.jpdc.2025.105134_br0400) 2016
Cormode (10.1016/j.jpdc.2025.105134_br0110) 2018
Garofalakis (10.1016/j.jpdc.2025.105134_br0200) 2016
Ngo (10.1016/j.jpdc.2025.105134_br0470) 2025; 18
Github (10.1016/j.jpdc.2025.105134_br0260)
(10.1016/j.jpdc.2025.105134_br0390) 2003
Ben-Basat (10.1016/j.jpdc.2025.105134_br0060) 2016; 35
Misra (10.1016/j.jpdc.2025.105134_br0340) 1982; 2
Hsu (10.1016/j.jpdc.2025.105134_br0250) 2019
Zhao (10.1016/j.jpdc.2025.105134_br0450) 2022; 15
Rinberg (10.1016/j.jpdc.2025.105134_br0370) 2020; 179
Sivaraman (10.1016/j.jpdc.2025.105134_br0410) 2017
Henzinger (10.1016/j.jpdc.2025.105134_br0230) 2013; 48
Stylianopoulos (10.1016/j.jpdc.2025.105134_br0430) 2020; 15
Das (10.1016/j.jpdc.2025.105134_br0160) 2009; 25
Harrison (10.1016/j.jpdc.2025.105134_br0220) 2018
Metwally (10.1016/j.jpdc.2025.105134_br0330) 2006; 31
Nikolakopoulos (10.1016/j.jpdc.2025.105134_br0350) 2015
Mandal (10.1016/j.jpdc.2025.105134_br0290) 2018; 32
Stich (10.1016/j.jpdc.2025.105134_br0420) 2018; 31
Manku (10.1016/j.jpdc.2025.105134_br0310) 2012; 5
Ben Basat (10.1016/j.jpdc.2025.105134_br0070) 2022; 30
Afek (10.1016/j.jpdc.2025.105134_br0020)
Pagiamtzis (10.1016/j.jpdc.2025.105134_br0360) 2006; 41
Atkinson (10.1016/j.jpdc.2025.105134_br0050) 1986; 29
Kumar (10.1016/j.jpdc.2025.105134_br0280) 2004
Charikar (10.1016/j.jpdc.2025.105134_br0100) 2004; 312
May (10.1016/j.jpdc.2025.105134_br0320) 2017
Cormode (10.1016/j.jpdc.2025.105134_br0120) 2022; 50
Gong (10.1016/j.jpdc.2025.105134_br0210) 2018
Anderson (10.1016/j.jpdc.2025.105134_br0040) 2017
Brualdi (10.1016/j.jpdc.2025.105134_br0080) 2010
Manerikar (10.1016/j.jpdc.2025.105134_br0300) 2009; 68
Cafaro (10.1016/j.jpdc.2025.105134_br0090) 2016; 329
Karp (10.1016/j.jpdc.2025.105134_br0270) 2003; 28
References_xml – start-page: 177
  year: 2004
  end-page: 188
  ident: br0280
  article-title: Data streaming algorithms for efficient and accurate estimation of flow size distribution
  publication-title: Proceedings of the Joint ACM International Conference on Measurement and Modeling of Computer Systems
– start-page: 909
  year: 2018
  end-page: 921
  ident: br0210
  article-title: HeavyKeeper: an accurate algorithm for finding top-k elephant flows
  publication-title: USENIX Annual Technical Conference
– volume: 35
  start-page: 1
  year: 2016
  end-page: 9
  ident: br0060
  article-title: Heavy Hitters in streams and sliding windows
  publication-title: IEEE Int. Conf. Comput. Commun.
– volume: 2
  start-page: 143
  year: 1982
  end-page: 152
  ident: br0340
  article-title: Finding repeated elements
  publication-title: Sci. Comput. Program.
– volume: 18
  year: 2025
  ident: br0470
  article-title: Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing
  publication-title: Proc. VLDB Endow.
– year: 2017
  ident: br0320
  article-title: Streaming word embeddings with the space-saving algorithm
– volume: 329
  start-page: 1
  year: 2016
  end-page: 19
  ident: br0090
  article-title: A parallel space saving algorithm for frequent items and the Hurwitz zeta distribution
  publication-title: Inf. Sci.
– start-page: 167
  year: 2018
  end-page: 168
  ident: br0110
  article-title: Data summarization and distributed computation
  publication-title: Proceedings of the ACM Symposium on Principles of Distributed Computing
– start-page: 1
  year: 2018
  end-page: 7
  ident: br0220
  article-title: Network-wide heavy hitter detection with commodity switches
  publication-title: Proceedings of the Symposium on SDN Research
– volume: 32
  start-page: 10921
  year: 2018
  end-page: 10931
  ident: br0290
  article-title: Topkapi: parallel and fast sketches for finding top-k frequent elements
  publication-title: Proc. Int. Conf. Neural Inf. Process. Syst.
– start-page: 164
  year: 2017
  end-page: 176
  ident: br0410
  article-title: Heavy-hitter detection entirely in the data plane
  publication-title: Proceedings of the ACM Symposium on SDN Research
– volume: 179
  start-page: 2:1
  year: 2020
  end-page: 2:17
  ident: br0370
  article-title: Intermediate value linearizability: a quantitative correctness criterion
  publication-title: International Symposium on Distributed Computing
– volume: vol. 2461
  start-page: 348
  year: 2002
  end-page: 360
  ident: br0180
  article-title: Frequency Estimation of Internet Packet Streams with Limited Space
  publication-title: Algorithms — ESA 2002
– volume: 5
  start-page: 346
  year: 2012
  end-page: 357
  ident: br0310
  article-title: Approximate frequency counts over data streams
  publication-title: Proc. VLDB Endow.
– volume: 25
  start-page: 1323
  year: 2009
  end-page: 1326
  ident: br0160
  article-title: CoTS: a scalable framework for parallelizing frequency counting over data streams
  publication-title: IEEE Int. Conf. Data Eng.
– volume: 1
  start-page: 1530
  year: 2008
  end-page: 1541
  ident: br0140
  article-title: Finding frequent items in data streams
  publication-title: VLDB Endow.
– volume: 15
  start-page: 1215
  year: 2022
  end-page: 1227
  ident: br0450
  article-title: Spacesaving±: an optimal algorithm for frequency estimation and frequent items in the bounded-deletion model
  publication-title: Proc. VLDB Endow.
– volume: 68
  start-page: 415
  year: 2009
  end-page: 430
  ident: br0300
  article-title: Frequent items in streaming data: an experimental evaluation of the state-of-the-art
  publication-title: Data Knowl. Eng.
– volume: 70
  year: 2023
  ident: br0380
  article-title: Intermediate value linearizability: a quantitative correctness criterion
  publication-title: J. ACM
– ident: br0010
  article-title: Anonymized Internet traces 2019
– year: 2025
  ident: br0260
– volume: 2
  start-page: 217
  year: 2009
  end-page: 228
  ident: br0170
  article-title: Thread cooperation in multicore architectures for frequency counting over multiple data streams
  publication-title: Proc. VLDB Endow.
– volume: 48
  start-page: 317
  year: jan 2013
  end-page: 328
  ident: br0230
  article-title: Quantitative relaxation of concurrent data structures
– volume: 30
  start-page: 1440
  year: 2022
  end-page: 1453
  ident: br0070
  article-title: Memento: making sliding windows efficient for heavy Hitters
  publication-title: IEEE/ACM Trans. Netw.
– start-page: 1
  year: 2020
  end-page: 9
  ident: br0240
  article-title: Eraia-enabling intelligence data pipelines for iot-based application systems
  publication-title: IEEE International Conference on Pervasive Computing and Communications
– volume: 29
  start-page: 996
  year: 1986
  end-page: 1000
  ident: br0050
  article-title: Min-max heaps and generalized priority queues
  publication-title: Commun. ACM
– volume: 15
  start-page: 1
  year: 2020
  end-page: 16
  ident: br0430
  article-title: : a parallel design with support for fast and accurate concurrent operations
  publication-title: Eur. Conf. Comput. Syst.
– volume: 55
  start-page: 58
  year: 2005
  end-page: 75
  ident: br0150
  article-title: An improved data stream summary: the count-min sketch and its applications
  publication-title: J. Algorithms
– year: 1949
  ident: br0460
  article-title: Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology
– year: 2010
  ident: br0080
  article-title: Introductory Combinatorics
– volume: 312
  start-page: 3
  year: 2004
  end-page: 15
  ident: br0100
  article-title: Finding frequent items in data streams
  publication-title: Theor. Comput. Sci.
– start-page: 1
  year: 2016
  end-page: 9
  ident: br0200
  article-title: Data Stream Management: A Brave New World
– volume: 21
  start-page: 270
  year: 2003
  end-page: 313
  ident: br0190
  article-title: New directions in traffic measurement and accounting: focusing on the elephants, ignoring the mice
  publication-title: ACM Trans. Comput. Syst.
– volume: 26
  start-page: 2856
  year: 2014
  end-page: 2879
  ident: br0440
  article-title: An efficient framework for parallel and continuous frequent item monitoring
  publication-title: Concurr. Comput., Pract. Exp.
– volume: 28
  start-page: 51
  year: 2003
  end-page: 55
  ident: br0270
  article-title: A simple algorithm for finding frequent elements in streams and bags
  publication-title: ACM Trans. Database Syst.
– year: 2019
  ident: br0250
  article-title: Learning-based frequency estimation algorithms
  publication-title: International Conference on Learning Representations
– volume: 31
  start-page: 4452
  year: 2018
  end-page: 4463
  ident: br0420
  article-title: Sparsified sgd with memory
  publication-title: Adv. Neural Inf. Process. Syst.
– volume: vol. 7(1)
  start-page: 281
  year: 2010
  end-page: 296
  ident: br0030
  article-title: Hedera: dynamic flow scheduling for data center networks
  publication-title: USENIX Conference on Networked Systems Design and Implementation
– year: 2003
  ident: br0390
  publication-title: Unix Systems Programming: Communication, Concurrency, and Threads
– start-page: 268
  year: 2017
  end-page: 282
  ident: br0040
  article-title: A high-performance algorithm for identifying frequent items in data streams
  publication-title: Proceedings of the 2017 Internet Measurement Conference
– volume: 50
  start-page: 6
  year: 2022
  end-page: 15
  ident: br0120
  article-title: Current trends in data summaries
  publication-title: SIGMOD Rec.
– start-page: 239
  year: 2015
  end-page: 248
  ident: br0350
  article-title: A consistency framework for iteration operations in concurrent data structures
  publication-title: IEEE International Parallel and Distributed Processing Symposium
– start-page: 1449
  year: 2016
  end-page: 1463
  ident: br0400
  article-title: Augmented sketch: faster and more accurate stream processing
  publication-title: ACM International Conference on Management of Data
– year: 2005
  ident: br0130
  article-title: Finding frequent items website
– volume: 31
  start-page: 1095
  year: 2006
  end-page: 1133
  ident: br0330
  article-title: An integrated efficient solution for computing frequent and top-
  publication-title: ACM Trans. Database Syst.
– year: 2016
  ident: br0020
  article-title: Efficient distinct heavy Hitters for DNS DDoS attack detection
– volume: 41
  start-page: 712
  year: 2006
  end-page: 727
  ident: br0360
  article-title: Content-Addressable Memory (CAM) circuits and architectures: a tutorial and survey
  publication-title: IEEE J. Solid-State Circuits
– volume: 35
  start-page: 1
  year: 2016
  ident: 10.1016/j.jpdc.2025.105134_br0060
  article-title: Heavy Hitters in streams and sliding windows
  publication-title: IEEE Int. Conf. Comput. Commun.
– volume: 41
  start-page: 712
  issue: 3
  year: 2006
  ident: 10.1016/j.jpdc.2025.105134_br0360
  article-title: Content-Addressable Memory (CAM) circuits and architectures: a tutorial and survey
  publication-title: IEEE J. Solid-State Circuits
  doi: 10.1109/JSSC.2005.864128
– volume: 312
  start-page: 3
  issue: 1
  year: 2004
  ident: 10.1016/j.jpdc.2025.105134_br0100
  article-title: Finding frequent items in data streams
  publication-title: Theor. Comput. Sci.
  doi: 10.1016/S0304-3975(03)00400-6
– start-page: 1
  year: 2018
  ident: 10.1016/j.jpdc.2025.105134_br0220
  article-title: Network-wide heavy hitter detection with commodity switches
– volume: 5
  start-page: 346
  year: 2012
  ident: 10.1016/j.jpdc.2025.105134_br0310
  article-title: Approximate frequency counts over data streams
  publication-title: Proc. VLDB Endow.
  doi: 10.14778/2367502.2367508
– volume: 25
  start-page: 1323
  issue: 1
  year: 2009
  ident: 10.1016/j.jpdc.2025.105134_br0160
  article-title: CoTS: a scalable framework for parallelizing frequency counting over data streams
  publication-title: IEEE Int. Conf. Data Eng.
– start-page: 909
  year: 2018
  ident: 10.1016/j.jpdc.2025.105134_br0210
  article-title: HeavyKeeper: an accurate algorithm for finding top-k elephant flows
– volume: 32
  start-page: 10921
  year: 2018
  ident: 10.1016/j.jpdc.2025.105134_br0290
  article-title: Topkapi: parallel and fast sketches for finding top-k frequent elements
  publication-title: Proc. Int. Conf. Neural Inf. Process. Syst.
– volume: 70
  issue: 2
  year: 2023
  ident: 10.1016/j.jpdc.2025.105134_br0380
  article-title: Intermediate value linearizability: a quantitative correctness criterion
  publication-title: J. ACM
  doi: 10.1145/3584699
– volume: 26
  start-page: 2856
  year: 2014
  ident: 10.1016/j.jpdc.2025.105134_br0440
  article-title: An efficient framework for parallel and continuous frequent item monitoring
  publication-title: Concurr. Comput., Pract. Exp.
  doi: 10.1002/cpe.3182
– volume: vol. 7(1)
  start-page: 281
  year: 2010
  ident: 10.1016/j.jpdc.2025.105134_br0030
  article-title: Hedera: dynamic flow scheduling for data center networks
– year: 2019
  ident: 10.1016/j.jpdc.2025.105134_br0250
  article-title: Learning-based frequency estimation algorithms
– volume: 2
  start-page: 143
  issue: 2
  year: 1982
  ident: 10.1016/j.jpdc.2025.105134_br0340
  article-title: Finding repeated elements
  publication-title: Sci. Comput. Program.
  doi: 10.1016/0167-6423(82)90012-0
– start-page: 1
  year: 2016
  ident: 10.1016/j.jpdc.2025.105134_br0200
– start-page: 1449
  year: 2016
  ident: 10.1016/j.jpdc.2025.105134_br0400
  article-title: Augmented sketch: faster and more accurate stream processing
– volume: 30
  start-page: 1440
  issue: 4
  year: 2022
  ident: 10.1016/j.jpdc.2025.105134_br0070
  article-title: Memento: making sliding windows efficient for heavy Hitters
  publication-title: IEEE/ACM Trans. Netw.
  doi: 10.1109/TNET.2021.3132385
– volume: 31
  start-page: 4452
  year: 2018
  ident: 10.1016/j.jpdc.2025.105134_br0420
  article-title: Sparsified sgd with memory
  publication-title: Adv. Neural Inf. Process. Syst.
– year: 2010
  ident: 10.1016/j.jpdc.2025.105134_br0080
– volume: 329
  start-page: 1
  year: 2016
  ident: 10.1016/j.jpdc.2025.105134_br0090
  article-title: A parallel space saving algorithm for frequent items and the Hurwitz zeta distribution
  publication-title: Inf. Sci.
  doi: 10.1016/j.ins.2015.09.003
– ident: 10.1016/j.jpdc.2025.105134_br0130
– volume: 179
  start-page: 2:1
  year: 2020
  ident: 10.1016/j.jpdc.2025.105134_br0370
  article-title: Intermediate value linearizability: a quantitative correctness criterion
  publication-title: International Symposium on Distributed Computing
– volume: 2
  start-page: 217
  issue: 1
  year: 2009
  ident: 10.1016/j.jpdc.2025.105134_br0170
  article-title: Thread cooperation in multicore architectures for frequency counting over multiple data streams
  publication-title: Proc. VLDB Endow.
  doi: 10.14778/1687627.1687653
– volume: 15
  start-page: 1
  year: 2020
  ident: 10.1016/j.jpdc.2025.105134_br0430
  article-title: Delegation sketch: a parallel design with support for fast and accurate concurrent operations
  publication-title: Eur. Conf. Comput. Syst.
– year: 1949
  ident: 10.1016/j.jpdc.2025.105134_br0460
– volume: 28
  start-page: 51
  issue: 1
  year: 2003
  ident: 10.1016/j.jpdc.2025.105134_br0270
  article-title: A simple algorithm for finding frequent elements in streams and bags
  publication-title: ACM Trans. Database Syst.
  doi: 10.1145/762471.762473
– start-page: 239
  year: 2015
  ident: 10.1016/j.jpdc.2025.105134_br0350
  article-title: A consistency framework for iteration operations in concurrent data structures
– start-page: 164
  year: 2017
  ident: 10.1016/j.jpdc.2025.105134_br0410
  article-title: Heavy-hitter detection entirely in the data plane
– volume: 50
  start-page: 6
  issue: 4
  year: 2022
  ident: 10.1016/j.jpdc.2025.105134_br0120
  article-title: Current trends in data summaries
  publication-title: SIGMOD Rec.
  doi: 10.1145/3516431.3516433
– volume: 55
  start-page: 58
  issue: 1
  year: 2005
  ident: 10.1016/j.jpdc.2025.105134_br0150
  article-title: An improved data stream summary: the count-min sketch and its applications
  publication-title: J. Algorithms
  doi: 10.1016/j.jalgor.2003.12.001
– ident: 10.1016/j.jpdc.2025.105134_br0020
– volume: 31
  start-page: 1095
  issue: 3
  year: 2006
  ident: 10.1016/j.jpdc.2025.105134_br0330
  article-title: An integrated efficient solution for computing frequent and top- k elements in data streams
  publication-title: ACM Trans. Database Syst.
  doi: 10.1145/1166074.1166084
– start-page: 268
  year: 2017
  ident: 10.1016/j.jpdc.2025.105134_br0040
  article-title: A high-performance algorithm for identifying frequent items in data streams
– volume: 1
  start-page: 1530
  issue: 2
  year: 2008
  ident: 10.1016/j.jpdc.2025.105134_br0140
  article-title: Finding frequent items in data streams
  publication-title: VLDB Endow.
  doi: 10.14778/1454159.1454225
– volume: 18
  issue: 9
  year: 2025
  ident: 10.1016/j.jpdc.2025.105134_br0470
  article-title: Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing
  publication-title: Proc. VLDB Endow.
– start-page: 167
  year: 2018
  ident: 10.1016/j.jpdc.2025.105134_br0110
  article-title: Data summarization and distributed computation
– start-page: 177
  year: 2004
  ident: 10.1016/j.jpdc.2025.105134_br0280
  article-title: Data streaming algorithms for efficient and accurate estimation of flow size distribution
– ident: 10.1016/j.jpdc.2025.105134_br0260
– volume: vol. 2461
  start-page: 348
  year: 2002
  ident: 10.1016/j.jpdc.2025.105134_br0180
  article-title: Frequency Estimation of Internet Packet Streams with Limited Space
– volume: 68
  start-page: 415
  issue: 4
  year: 2009
  ident: 10.1016/j.jpdc.2025.105134_br0300
  article-title: Frequent items in streaming data: an experimental evaluation of the state-of-the-art
  publication-title: Data Knowl. Eng.
  doi: 10.1016/j.datak.2008.11.001
– volume: 48
  start-page: 317
  issue: 1
  year: 2013
  ident: 10.1016/j.jpdc.2025.105134_br0230
  article-title: Quantitative relaxation of concurrent data structures
– volume: 21
  start-page: 270
  issue: 3
  year: 2003
  ident: 10.1016/j.jpdc.2025.105134_br0190
  article-title: New directions in traffic measurement and accounting: focusing on the elephants, ignoring the mice
  publication-title: ACM Trans. Comput. Syst.
  doi: 10.1145/859716.859719
– year: 2017
  ident: 10.1016/j.jpdc.2025.105134_br0320
– start-page: 1
  year: 2020
  ident: 10.1016/j.jpdc.2025.105134_br0240
  article-title: Eraia-enabling intelligence data pipelines for iot-based application systems
– year: 2003
  ident: 10.1016/j.jpdc.2025.105134_br0390
– volume: 15
  start-page: 1215
  issue: 6
  year: 2022
  ident: 10.1016/j.jpdc.2025.105134_br0450
  article-title: Spacesaving±: an optimal algorithm for frequency estimation and frequent items in the bounded-deletion model
  publication-title: Proc. VLDB Endow.
  doi: 10.14778/3514061.3514068
– volume: 29
  start-page: 996
  issue: 10
  year: 1986
  ident: 10.1016/j.jpdc.2025.105134_br0050
  article-title: Min-max heaps and generalized priority queues
  publication-title: Commun. ACM
  doi: 10.1145/6617.6621
SSID ssj0011578
Score 2.4245262
Snippet The frequent elements problem, a key component in demanding stream-data analytics, involves selecting elements whose occurrence exceeds a user-specified...
SourceID swepub
crossref
elsevier
SourceType Open Access Repository
Index Database
Publisher
StartPage 105134
SubjectTerms Computer and Information Sciences
Data- och informationsvetenskap (Datateknik)
Title QPOPSS: Query and Parallelism Optimized Space-Saving for finding frequent stream elements
URI https://dx.doi.org/10.1016/j.jpdc.2025.105134
https://gup.ub.gu.se/publication/352301
https://research.chalmers.se/publication/547212
Volume 204
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT9wwELYQvXDpi6JuC8gHxKVKdxM_EveGUNG2vBalSPRk2c4YFnWX1T4O5dDf3pk8UCvRHrgllq1YY3sezsz3MbbnQYBRhU5iyEIiq5wwIDON5yqk0VeFjjUZzOmZHl7Kr1fqao0ddrUwlFbZ6v5Gp9faum3pt9Lsz8bjfknGLxfE31MDpVFFuZQ57fKPvx7SPAhLpuigOKl3WzjT5HjdziqCMcwU0d2mQv7TOP2JIlpbnqOX7HnrMvKDZlav2BpMX7MXHR0Db0_nJvt-MTofleUnfrGC-U_uphUfuTlxpfwYLyb8HJXDZHwPFS8xToakdHSXwNFp5fWPa3qe14nVS04VJG7CocktX7xhl0efvx0Ok5Y5IQkyTZdJhYEtCDxQMZcqFkpFDIIjOh8xSiFUDlQA7zEQ9VkYDAIabXRTjDYGwESDXtkWW5_eTeEt406G3EQ_cK6Q0qmBlwG80WnIK8Lez3rsQycyO2sAMmyXOXZrScCWBGwbAfeY6qRq_1pmixr8v-P2myV4-AbBYl-vZhabrld2AVbQ_XbaYyePdGyBk25suKlZaRY0QOk8UDhlBVTayizzFrdmakE4DArJM9Xm3RPn-55t0FuT9bfN1pfzFeyg97L0u_X23GXPDr4cD89-A8yV7Z8
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELZKOcCFN2J5-gBcUNiNX4mROCCg2tJtu1VaqZyM49jtVt3tarMrVA78Kf4gM4lTgQQckHqLnDixPtvzcGa-IeR56bnXMldJcMwlosqQA5Ip2FcuDWWVq9AUg9neUcMD8elQHq6RH10uDIZVRtnfyvRGWseWfkSzP59M-gUqv4xj_Z6GKC2LkZVb_vwr-G31280PMMkvGNv4uP9-mMTSAokTabpMKvD8PIcVFzIhQy5lAC8xgHYOQXAuM48Z4iV4aiVzg4EDrQZ6XCutvddBg9kC771CrgoQF1g24fX3i7gSJK_JO-5PHF7M1GmDyk7mFfImMon1dVMu_qoNf6UtbVTdxi1yI9qo9F0Lw22y5md3yM2u_gON4uAu-bw33h0XxRu6t_KLc2pnFR3bBRZnOZ3UU7oL0mg6-eYrWoBj7pPC4uEFBSuZNn_K8XrRRHIvKaas2Cn1bTB7fY8cXAqe98n67GzmHxBqhct0KAfW5kJYOSiF86VWqcsqJPtnPfKqg8zMW0YO04WqnRgE2CDApgW4R2SHqvltXRlQGf_s97KdgotvIA_30WpuoOloZWpvOB6opz0y-sODkanp2LjjpgxOjR2kyhz6b4b7ShnBWGlgL6TGcwteKJrCSj_8z_E-I9eG-9sjM9rc2XpEruOdNuTwMVlfLlb-CZhOy_Jps1Qp-XLZe-Mn59AmbA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=QPOPSS%3A+Query+and+Parallelism+Optimized+Space-Saving+for+finding+frequent+stream+elements&rft.jtitle=Journal+of+parallel+and+distributed+computing&rft.au=Jarlow%2C+Victor&rft.au=Stylianopoulos%2C+Charalampos&rft.au=Papatriantafilou%2C+Marina&rft.date=2025-10-01&rft.issn=1096-0848&rft.volume=204&rft_id=info:doi/10.1016%2Fj.jpdc.2025.105134&rft.externalDocID=oai_research_chalmers_se_567c5974_3ed6_422b_86f1_e3a625835269
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0743-7315&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0743-7315&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0743-7315&client=summon