CROB: Implementing a Large Instruction Window through Compression

Current processors require a large number of in-flight instructions in order to look for further parallelism and hide the increasing gap between memory latency and processor cycle time. These in-flight instructions are typically stored in centralized structures called reorder buffer (ROB), which is...

Full description

Saved in:
Bibliographic Details
Published inTransactions on High-Performance Embedded Architectures and Compilers III pp. 115 - 134
Main Authors Latorre, Fernando, Magklis, Grigorios, González, Jose, Chaparro, Pedro, González, Antonio
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg 2011
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3642194478
9783642194474
ISSN0302-9743
1611-3349
DOI10.1007/978-3-642-19448-1_7

Cover

Abstract Current processors require a large number of in-flight instructions in order to look for further parallelism and hide the increasing gap between memory latency and processor cycle time. These in-flight instructions are typically stored in centralized structures called reorder buffer (ROB), which is a centerpiece to handle precise exceptions and recover a safe state in the event of a branch misprediction. However, this structure is becoming so big that it is difficult to fit it in the power budget of future processors designs. In this paper we propose a novel ROB microarchitecture named CROB (Compressed ROB) that can compress ROB entries and therefore give the illusion of having a larger virtual ROB than the number of ROB entries. The performance study of CROB shows a tremendous benefit, with an average speedup of 20% and 12% for a 128-entry and 256-entry ROB respectively. For some benchmark categories such as SpecFP2000, speedup raise up to 30%.
AbstractList Current processors require a large number of in-flight instructions in order to look for further parallelism and hide the increasing gap between memory latency and processor cycle time. These in-flight instructions are typically stored in centralized structures called reorder buffer (ROB), which is a centerpiece to handle precise exceptions and recover a safe state in the event of a branch misprediction. However, this structure is becoming so big that it is difficult to fit it in the power budget of future processors designs. In this paper we propose a novel ROB microarchitecture named CROB (Compressed ROB) that can compress ROB entries and therefore give the illusion of having a larger virtual ROB than the number of ROB entries. The performance study of CROB shows a tremendous benefit, with an average speedup of 20% and 12% for a 128-entry and 256-entry ROB respectively. For some benchmark categories such as SpecFP2000, speedup raise up to 30%.
Author González, Antonio
González, Jose
Latorre, Fernando
Chaparro, Pedro
Magklis, Grigorios
Author_xml – sequence: 1
  givenname: Fernando
  surname: Latorre
  fullname: Latorre, Fernando
  email: fernando.latorre@intel.com
  organization: Intel Barcelona Research Center, Intel Labs, UPC, Spain
– sequence: 2
  givenname: Grigorios
  surname: Magklis
  fullname: Magklis, Grigorios
  email: grigorios.magklis@intel.com
  organization: Intel Barcelona Research Center, Intel Labs, UPC, Spain
– sequence: 3
  givenname: Jose
  surname: González
  fullname: González, Jose
  email: pepe.gonzalez@intel.com
  organization: Intel Barcelona Research Center, Intel Labs, UPC, Spain
– sequence: 4
  givenname: Pedro
  surname: Chaparro
  fullname: Chaparro, Pedro
  email: pedro.chaparro.monferrer@intel.com
  organization: Intel Barcelona Research Center, Intel Labs, UPC, Spain
– sequence: 5
  givenname: Antonio
  surname: González
  fullname: González, Antonio
  email: antonio.gonzalez@intel.com
  organization: Intel Barcelona Research Center, Intel Labs, UPC, Spain
BookMark eNo1kNFKAzEQRaNWsK39Al_yA9FMJt0kvtWlaqFQEMXHsLtJ2mqbLZst_r6x6rwMnAvD3DMig9hGT8gN8FvgXN0ZpRmyQgoGRkrNwKozMskUMzshOCdDKAAYojQXZPQfKD0gQ45cMKMkXpFJSh88j1SF0mZIZuXL6uGeLvaHnd_72G_jmlZ0WXVrTxcx9d2x6bdtpO_b6Nov2m-69rje0LLdHzqfUo6uyWWodslP_vaYvD3OX8tntlw9LcrZkiUAoVjB9XSKQkhXaC6C8cpUTeGcMk0N4AQ2PnDHcxUTXK10kC7wSkOoAUODiGMCv3fToctf-s7WbfuZLHD748hmGxZtrm1PQmx2hN-QH1Zm
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2011
Copyright_xml – notice: Springer-Verlag Berlin Heidelberg 2011
DOI 10.1007/978-3-642-19448-1_7
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9783642194481
3642194486
EISSN 1611-3349
Editor Stenström, Per
Editor_xml – sequence: 1
  givenname: Per
  surname: Stenström
  fullname: Stenström, Per
  email: per@ce.chalmers.se
EndPage 134
GroupedDBID -DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
ALMA_UNASSIGNED_HOLDINGS
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RIG
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-s1127-608553224d6802f9e79ac6dd79cb11d23cef0d04489fdb78f4df0a81fb13fc333
ISBN 3642194478
9783642194474
ISSN 0302-9743
IngestDate Tue Jul 29 20:11:17 EDT 2025
IsPeerReviewed true
IsScholarly true
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-s1127-608553224d6802f9e79ac6dd79cb11d23cef0d04489fdb78f4df0a81fb13fc333
PageCount 20
ParticipantIDs springer_books_10_1007_978_3_642_19448_1_7
PublicationCentury 2000
PublicationDate 2011
PublicationDateYYYYMMDD 2011-01-01
PublicationDate_xml – year: 2011
  text: 2011
PublicationDecade 2010
PublicationPlace Berlin, Heidelberg
PublicationPlace_xml – name: Berlin, Heidelberg
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationTitle Transactions on High-Performance Embedded Architectures and Compilers III
PublicationYear 2011
Publisher Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
SSID ssj0000476789
ssj0002792
Score 1.8365151
Snippet Current processors require a large number of in-flight instructions in order to look for further parallelism and hide the increasing gap between memory latency...
SourceID springer
SourceType Publisher
StartPage 115
SubjectTerms Average Speedup
Circular Buffer
Logical Register
Register File
Segment Size
Title CROB: Implementing a Large Instruction Window through Compression
URI http://link.springer.com/10.1007/978-3-642-19448-1_7
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LbxMxELbScEEcKC_xKMgHTkSL1rHj3SD1EKrQh0JBqEW9rfxYVwjIoiQIqf-t_40Z27trWoRULqtoFcXe-ZzZ8Tcznwl5iUH8mAue6dqWGcS3IlOCsawwtjATrZyzmNF9fywPTsXR2eRsMLhMqpZ-bvRrc_HXvpL_QRXuAa7YJXsDZLsfhRvwGfCFKyAM1yvB7580a_-WCW0JnvLHio3sY9IHMP8OlrEQUM6SZEGQZEYvAO5gtR4dHvbHWOP-O1DSkV1uer76_Gs8OH0ftvPN6kvTxeL7zfLCp9vZt8BGY1IhqRr4gTKPoRTYrpp0ie59-vAWGQmvUOzLlrBhcrTA4nRfxRClbcF1wVx-dWcK4exD-W5YUmjrer27iOmQ42bjq8xG7YkVrQNLGQ7f55cyHC3DOfqHAJhvRhHgfYUIh_60PWHg72HHFFxoHVy8ROFGHoRSo9tmoaU0RgAs0KvXXi5pPQkMluFosAWvii2yVZRiSG7N5keLzx3Fl4tCejH_GBigVmNIaoVJYatRO-kyiEH1D9EpZAUR5CsjXsvb-3DoZJvcwRYZiuCCee-RQb28T-625qbR3A_IDBF-Q1N8qaIeX5rgSwO-NOJLE3wfktN385O9gyye2ZGtIXIvMol1j_CSEFaW-dhN62KqjLS2mBrNmB1zU7vc5vAYU2d1UTphXa5K5jTjznDOH5HhslnWjwktrZNCc2PFRAmupHLSSjtRMlfaiLF-Ql61NqjwX7iuWgluMFjFKzBY5Q1WgcGe3uTLz8jtfh3ukCGYo34OsedGv4gY_wYwgX2H
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Transactions+on+High-Performance+Embedded+Architectures+and+Compilers+III&rft.au=Latorre%2C+Fernando&rft.au=Magklis%2C+Grigorios&rft.au=Gonz%C3%A1lez%2C+Jose&rft.au=Chaparro%2C+Pedro&rft.atitle=CROB%3A+Implementing+a+Large+Instruction+Window+through+Compression&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2011-01-01&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783642194474&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=115&rft.epage=134&rft_id=info:doi/10.1007%2F978-3-642-19448-1_7
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon