23.9 An 8-channel 4.5Gb 180GB/s 18ns-row-latency RAM for the last level cache

In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of graphics processing units and other accelerators. Caching has been a very effective way to relieve bandwidth demand and to reduce average memory l...

Full description

Saved in:
Bibliographic Details
Published in2017 IEEE International Solid-State Circuits Conference (ISSCC) pp. 404 - 405
Main Authors Ting, Tah-Kang Joseph, Gyh-Bin Wang, Ming-Hung Wang, Chun-Peng Wu, Chun-Kai Wang, Chun-Wei Lo, Li-Chin Tien, Der-Min Yuan, Yung-Ching Hsieh, Jenn-Shiang Lai, Wen-Pin Hsu, Chien-Chih Huang, Chi-Kang Chen, Yung-Fa Chou, Ding-Ming Kwai, Zhe Wang, Wei Wu, Tomishima, Shigeki, Stolt, Pat, Shih-Lien Lu
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.02.2017
Subjects
Online AccessGet full text

Cover

Loading…
Abstract In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of graphics processing units and other accelerators. Caching has been a very effective way to relieve bandwidth demand and to reduce average memory latency. As shown by the cache feature table in Fig. 23.9.1, there is a big latency gap between SRAM caches in the CPU and the external DRAM main memory. As a key element for future computing systems, the last level cache (LLC) should have a high random access bandwidth, a low random access latency, a density of 1 to 8Gb, and all signal pads located on one side of the chip [1]. A logic-process-based solution was proposed [2], but it is not scalable, and has a high standby current due to its need for frequent refresh. HBM2 was also proposed [3], but its row latency is not better than conventional DRAM, and its random-access bandwidth is still limited by t FAW , as shown in Fig. 23.9.1. This paper describes the high-bandwidth low-latency (HBLL) RAM design: how it overcomes these challenges and meets requirements in a cost-effective way.
AbstractList In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of graphics processing units and other accelerators. Caching has been a very effective way to relieve bandwidth demand and to reduce average memory latency. As shown by the cache feature table in Fig. 23.9.1, there is a big latency gap between SRAM caches in the CPU and the external DRAM main memory. As a key element for future computing systems, the last level cache (LLC) should have a high random access bandwidth, a low random access latency, a density of 1 to 8Gb, and all signal pads located on one side of the chip [1]. A logic-process-based solution was proposed [2], but it is not scalable, and has a high standby current due to its need for frequent refresh. HBM2 was also proposed [3], but its row latency is not better than conventional DRAM, and its random-access bandwidth is still limited by t FAW , as shown in Fig. 23.9.1. This paper describes the high-bandwidth low-latency (HBLL) RAM design: how it overcomes these challenges and meets requirements in a cost-effective way.
Author Chun-Kai Wang
Li-Chin Tien
Shih-Lien Lu
Ding-Ming Kwai
Tomishima, Shigeki
Der-Min Yuan
Chien-Chih Huang
Gyh-Bin Wang
Chi-Kang Chen
Yung-Fa Chou
Jenn-Shiang Lai
Yung-Ching Hsieh
Chun-Wei Lo
Ting, Tah-Kang Joseph
Wei Wu
Stolt, Pat
Zhe Wang
Chun-Peng Wu
Wen-Pin Hsu
Ming-Hung Wang
Author_xml – sequence: 1
  givenname: Tah-Kang Joseph
  surname: Ting
  fullname: Ting, Tah-Kang Joseph
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 2
  surname: Gyh-Bin Wang
  fullname: Gyh-Bin Wang
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 3
  surname: Ming-Hung Wang
  fullname: Ming-Hung Wang
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 4
  surname: Chun-Peng Wu
  fullname: Chun-Peng Wu
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 5
  surname: Chun-Kai Wang
  fullname: Chun-Kai Wang
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 6
  surname: Chun-Wei Lo
  fullname: Chun-Wei Lo
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 7
  surname: Li-Chin Tien
  fullname: Li-Chin Tien
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 8
  surname: Der-Min Yuan
  fullname: Der-Min Yuan
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 9
  surname: Yung-Ching Hsieh
  fullname: Yung-Ching Hsieh
  organization: Piecemakers Technol., Hsinchu, Taiwan
– sequence: 10
  surname: Jenn-Shiang Lai
  fullname: Jenn-Shiang Lai
  organization: ITRI, Hsinchu, Taiwan
– sequence: 11
  surname: Wen-Pin Hsu
  fullname: Wen-Pin Hsu
  organization: ITRI, Hsinchu, Taiwan
– sequence: 12
  surname: Chien-Chih Huang
  fullname: Chien-Chih Huang
  organization: ITRI, Hsinchu, Taiwan
– sequence: 13
  surname: Chi-Kang Chen
  fullname: Chi-Kang Chen
  organization: ITRI, Hsinchu, Taiwan
– sequence: 14
  surname: Yung-Fa Chou
  fullname: Yung-Fa Chou
  organization: ITRI, Hsinchu, Taiwan
– sequence: 15
  surname: Ding-Ming Kwai
  fullname: Ding-Ming Kwai
  organization: ITRI, Hsinchu, Taiwan
– sequence: 16
  surname: Zhe Wang
  fullname: Zhe Wang
  organization: Intel, Hilsboro, OR, USA
– sequence: 17
  surname: Wei Wu
  fullname: Wei Wu
  organization: Intel, Hilsboro, OR, USA
– sequence: 18
  givenname: Shigeki
  surname: Tomishima
  fullname: Tomishima, Shigeki
  organization: Intel, Hilsboro, OR, USA
– sequence: 19
  givenname: Pat
  surname: Stolt
  fullname: Stolt, Pat
  organization: Intel, Hilsboro, OR, USA
– sequence: 20
  surname: Shih-Lien Lu
  fullname: Shih-Lien Lu
  organization: TSMC, Hsinchu, Taiwan
BookMark eNp9jrtuAjEQADdRkMLrB0izP2CztrmzXZJTQihogB45p0VHZHzR-ZSIv4eCmmqK0UgzgpfUJgaYKZJKkZ-vd7uqkpqUldZZWhj9BFNvnSrIk7GF088w1MaWwpVUvsIo5x8iKnzphrDRRnpcJnSibkJKHHEhi9U3Kker93m-MWXRtf8ihp5TfcHtcoPHtsO-YYwh9xj571bVoW54AoNjiJmnd47h7fNjX32JEzMffrvTOXSXw_3SPLZXP-U9CA
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/ISSCC.2017.7870432
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library Online
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library Online
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9781509037582
1509037586
EISSN 2376-8606
EndPage 405
ExternalDocumentID 7870432
Genre orig-research
GroupedDBID 29G
6IE
6IF
6IH
6IK
6IL
6IM
6IN
AAJGR
ABLEC
ACGFS
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
IPLJI
JC5
M43
OCL
RIE
RIG
RIL
RIO
RNS
ID FETCH-ieee_primary_78704323
IEDL.DBID RIE
IngestDate Wed Jun 26 19:24:10 EDT 2024
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-ieee_primary_78704323
ParticipantIDs ieee_primary_7870432
PublicationCentury 2000
PublicationDate 2017-Feb.
PublicationDateYYYYMMDD 2017-02-01
PublicationDate_xml – month: 02
  year: 2017
  text: 2017-Feb.
PublicationDecade 2010
PublicationTitle 2017 IEEE International Solid-State Circuits Conference (ISSCC)
PublicationTitleAbbrev ISSCC
PublicationYear 2017
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0005968
Score 4.05307
Snippet In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of...
SourceID ieee
SourceType Publisher
StartPage 404
SubjectTerms Arrays
Bandwidth
Decoding
Random access memory
Semiconductor device measurement
Timing
Title 23.9 An 8-channel 4.5Gb 180GB/s 18ns-row-latency RAM for the last level cache
URI https://ieeexplore.ieee.org/document/7870432
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3NT8MgFH-ZO-nFj82oU8PBo6Uf0K4cZ-M2TWqM02S3pRR2cWGLaw_618tr96FmB08QEuAFCO8H_H4PgJuIWwwqJSp6FN5WBcqxR7DMrmVfaaHCzK_0FelTNHzjj-Nw3IDbjRZGa12RzzTFbPWWr-Z5iVdlLi4uzuyGu9cVotZqbekcIorXohhPuA-jUZIgc6tLV7V-fZ9SeY_-IaTrfmvSyDstC0nzrz8hGf9r2BG0tzo98rzxQMfQ0OYEDn6EGGxBGjAqSM-Q2EGNr9Ezwmk4kMSPvcGdu7SpWTr2LO7MMoTPn-SllxKLZIlFhsRC64LMkFdEcoz83IZO__41GTpo3GRRR6qYrOxip9A0c6PPgAguFQu8KRfWL4dMST9U2dTnMs4Ykyw-h9auFi52F3dgH4e4ZjFfQrP4KPWVddKFvK5m5xueK5Ht
link.rule.ids 310,311,783,787,792,793,799,27937,55086
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8MgGH6zzIN68WMz6vzg4NHSD2htj7Nx63RdjJvJbk0p7GLDjGsP-uuFdh9qdvAEIQHeAOF9gOd5AbjxqMKgjGlFD9e3VQ431BEsVWvZ5iLgbmpX-op45EWv9HHqThtwu9bCCCEq8pnAOlu95fN5VuqrMlMvLkrUhrujcLXv1WqtDaEj8PyVLMYKzMF4HIaau3WHl_V-faBS-Y_eAcSrnmvayBsuC4azrz9BGf9r2iG0N0o99Lz2QUfQEPIY9n8EGWxB7BAcoK5EvqFVvlLkiGK3z5DtW_17c6FSuTDUadzIUw2gP9FLN0YKyyKFDZEC1wXKNbMIZTr2cxs6vYdJGBnauOS9jlWRLO0iJ9CUcylOAQWUceJYMxooz-wSzmyXpzObMj8lhBH_DFrbWjjfXnwNu9EkHibDweipA3t6uGtO8wU0i49SXCqXXbCraqa-AXMglTg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2017+IEEE+International+Solid-State+Circuits+Conference+%28ISSCC%29&rft.atitle=23.9+An+8-channel+4.5Gb+180GB%2Fs+18ns-row-latency+RAM+for+the+last+level+cache&rft.au=Ting%2C+Tah-Kang+Joseph&rft.au=Gyh-Bin+Wang&rft.au=Ming-Hung+Wang&rft.au=Chun-Peng+Wu&rft.date=2017-02-01&rft.pub=IEEE&rft.eissn=2376-8606&rft.spage=404&rft.epage=405&rft_id=info:doi/10.1109%2FISSCC.2017.7870432&rft.externalDocID=7870432