23.9 An 8-channel 4.5Gb 180GB/s 18ns-row-latency RAM for the last level cache
In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of graphics processing units and other accelerators. Caching has been a very effective way to relieve bandwidth demand and to reduce average memory l...
Saved in:
Published in | 2017 IEEE International Solid-State Circuits Conference (ISSCC) pp. 404 - 405 |
---|---|
Main Authors | , , , , , , , , , , , , , , , , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.02.2017
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of graphics processing units and other accelerators. Caching has been a very effective way to relieve bandwidth demand and to reduce average memory latency. As shown by the cache feature table in Fig. 23.9.1, there is a big latency gap between SRAM caches in the CPU and the external DRAM main memory. As a key element for future computing systems, the last level cache (LLC) should have a high random access bandwidth, a low random access latency, a density of 1 to 8Gb, and all signal pads located on one side of the chip [1]. A logic-process-based solution was proposed [2], but it is not scalable, and has a high standby current due to its need for frequent refresh. HBM2 was also proposed [3], but its row latency is not better than conventional DRAM, and its random-access bandwidth is still limited by t FAW , as shown in Fig. 23.9.1. This paper describes the high-bandwidth low-latency (HBLL) RAM design: how it overcomes these challenges and meets requirements in a cost-effective way. |
---|---|
AbstractList | In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of graphics processing units and other accelerators. Caching has been a very effective way to relieve bandwidth demand and to reduce average memory latency. As shown by the cache feature table in Fig. 23.9.1, there is a big latency gap between SRAM caches in the CPU and the external DRAM main memory. As a key element for future computing systems, the last level cache (LLC) should have a high random access bandwidth, a low random access latency, a density of 1 to 8Gb, and all signal pads located on one side of the chip [1]. A logic-process-based solution was proposed [2], but it is not scalable, and has a high standby current due to its need for frequent refresh. HBM2 was also proposed [3], but its row latency is not better than conventional DRAM, and its random-access bandwidth is still limited by t FAW , as shown in Fig. 23.9.1. This paper describes the high-bandwidth low-latency (HBLL) RAM design: how it overcomes these challenges and meets requirements in a cost-effective way. |
Author | Chun-Kai Wang Li-Chin Tien Shih-Lien Lu Ding-Ming Kwai Tomishima, Shigeki Der-Min Yuan Chien-Chih Huang Gyh-Bin Wang Chi-Kang Chen Yung-Fa Chou Jenn-Shiang Lai Yung-Ching Hsieh Chun-Wei Lo Ting, Tah-Kang Joseph Wei Wu Stolt, Pat Zhe Wang Chun-Peng Wu Wen-Pin Hsu Ming-Hung Wang |
Author_xml | – sequence: 1 givenname: Tah-Kang Joseph surname: Ting fullname: Ting, Tah-Kang Joseph organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 2 surname: Gyh-Bin Wang fullname: Gyh-Bin Wang organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 3 surname: Ming-Hung Wang fullname: Ming-Hung Wang organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 4 surname: Chun-Peng Wu fullname: Chun-Peng Wu organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 5 surname: Chun-Kai Wang fullname: Chun-Kai Wang organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 6 surname: Chun-Wei Lo fullname: Chun-Wei Lo organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 7 surname: Li-Chin Tien fullname: Li-Chin Tien organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 8 surname: Der-Min Yuan fullname: Der-Min Yuan organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 9 surname: Yung-Ching Hsieh fullname: Yung-Ching Hsieh organization: Piecemakers Technol., Hsinchu, Taiwan – sequence: 10 surname: Jenn-Shiang Lai fullname: Jenn-Shiang Lai organization: ITRI, Hsinchu, Taiwan – sequence: 11 surname: Wen-Pin Hsu fullname: Wen-Pin Hsu organization: ITRI, Hsinchu, Taiwan – sequence: 12 surname: Chien-Chih Huang fullname: Chien-Chih Huang organization: ITRI, Hsinchu, Taiwan – sequence: 13 surname: Chi-Kang Chen fullname: Chi-Kang Chen organization: ITRI, Hsinchu, Taiwan – sequence: 14 surname: Yung-Fa Chou fullname: Yung-Fa Chou organization: ITRI, Hsinchu, Taiwan – sequence: 15 surname: Ding-Ming Kwai fullname: Ding-Ming Kwai organization: ITRI, Hsinchu, Taiwan – sequence: 16 surname: Zhe Wang fullname: Zhe Wang organization: Intel, Hilsboro, OR, USA – sequence: 17 surname: Wei Wu fullname: Wei Wu organization: Intel, Hilsboro, OR, USA – sequence: 18 givenname: Shigeki surname: Tomishima fullname: Tomishima, Shigeki organization: Intel, Hilsboro, OR, USA – sequence: 19 givenname: Pat surname: Stolt fullname: Stolt, Pat organization: Intel, Hilsboro, OR, USA – sequence: 20 surname: Shih-Lien Lu fullname: Shih-Lien Lu organization: TSMC, Hsinchu, Taiwan |
BookMark | eNp9jrtuAjEQADdRkMLrB0izP2CztrmzXZJTQihogB45p0VHZHzR-ZSIv4eCmmqK0UgzgpfUJgaYKZJKkZ-vd7uqkpqUldZZWhj9BFNvnSrIk7GF088w1MaWwpVUvsIo5x8iKnzphrDRRnpcJnSibkJKHHEhi9U3Kker93m-MWXRtf8ihp5TfcHtcoPHtsO-YYwh9xj571bVoW54AoNjiJmnd47h7fNjX32JEzMffrvTOXSXw_3SPLZXP-U9CA |
ContentType | Conference Proceeding |
DBID | 6IE 6IH CBEJK RIE RIO |
DOI | 10.1109/ISSCC.2017.7870432 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library Online IEEE Proceedings Order Plans (POP) 1998-present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library Online url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EISBN | 9781509037582 1509037586 |
EISSN | 2376-8606 |
EndPage | 405 |
ExternalDocumentID | 7870432 |
Genre | orig-research |
GroupedDBID | 29G 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR ABLEC ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP IPLJI JC5 M43 OCL RIE RIG RIL RIO RNS |
ID | FETCH-ieee_primary_78704323 |
IEDL.DBID | RIE |
IngestDate | Wed Jun 26 19:24:10 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-ieee_primary_78704323 |
ParticipantIDs | ieee_primary_7870432 |
PublicationCentury | 2000 |
PublicationDate | 2017-Feb. |
PublicationDateYYYYMMDD | 2017-02-01 |
PublicationDate_xml | – month: 02 year: 2017 text: 2017-Feb. |
PublicationDecade | 2010 |
PublicationTitle | 2017 IEEE International Solid-State Circuits Conference (ISSCC) |
PublicationTitleAbbrev | ISSCC |
PublicationYear | 2017 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0005968 |
Score | 4.05307 |
Snippet | In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 404 |
SubjectTerms | Arrays Bandwidth Decoding Random access memory Semiconductor device measurement Timing |
Title | 23.9 An 8-channel 4.5Gb 180GB/s 18ns-row-latency RAM for the last level cache |
URI | https://ieeexplore.ieee.org/document/7870432 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3NT8MgFH-ZO-nFj82oU8PBo6Uf0K4cZ-M2TWqM02S3pRR2cWGLaw_618tr96FmB08QEuAFCO8H_H4PgJuIWwwqJSp6FN5WBcqxR7DMrmVfaaHCzK_0FelTNHzjj-Nw3IDbjRZGa12RzzTFbPWWr-Z5iVdlLi4uzuyGu9cVotZqbekcIorXohhPuA-jUZIgc6tLV7V-fZ9SeY_-IaTrfmvSyDstC0nzrz8hGf9r2BG0tzo98rzxQMfQ0OYEDn6EGGxBGjAqSM-Q2EGNr9Ezwmk4kMSPvcGdu7SpWTr2LO7MMoTPn-SllxKLZIlFhsRC64LMkFdEcoz83IZO__41GTpo3GRRR6qYrOxip9A0c6PPgAguFQu8KRfWL4dMST9U2dTnMs4Ykyw-h9auFi52F3dgH4e4ZjFfQrP4KPWVddKFvK5m5xueK5Ht |
link.rule.ids | 310,311,783,787,792,793,799,27937,55086 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8MgGH6zzIN68WMz6vzg4NHSD2htj7Nx63RdjJvJbk0p7GLDjGsP-uuFdh9qdvAEIQHeAOF9gOd5AbjxqMKgjGlFD9e3VQ431BEsVWvZ5iLgbmpX-op45EWv9HHqThtwu9bCCCEq8pnAOlu95fN5VuqrMlMvLkrUhrujcLXv1WqtDaEj8PyVLMYKzMF4HIaau3WHl_V-faBS-Y_eAcSrnmvayBsuC4azrz9BGf9r2iG0N0o99Lz2QUfQEPIY9n8EGWxB7BAcoK5EvqFVvlLkiGK3z5DtW_17c6FSuTDUadzIUw2gP9FLN0YKyyKFDZEC1wXKNbMIZTr2cxs6vYdJGBnauOS9jlWRLO0iJ9CUcylOAQWUceJYMxooz-wSzmyXpzObMj8lhBH_DFrbWjjfXnwNu9EkHibDweipA3t6uGtO8wU0i49SXCqXXbCraqa-AXMglTg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2017+IEEE+International+Solid-State+Circuits+Conference+%28ISSCC%29&rft.atitle=23.9+An+8-channel+4.5Gb+180GB%2Fs+18ns-row-latency+RAM+for+the+last+level+cache&rft.au=Ting%2C+Tah-Kang+Joseph&rft.au=Gyh-Bin+Wang&rft.au=Ming-Hung+Wang&rft.au=Chun-Peng+Wu&rft.date=2017-02-01&rft.pub=IEEE&rft.eissn=2376-8606&rft.spage=404&rft.epage=405&rft_id=info:doi/10.1109%2FISSCC.2017.7870432&rft.externalDocID=7870432 |