Hybrid Core Acceleration of UWB SIRE Radar Signal Processing
To move High-Performance Computing (HPC) closer to forward operating environments and missions, the Army Research Laboratory is developing approaches using hybrid, asymmetric core computing. By blending capabilities found in Graphics Processing Units (GPUs) and traditional von Neumann multicore Cent...
Saved in:
Published in | IEEE transactions on parallel and distributed systems Vol. 22; no. 1; pp. 46 - 57 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
New York
IEEE
01.01.2011
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
ISSN | 1045-9219 1558-2183 |
DOI | 10.1109/TPDS.2010.117 |
Cover
Abstract | To move High-Performance Computing (HPC) closer to forward operating environments and missions, the Army Research Laboratory is developing approaches using hybrid, asymmetric core computing. By blending capabilities found in Graphics Processing Units (GPUs) and traditional von Neumann multicore Central Processing Units (CPUs), approaches are being developed and optimized to provide at or near real-time processing speeds for research project applications. Algorithms are designed to partition work to resources best designed to handle the processing load. The use of commodity resources allows the design to be flexible throughout the life cycle without the costly and time-consuming delays associated with Application-Specific Integrated Circuit (ASIC) development. This paradigm allows for rapid technology transfer to end users. In this paper, we describe a synchronous impulse reconstruction radar imaging algorithm that has been designed for hybrid CPU-GPU processing. We discuss various optimizations such as asynchronous task partitioning between the CPU and GPU as well as data movement reduction. We also discuss analysis and design of the algorithms within the context of two programming models: NVIDIA's CUDA and AMD's ATI Brook+. Finally, we report on the speedup achieved by this approach that allowed us to take a code once restricted to postprocessing and transform it into one that exceeds real-time performance requirements. |
---|---|
AbstractList | To move High-Performance Computing (HPC) closer to forward operating environments and missions, the Army Research Laboratory is developing approaches using hybrid, asymmetric core computing. By blending capabilities found in Graphics Processing Units (GPUs) and traditional von Neumann multicore Central Processing Units (CPUs), approaches are being developed and optimized to provide at or near real-time processing speeds for research project applications. Algorithms are designed to partition work to resources best designed to handle the processing load. The use of commodity resources allows the design to be flexible throughout the life cycle without the costly and time-consuming delays associated with Application-Specific Integrated Circuit (ASIC) development. This paradigm allows for rapid technology transfer to end users. In this paper, we describe a synchronous impulse reconstruction radar imaging algorithm that has been designed for hybrid CPU-GPU processing. We discuss various optimizations such as asynchronous task partitioning between the CPU and GPU as well as data movement reduction. We also discuss analysis and design of the algorithms within the context of two programming models: NVIDIA's CUDA and AMD's ATI Brook+. Finally, we report on the speedup achieved by this approach that allowed us to take a code once restricted to postprocessing and transform it into one that exceeds real-time performance requirements. |
Author | Ross, James A Richie, David A Nguyen, Lam H Song Jun Park Shires, Dale R Henz, Brian J |
Author_xml | – sequence: 1 surname: Song Jun Park fullname: Song Jun Park email: song.jun.park@us.army.mil organization: US Army Res. Lab., Aberdeen Proving Ground, MD, USA – sequence: 2 givenname: James A surname: Ross fullname: Ross, James A email: jross@hpti.com organization: High Performance Technol., Inc., Aberdeen Proving Ground, MD, USA – sequence: 3 givenname: Dale R surname: Shires fullname: Shires, Dale R email: dale.shires@us.army.mil organization: US Army Res. Lab., Aberdeen Proving Ground, MD, USA – sequence: 4 givenname: David A surname: Richie fullname: Richie, David A email: drichie@browndeertechnology.com organization: Brown Deer Technol., Forest Hill, MD, USA – sequence: 5 givenname: Brian J surname: Henz fullname: Henz, Brian J email: brian.j.henz@us.army.mil organization: US Army Res. Lab., Aberdeen Proving Ground, MD, USA – sequence: 6 givenname: Lam H surname: Nguyen fullname: Nguyen, Lam H email: lam.nguyenj@us.army.mil organization: US Army Res. Lab., Aberdeen Proving Ground, MD, USA |
BookMark | eNp1kM1rAjEQxUOxULU99tRLoIee1mY2WbOBXqy1VRAqftDjks1mJbJubLIe_O8btfQg9DTz4PeGN6-DWrWtNUL3QHoARDwvZ2-LXkxOkl-hNiRJGsWQ0lbYCUsiEYO4QR3vN4QASwhro5fxIXemwEPrNB4opSvtZGNsjW2JV1-veDGZj_BcFtLhhVnXssIzZ5X23tTrW3Rdysrru9_ZRav30XI4jqafH5PhYBopCryJCk2lVCkHnvMyJUmf5lqUZc4B4lwmkhGiWb-gTECfkUJSpXLGmeRKUMmEpl30dL67c_Z7r32TbY0PUStZa7v3mQDWjzkFGsjHC3Jj9y6k9hkQSiBkSNNA0TOlnPXe6TJTpjl93ThpqoBmx0KzY6HZsdAgeXBFF66dM1vpDv_yD2feaK3_2IRxzkDQHwfcf1c |
CODEN | ITDSEO |
CitedBy_id | crossref_primary_10_1007_s11390_012_1206_3 crossref_primary_10_1140_epjst_e2011_01398_x crossref_primary_10_1109_TCAD_2023_3340128 crossref_primary_10_1016_j_cageo_2011_08_007 crossref_primary_10_1142_S0218001422580083 crossref_primary_10_1140_epjst_e2012_01642_y crossref_primary_10_1145_2788396 crossref_primary_10_1109_TIM_2011_2152590 |
Cites_doi | 10.1109/DoD.HPCMP.UGC.2008.69 10.1155/2009/727965 10.1109/MCSE.2009.204 10.1117/12.438204 10.1117/12.719688 10.1109/CLUSTR.2009.5289125 10.1117/12.243085 10.1117/12.820480 10.1155/2008/930250 10.1145/1401132.1401148 10.1109/JPROC.2008.917757 10.21236/ada499569 10.1109/IGARSS.2005.1526560 10.1145/1401132.1401150 10.1145/1365490.1365498 10.1145/1365490.1365500 10.1145/1531743.1531766 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Jan 2011 |
Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Jan 2011 |
DBID | 97E RIA RIE AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3 H8D |
DOI | 10.1109/TPDS.2010.117 |
DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005-present IEEE All-Society Periodicals Package (ASPP) 1998-Present IEEE Xplore CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional ANTE: Abstracts in New Technology & Engineering Engineering Research Database Aerospace Database |
DatabaseTitle | CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional Aerospace Database Engineering Research Database ANTE: Abstracts in New Technology & Engineering |
DatabaseTitleList | Technology Research Database Aerospace Database |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Computer Science |
EISSN | 1558-2183 |
EndPage | 57 |
ExternalDocumentID | 2724262911 10_1109_TPDS_2010_117 5477419 |
Genre | orig-research |
GroupedDBID | --Z -~X .DC 0R~ 29I 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABFSI ABQJQ ABVLG ACGFO ACIWK AENEX AETIX AGQYO AGSQL AHBIQ AI. AIBXA AKJIK AKQYR ALLEH ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 E.L EBS EJD HZ~ H~9 ICLAB IEDLZ IFIPE IFJZH IPLJI JAVBF LAI M43 MS~ O9- OCL P2P PQQKQ RIA RIE RNI RNS RZB TN5 TWZ UHB VH1 AAYOK AAYXX CITATION RIG 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3 H8D |
ID | FETCH-LOGICAL-c317t-de3aac8717b7f80563be9ffb7112ba5a400e46d3491640da3ccb474a7c93a49e3 |
IEDL.DBID | RIE |
ISSN | 1045-9219 |
IngestDate | Thu Sep 04 17:10:32 EDT 2025 Sun Jun 29 16:22:51 EDT 2025 Tue Jul 01 05:22:14 EDT 2025 Thu Apr 24 22:51:50 EDT 2025 Wed Aug 27 02:52:31 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c317t-de3aac8717b7f80563be9ffb7112ba5a400e46d3491640da3ccb474a7c93a49e3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23 |
PQID | 1030171788 |
PQPubID | 85437 |
PageCount | 12 |
ParticipantIDs | proquest_journals_1030171788 ieee_primary_5477419 proquest_miscellaneous_914627313 crossref_citationtrail_10_1109_TPDS_2010_117 crossref_primary_10_1109_TPDS_2010_117 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2011-Jan. 2011-01-00 20110101 |
PublicationDateYYYYMMDD | 2011-01-01 |
PublicationDate_xml | – month: 01 year: 2011 text: 2011-Jan. |
PublicationDecade | 2010 |
PublicationPlace | New York |
PublicationPlace_xml | – name: New York |
PublicationTitle | IEEE transactions on parallel and distributed systems |
PublicationTitleAbbrev | TPDS |
PublicationYear | 2011 |
Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
References | ref13 Shen (ref2) 2008 ref15 ref14 ref10 ref21 (ref20) 2009 ref1 ref17 ref16 ref19 ref18 ref8 ref7 ref9 (ref22) 2009 ref4 ref3 ref6 ref5 Nguyen (ref12) (ref11) 2008 |
References_xml | – ident: ref1 doi: 10.1109/DoD.HPCMP.UGC.2008.69 – ident: ref14 doi: 10.1155/2009/727965 – volume-title: ATI Stream Computing User Guide year: 2009 ident: ref20 – volume-title: Nvidia Cuda Compute Unified Device Architecture Programming Guide Version 2.0 year: 2008 ident: ref11 – volume-title: master’s thesis year: 2008 ident: ref2 article-title: Evaluating Impulse C and Multiple Parallelism Partitions for a Low-Cost Reconfigurable Computing System – ident: ref5 doi: 10.1109/MCSE.2009.204 – ident: ref15 doi: 10.1117/12.438204 – ident: ref8 doi: 10.1117/12.719688 – ident: ref9 doi: 10.1109/CLUSTR.2009.5289125 – ident: ref13 doi: 10.1117/12.243085 – ident: ref17 doi: 10.1117/12.820480 – ident: ref10 doi: 10.1155/2008/930250 – ident: ref4 doi: 10.1145/1401132.1401148 – ident: ref6 doi: 10.1109/JPROC.2008.917757 – ident: ref7 doi: 10.21236/ada499569 – ident: ref16 doi: 10.1109/IGARSS.2005.1526560 – volume-title: OpenCL: Parallel Computing for Heterogeneous Devices year: 2009 ident: ref22 – ident: ref3 doi: 10.1145/1401132.1401150 – volume-title: Proc. SPIE Conf. ident: ref12 article-title: Obstacle Avoidance and Concealed Target Detection Using the Army Research Laboratory Ultra-Wideband Synchronous Impulse Reconstruction (UWB SIRE) Forward Imaging Radar – ident: ref18 doi: 10.1145/1365490.1365498 – ident: ref19 doi: 10.1145/1365490.1365500 – ident: ref21 doi: 10.1145/1531743.1531766 |
SSID | ssj0014504 |
Score | 2.0705564 |
Snippet | To move High-Performance Computing (HPC) closer to forward operating environments and missions, the Army Research Laboratory is developing approaches using... |
SourceID | proquest crossref ieee |
SourceType | Aggregation Database Enrichment Source Index Database Publisher |
StartPage | 46 |
SubjectTerms | Acceleration Algorithm design and analysis Algorithms Application specific integrated circuits Central Processing Unit Central processing units Computation computers in other systems-military Design Design engineering emerging technologies Graphics Heterogeneous (hybrid) systems High performance computing Mathematical models Military computing Partitioning Partitioning algorithms Radar signal processing Real time Reconstruction Signal processing algorithms signal processing systems Studies |
Title | Hybrid Core Acceleration of UWB SIRE Radar Signal Processing |
URI | https://ieeexplore.ieee.org/document/5477419 https://www.proquest.com/docview/1030171788 https://www.proquest.com/docview/914627313 |
Volume | 22 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT8MwDLbGTnBgsIEoDJQD4rRuK0nXRuIyxqaBNIT2ELtVSZoiBNoQ6w7w63H6AvGQkHqo1NRNYrt2EvszwKmgwtN-6Nici47NfKZszjyDRSg9rZjfZsLkDo9uO8MZu5m78xI0ilwYrXUSfKab5jY5yw-Xam22ylouUmEG43MDxSzN1SpODJiblArE1YVrc1TDTzzN1vTuapIGcTlJXbJP-5MUVPnxF05My6ACo7xTaUTJU3Mdy6Z6_4bX-N9e78B25mOSbioUu1DSiypU8voNJFPnKmx9ASOswcXwzWRvkR7SJ12l0B6l0kGWEZndX5LJ9bhPxiIUSOLxwXwhyzLA1_dgNuhPe0M7q61gK_QYYjvUVAiFqyXkSeSjF0Sl5lEkPfS_pHAFqrZmnZAydB9ZOxRUKck8JjzFqWBc030oL5YLfQAk9FzVwQFSqc6ZbEsuFfel4JFynChyhAWNfMYDlQGPm_oXz0GyAGnzwDAoMAwy0OMWnBXNX1LEjb8a1sxkF42yebagnrMzyPRxFZhiag4O1vctIMVj1CRzPCIWerleBRyNBjpzDj38nfARbKYbyuaqQzl-Xetj9EhieZKI4gfQRNwf |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT8JAEJ4QPKgHH6ARRd2D8USBulvaTbwgQkCBGB6RW7O73RqjAaNw0F_vbFuK8ZGY9NCk2-nuzk7n28d8A3AmqHC1F9gW56JmMY8pizPXcBFKVyvmVZkwscO9fq09ZjcTZ5KBUhoLo7WODp_psrmN9vKDmVqYpbKKg1KY4fhcQ7_PnDhaK90zYE6ULBDnF47F0RBXjJqV0d31MD7GZUeZyVYeKEqp8uM_HDmX1jb0ltWKz5Q8lRdzWVYf3xgb_1vvHdhKUCapx8NiFzJ6moPtZQYHkhh0Dja_0BHm4bL9buK3SAPlk7pS6JHi8UFmIRnfX5FhZ9AkAxEIFPH4YL6QxBng63swbjVHjbaVZFewFGKGuRVoKoTC-RJqJfQQB1GpeRhKFxGYFI5A49asFlCGAJJVA0GVksxlwlWcCsY13YfsdDbVB0AC11E1bCCV6oLJquRScU8KHirbDkNbFKC07HFfJdTjJgPGsx9NQarcNwryjYIM-XgBztPiLzHnxl8F86az00JJPxeguFSnn1jkm2_SqdnYWM8rAEkfoy2ZDRIx1bPFm8_RbSCcs-nh74JPYb096nX9bqd_ewQb8fKyuYqQnb8u9DHik7k8iYblJ3_632w |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Hybrid+Core+Acceleration+of+UWB+SIRE+Radar+Signal+Processing&rft.jtitle=IEEE+transactions+on+parallel+and+distributed+systems&rft.au=Park%2C+Song+Jun&rft.au=Ross%2C+James&rft.au=Shires%2C+Dale&rft.au=Richie%2C+David&rft.date=2011-01-01&rft.pub=The+Institute+of+Electrical+and+Electronics+Engineers%2C+Inc.+%28IEEE%29&rft.issn=1045-9219&rft.eissn=1558-2183&rft.volume=22&rft.issue=1&rft.spage=46&rft_id=info:doi/10.1109%2FTPDS.2010.117&rft.externalDBID=NO_FULL_TEXT&rft.externalDocID=2724262911 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1045-9219&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1045-9219&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1045-9219&client=summon |