Design and Performance Evaluation of Image Processing Algorithms on GPUs
In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitati...
Saved in:
Published in | IEEE transactions on parallel and distributed systems Vol. 22; no. 1; pp. 91 - 104 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
New York
IEEE
01.01.2011
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
ISSN | 1045-9219 1558-2183 |
DOI | 10.1109/TPDS.2010.115 |
Cover
Abstract | In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitatively evaluate algorithm characteristics. In addition, we show that a range of image processing algorithms map readily to CUDA using multiview stereo matching, linear feature extraction, JPEG2000 image encoding, and nonphotorealistic rendering (NPR) as our example applications. The algorithms are carefully selected from major domains of image processing, so they inherently contain a variety of subalgorithms with diverse characteristics when implemented on the GPU. Performance is evaluated in terms of execution time and is compared to the fastest host-only version implemented using OpenMP. It is shown that the observed speedup varies extensively depending on the characteristics of each algorithm. Intensive analysis is conducted to show the appropriateness of the proposed metrics in predicting the effectiveness of an application for parallel implementation. |
---|---|
AbstractList | In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitatively evaluate algorithm characteristics. In addition, we show that a range of image processing algorithms map readily to CUDA using multiview stereo matching, linear feature extraction, JPEG2000 image encoding, and nonphotorealistic rendering (NPR) as our example applications. The algorithms are carefully selected from major domains of image processing, so they inherently contain a variety of subalgorithms with diverse characteristics when implemented on the GPU. Performance is evaluated in terms of execution time and is compared to the fastest host-only version implemented using OpenMP. It is shown that the observed speedup varies extensively depending on the characteristics of each algorithm. Intensive analysis is conducted to show the appropriateness of the proposed metrics in predicting the effectiveness of an application for parallel implementation. |
Author | Man Hee Lee In Kyu Park Kim, Chris W Sungdae Cho Singhal, Nitin |
Author_xml | – sequence: 1 surname: In Kyu Park fullname: In Kyu Park email: pik@inha.ac.kr organization: Sch. of Inf. & Commun. Eng., Inha Univ., Incheon, South Korea – sequence: 2 givenname: Nitin surname: Singhal fullname: Singhal, Nitin email: nitin.singhal@ieee.org organization: Telecommun. Module Lab., Samsung Electron. Co., Ltd., Suwon, South Korea – sequence: 3 surname: Man Hee Lee fullname: Man Hee Lee email: maninara@hotmail.com organization: Sch. of Inf. & Commun. Eng., Inha Univ., Incheon, South Korea – sequence: 4 surname: Sungdae Cho fullname: Sungdae Cho email: s-d.cho@samsung.com organization: Telecommun. Module Lab., Samsung Electron. Co., Ltd., Suwon, South Korea – sequence: 5 givenname: Chris W surname: Kim fullname: Kim, Chris W email: chkim@nvidia.com organization: NVIDIA Corp., Seoul, South Korea |
BookMark | eNp10EFLwzAUB_AgE9ymR09eCh48deY1SZsexza3geDA7VyyNqkZbTKTVvDb227iYeAlyYPfe7z8R2hgrJEI3QOeAOD0ebuZv08ifCrZFRoCYzyMgJNB98aUhWkE6Q0aeX_AGCjDdIhWc-l1aQJhimAjnbKuFiaXweJLVK1otDWBVcG6FqUMNs7m0nttymBaldbp5qP2QSeWm52_RddKVF7e_d5jtHtZbGer8PVtuZ5NX8OccNKEESeCkj2nABEQlgtVyAJyTOlexIqReK8K2BcyLZJYYKW4AN6dCRcKMGExGaOn89yjs5-t9E1Wa5_LqhJG2tZnKdA4YkB6-XghD7Z1plsu60ZhSCDhrFPkrHJnvXdSZbluTh9vnNBVR7M-3KwPN-vD7cq-K7zoOjpdC_f9r384ey2l_LOMJgmFhPwArueE3g |
CODEN | ITDSEO |
CitedBy_id | crossref_primary_10_15803_ijnc_3_1_75 crossref_primary_10_1016_j_cviu_2010_11_017 crossref_primary_10_1177_1094342013518807 crossref_primary_10_1145_3291050 crossref_primary_10_1007_s11042_013_1440_x crossref_primary_10_1007_s11390_012_1303_3 crossref_primary_10_1002_mp_12157 crossref_primary_10_1007_s11554_020_00974_z crossref_primary_10_1364_AO_471747 crossref_primary_10_9708_jksci_2013_18_10_001 crossref_primary_10_1364_OE_416465 crossref_primary_10_1109_JSTARS_2016_2516503 crossref_primary_10_14372_IEMEK_2014_9_1_17 crossref_primary_10_9708_jksci_2014_19_8_001 crossref_primary_10_5772_61058 crossref_primary_10_1007_s11554_016_0623_x crossref_primary_10_1016_j_jocs_2014_07_003 crossref_primary_10_1049_iet_cdt_2013_0141 crossref_primary_10_1109_TII_2017_2724205 crossref_primary_10_1007_s11432_013_4887_5 crossref_primary_10_2514_1_A33755 crossref_primary_10_1007_s11554_019_00894_7 crossref_primary_10_1002_cpe_4372 crossref_primary_10_3389_fpsyt_2017_00026 crossref_primary_10_1016_j_cviu_2014_04_009 crossref_primary_10_1109_34_476511 crossref_primary_10_14372_IEMEK_2013_8_2_087 crossref_primary_10_3390_app13020870 crossref_primary_10_1016_j_jpdc_2012_05_009 crossref_primary_10_1109_TCE_2012_6227441 crossref_primary_10_1109_TC_2020_3035826 crossref_primary_10_3390_electronics13152971 crossref_primary_10_1007_s11554_016_0582_2 crossref_primary_10_1007_s11554_017_0710_7 crossref_primary_10_1002_cpe_1900 crossref_primary_10_1007_s11042_020_08756_2 crossref_primary_10_1109_JSTARS_2013_2239261 crossref_primary_10_1587_transinf_E95_D_1475 crossref_primary_10_1109_34_588025 crossref_primary_10_1109_TIFS_2013_2291314 crossref_primary_10_1016_j_asej_2016_11_002 crossref_primary_10_9708_jksci_2015_20_1_001 crossref_primary_10_1007_s11227_014_1343_2 crossref_primary_10_1109_TIP_2014_2311656 crossref_primary_10_5909_JBE_2014_19_3_316 crossref_primary_10_1109_TPDS_2024_3486219 crossref_primary_10_1007_s11227_015_1382_3 crossref_primary_10_7763_IJMLC_2015_V5_483 crossref_primary_10_1016_j_jvcir_2016_07_003 crossref_primary_10_1007_s10278_018_0110_y crossref_primary_10_1007_s11432_015_5516_2 |
Cites_doi | 10.1145/1459359.1459578 10.1155/asp/2006/42568 10.1016/0146-664X(80)90049-0 10.1109/ICIP.2009.5414207 10.1109/TCSVT.2005.846440 10.1109/TPAMI.1986.4767851 10.1109/34.192463 10.1093/oso/9780195094237.003.0002 10.1016/j.rti.2005.04.002 10.1145/1101149.1101334 10.1007/s11554-008-0085-x 10.1137/S0036141095289051 10.1145/1281500.1281576 10.1109/78.992147 10.1109/83.847830 10.1109/ICCV.1998.710815 10.1109/TCSVT.2007.903789 10.1111/j.1467-8659.2007.01012.x 10.1145/1345206.1345220 10.1109/MCG.2003.1210867 10.1145/882262.882362 10.1109/CVPR.2007.383246 10.1109/JPROC.2008.917757 10.1109/ICIP.2000.899223 10.1145/280814.280951 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Jan 2011 |
Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Jan 2011 |
DBID | 97E RIA RIE AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3 |
DOI | 10.1109/TPDS.2010.115 |
DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional ANTE: Abstracts in New Technology & Engineering Engineering Research Database |
DatabaseTitle | CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional Engineering Research Database ANTE: Abstracts in New Technology & Engineering |
DatabaseTitleList | Technology Research Database Technology Research Database |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Computer Science |
EISSN | 1558-2183 |
EndPage | 104 |
ExternalDocumentID | 2724262901 10_1109_TPDS_2010_115 5477417 |
Genre | orig-research |
GroupedDBID | --Z -~X .DC 0R~ 29I 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABFSI ABQJQ ABVLG ACGFO ACIWK AENEX AETIX AGQYO AGSQL AHBIQ AI. AIBXA AKJIK AKQYR ALLEH ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 E.L EBS EJD HZ~ H~9 ICLAB IEDLZ IFIPE IFJZH IPLJI JAVBF LAI M43 MS~ O9- OCL P2P PQQKQ RIA RIE RNI RNS RZB TN5 TWZ UHB VH1 AAYOK AAYXX CITATION RIG 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3 |
ID | FETCH-LOGICAL-c383t-283a43b84112135cafded1c044ba6f536bfd1bde9d76a0ff8a18ff878af103563 |
IEDL.DBID | RIE |
ISSN | 1045-9219 |
IngestDate | Thu Sep 04 17:59:46 EDT 2025 Mon Jun 30 04:31:41 EDT 2025 Tue Jul 01 05:22:14 EDT 2025 Thu Apr 24 22:50:52 EDT 2025 Wed Aug 27 02:52:31 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c383t-283a43b84112135cafded1c044ba6f536bfd1bde9d76a0ff8a18ff878af103563 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23 |
PQID | 1030171785 |
PQPubID | 85437 |
PageCount | 14 |
ParticipantIDs | proquest_miscellaneous_914625136 ieee_primary_5477417 crossref_citationtrail_10_1109_TPDS_2010_115 crossref_primary_10_1109_TPDS_2010_115 proquest_journals_1030171785 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2011-Jan. 2011-01-00 20110101 |
PublicationDateYYYYMMDD | 2011-01-01 |
PublicationDate_xml | – month: 01 year: 2011 text: 2011-Jan. |
PublicationDecade | 2010 |
PublicationPlace | New York |
PublicationPlace_xml | – name: New York |
PublicationTitle | IEEE transactions on parallel and distributed systems |
PublicationTitleAbbrev | TPDS |
PublicationYear | 2011 |
Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
References | ref13 Frahm (ref16) 2008 ref35 Rost (ref10) 2006 ref12 ref34 ref15 ref37 ref14 Halffill (ref21) 2008 ref36 ref31 ref11 ref33 ref32 (ref20) 2009 Grama (ref22) 2003 ref2 ref17 ref38 ref19 (ref1) 2010 (ref6) 2010 (ref18) 2010 (ref4) 2010 ref24 ref23 Huang (ref30) ref25 Rabbani (ref27) 2002; 17 ref28 ref29 ref8 ref7 ref9 ref3 (ref26) 2000 (ref5) 2010 |
References_xml | – ident: ref13 doi: 10.1145/1459359.1459578 – year: 2000 ident: ref26 publication-title: Information Technology—JPEG2000 Image Coding System, ISO/IEC Int’l Standard 15444-1, ITU Recommendation T.800 – ident: ref32 doi: 10.1155/asp/2006/42568 – ident: ref24 doi: 10.1016/0146-664X(80)90049-0 – volume-title: Compute Unified Device Architecture (CUDA) year: 2010 ident: ref4 – ident: ref7 doi: 10.1109/ICIP.2009.5414207 – ident: ref11 doi: 10.1109/TCSVT.2005.846440 – ident: ref25 doi: 10.1109/TPAMI.1986.4767851 – ident: ref33 doi: 10.1109/34.192463 – ident: ref35 doi: 10.1093/oso/9780195094237.003.0002 – volume-title: Open Computing Language (OpenCL) year: 2010 ident: ref5 – ident: ref12 doi: 10.1016/j.rti.2005.04.002 – ident: ref15 doi: 10.1145/1101149.1101334 – ident: ref14 doi: 10.1007/s11554-008-0085-x – ident: ref34 doi: 10.1137/S0036141095289051 – ident: ref9 doi: 10.1145/1281500.1281576 – ident: ref29 doi: 10.1109/78.992147 – ident: ref28 doi: 10.1109/83.847830 – ident: ref37 doi: 10.1109/ICCV.1998.710815 – ident: ref31 doi: 10.1109/TCSVT.2007.903789 – volume-title: CUJ2K: Jpeg2000 Encoder on Cuda year: 2010 ident: ref18 – ident: ref2 doi: 10.1111/j.1467-8659.2007.01012.x – volume-title: NVIDIA CUDA Programming Guide 2.3 year: 2009 ident: ref20 – ident: ref17 doi: 10.1145/1345206.1345220 – volume-title: OpenMP Website year: 2010 ident: ref6 – ident: ref36 doi: 10.1109/MCG.2003.1210867 – volume-title: General Purpose GPU Programming (GPGPU) Website year: 2010 ident: ref1 – ident: ref8 doi: 10.1145/882262.882362 – ident: ref23 doi: 10.1109/CVPR.2007.383246 – start-page: 571 volume-title: Proc. IEEE Int’l Conf. Image Processing ident: ref30 article-title: Hardware Implementation of Shape-Adaptive Discrete Wavelet Transform with the JPEG Defaulted (9, 7) Filter Bank – volume-title: MicroProcessor Report Online year: 2008 ident: ref21 article-title: Parallel Processing with CUDA – volume-title: Proc. CVPR Workshop Visual Computer Vision on GPUs (CVGPU) year: 2008 ident: ref16 – ident: ref3 doi: 10.1109/JPROC.2008.917757 – volume-title: OpenGL(R) Shading Language year: 2006 ident: ref10 – volume-title: Introduction to Parallel Computing year: 2003 ident: ref22 – volume: 17 start-page: 3 issue: 1 year: 2002 ident: ref27 article-title: An Overview of the JPEG 2000 Still Image Compression Standard publication-title: Signal Processing: Image Comm. – ident: ref19 doi: 10.1109/ICIP.2000.899223 – ident: ref38 doi: 10.1145/280814.280951 |
SSID | ssj0014504 |
Score | 2.3283815 |
Snippet | In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using... |
SourceID | proquest crossref ieee |
SourceType | Aggregation Database Enrichment Source Index Database Publisher |
StartPage | 91 |
SubjectTerms | Algorithm design and analysis Algorithms Computer architecture Computer vision Concurrent computing CUDA Design engineering Document management GPGPU GPU Graphics processing unit Image processing Mathematical models parallel implementation Parallel processing Parallel programming Performance evaluation Programming Rendering Scattering Studies |
Title | Design and Performance Evaluation of Image Processing Algorithms on GPUs |
URI | https://ieeexplore.ieee.org/document/5477417 https://www.proquest.com/docview/1030171785 https://www.proquest.com/docview/914625136 |
Volume | 22 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3dS8MwED90T_rgx1ScTsmD-GTnYpsueRxOncJkoAPfSpoPFbUV173413tJuyp-gFBKSy4hJLnkLnf3O4ADKyXDrV8HoYkVKijCBsKmIrAoavSUw0vx3u6j63g4ia7u2N0CHNWxMMYY73xmOu7T2_J1rmbuquyYRdgC7S3CIi6zMlarthhEzKcKRO2CBQLZ8BNP8_h2PLgpnbioy3775fzxCVV-7ML-aDlfhdG8U6VHyVNnVqQd9f4Nr_G_vV6DlUrGJP1yUazDgsmasDrP30Aqdm7C8hcwwg0YDrwzB5GZJuPPcAJyVuOBk9ySyxfcgEgVXoD1SP_5Pn97LB5epgQpLsaT6SZMzs9uT4dBlWchUKifFgFKGDIKUx5Rh-_GlLTaaKq6UZTK2LIwTq2mqTZC92LZtZZLyvHd49LSbsjicAsaWZ6ZbSBYHKOKxAU1PGJMpieMK6s5VUZYa2QLjuajn6gKhNzlwnhOvDLSFYmbrMRNFv6yFhzW5K8l-sZfhBtu4Guiasxb0J5PbVLx5jRxidUoarEca5G6GLnKmUpkZvLZNBF4gKDkF8Y7vze8C0vl5bJ72tAo3mZmD6WTIt33y_IDKlnhiA |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT-MwEB6xcGD3wBttWRZ8QJxIqYmd2kfEqzyKKtFK3CLHD3YFJIimF379jp00IFgkpChK5LFl2R7PjD3zDcCOU4rj1m-i2CYaDRTpIukyGTlUNbra46UEb_f-ddIbsYtbfjsDe00sjLU2OJ_Ztv8Md_mm0BN_VLbPGbZAu99gDuU-41W0VnNnwHhIFoj2BY8kMuIroub-cHB8U7lxUZ__9o0ECilVPuzDQbicLkJ_2q3Kp-S-PSmztn55h9j41X4vwUKtZZLDalksw4zNV2BxmsGB1Ay9Aj_ewBGuQu84uHMQlRsyeA0oICcNIjgpHDl_xC2I1AEGWI8cPtwVz3_LP49jghRng9F4DUanJ8OjXlRnWog0WqhlhDqGYnEmGPUIb1wrZ6yhusNYphLH4yRzhmbGStNNVMc5oajAd1coRzsxT-J1mM2L3P4EgsUJGklCUisY5yo74EI7I6i20jmrWrA3Hf1U1zDkPhvGQxrMkY5M_WSlfrLwl7dgtyF_qvA3PiNc9QPfENVj3oLN6dSmNXeOU59ajaIdK7AWaYqRr_xlicptMRmnEkUI6n5xsvH_hrdhvjfsX6VX59eXv-B7ddTsn02YLZ8n9jfqKmW2FZboPx-f5NU |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Design+and+Performance+Evaluation+of+Image+Processing+Algorithms+on+GPUs&rft.jtitle=IEEE+transactions+on+parallel+and+distributed+systems&rft.au=Park%2C+In+Kyu&rft.au=Singhal%2C+Nitin&rft.au=Lee%2C+Man+Hee&rft.au=Cho%2C+Sungdae&rft.date=2011-01-01&rft.issn=1045-9219&rft.volume=22&rft.issue=1&rft.spage=91&rft.epage=104&rft_id=info:doi/10.1109%2FTPDS.2010.115&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1045-9219&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1045-9219&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1045-9219&client=summon |