Design and Performance Evaluation of Image Processing Algorithms on GPUs

In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitati...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on parallel and distributed systems Vol. 22; no. 1; pp. 91 - 104
Main Authors In Kyu Park, Singhal, Nitin, Man Hee Lee, Sungdae Cho, Kim, Chris W
Format Journal Article
LanguageEnglish
Published New York IEEE 01.01.2011
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN1045-9219
1558-2183
DOI10.1109/TPDS.2010.115

Cover

Abstract In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitatively evaluate algorithm characteristics. In addition, we show that a range of image processing algorithms map readily to CUDA using multiview stereo matching, linear feature extraction, JPEG2000 image encoding, and nonphotorealistic rendering (NPR) as our example applications. The algorithms are carefully selected from major domains of image processing, so they inherently contain a variety of subalgorithms with diverse characteristics when implemented on the GPU. Performance is evaluated in terms of execution time and is compared to the fastest host-only version implemented using OpenMP. It is shown that the observed speedup varies extensively depending on the characteristics of each algorithm. Intensive analysis is conducted to show the appropriateness of the proposed metrics in predicting the effectiveness of an application for parallel implementation.
AbstractList In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitatively evaluate algorithm characteristics. In addition, we show that a range of image processing algorithms map readily to CUDA using multiview stereo matching, linear feature extraction, JPEG2000 image encoding, and nonphotorealistic rendering (NPR) as our example applications. The algorithms are carefully selected from major domains of image processing, so they inherently contain a variety of subalgorithms with diverse characteristics when implemented on the GPU. Performance is evaluated in terms of execution time and is compared to the fastest host-only version implemented using OpenMP. It is shown that the observed speedup varies extensively depending on the characteristics of each algorithm. Intensive analysis is conducted to show the appropriateness of the proposed metrics in predicting the effectiveness of an application for parallel implementation.
Author Man Hee Lee
In Kyu Park
Kim, Chris W
Sungdae Cho
Singhal, Nitin
Author_xml – sequence: 1
  surname: In Kyu Park
  fullname: In Kyu Park
  email: pik@inha.ac.kr
  organization: Sch. of Inf. & Commun. Eng., Inha Univ., Incheon, South Korea
– sequence: 2
  givenname: Nitin
  surname: Singhal
  fullname: Singhal, Nitin
  email: nitin.singhal@ieee.org
  organization: Telecommun. Module Lab., Samsung Electron. Co., Ltd., Suwon, South Korea
– sequence: 3
  surname: Man Hee Lee
  fullname: Man Hee Lee
  email: maninara@hotmail.com
  organization: Sch. of Inf. & Commun. Eng., Inha Univ., Incheon, South Korea
– sequence: 4
  surname: Sungdae Cho
  fullname: Sungdae Cho
  email: s-d.cho@samsung.com
  organization: Telecommun. Module Lab., Samsung Electron. Co., Ltd., Suwon, South Korea
– sequence: 5
  givenname: Chris W
  surname: Kim
  fullname: Kim, Chris W
  email: chkim@nvidia.com
  organization: NVIDIA Corp., Seoul, South Korea
BookMark eNp10EFLwzAUB_AgE9ymR09eCh48deY1SZsexza3geDA7VyyNqkZbTKTVvDb227iYeAlyYPfe7z8R2hgrJEI3QOeAOD0ebuZv08ifCrZFRoCYzyMgJNB98aUhWkE6Q0aeX_AGCjDdIhWc-l1aQJhimAjnbKuFiaXweJLVK1otDWBVcG6FqUMNs7m0nttymBaldbp5qP2QSeWm52_RddKVF7e_d5jtHtZbGer8PVtuZ5NX8OccNKEESeCkj2nABEQlgtVyAJyTOlexIqReK8K2BcyLZJYYKW4AN6dCRcKMGExGaOn89yjs5-t9E1Wa5_LqhJG2tZnKdA4YkB6-XghD7Z1plsu60ZhSCDhrFPkrHJnvXdSZbluTh9vnNBVR7M-3KwPN-vD7cq-K7zoOjpdC_f9r384ey2l_LOMJgmFhPwArueE3g
CODEN ITDSEO
CitedBy_id crossref_primary_10_15803_ijnc_3_1_75
crossref_primary_10_1016_j_cviu_2010_11_017
crossref_primary_10_1177_1094342013518807
crossref_primary_10_1145_3291050
crossref_primary_10_1007_s11042_013_1440_x
crossref_primary_10_1007_s11390_012_1303_3
crossref_primary_10_1002_mp_12157
crossref_primary_10_1007_s11554_020_00974_z
crossref_primary_10_1364_AO_471747
crossref_primary_10_9708_jksci_2013_18_10_001
crossref_primary_10_1364_OE_416465
crossref_primary_10_1109_JSTARS_2016_2516503
crossref_primary_10_14372_IEMEK_2014_9_1_17
crossref_primary_10_9708_jksci_2014_19_8_001
crossref_primary_10_5772_61058
crossref_primary_10_1007_s11554_016_0623_x
crossref_primary_10_1016_j_jocs_2014_07_003
crossref_primary_10_1049_iet_cdt_2013_0141
crossref_primary_10_1109_TII_2017_2724205
crossref_primary_10_1007_s11432_013_4887_5
crossref_primary_10_2514_1_A33755
crossref_primary_10_1007_s11554_019_00894_7
crossref_primary_10_1002_cpe_4372
crossref_primary_10_3389_fpsyt_2017_00026
crossref_primary_10_1016_j_cviu_2014_04_009
crossref_primary_10_1109_34_476511
crossref_primary_10_14372_IEMEK_2013_8_2_087
crossref_primary_10_3390_app13020870
crossref_primary_10_1016_j_jpdc_2012_05_009
crossref_primary_10_1109_TCE_2012_6227441
crossref_primary_10_1109_TC_2020_3035826
crossref_primary_10_3390_electronics13152971
crossref_primary_10_1007_s11554_016_0582_2
crossref_primary_10_1007_s11554_017_0710_7
crossref_primary_10_1002_cpe_1900
crossref_primary_10_1007_s11042_020_08756_2
crossref_primary_10_1109_JSTARS_2013_2239261
crossref_primary_10_1587_transinf_E95_D_1475
crossref_primary_10_1109_34_588025
crossref_primary_10_1109_TIFS_2013_2291314
crossref_primary_10_1016_j_asej_2016_11_002
crossref_primary_10_9708_jksci_2015_20_1_001
crossref_primary_10_1007_s11227_014_1343_2
crossref_primary_10_1109_TIP_2014_2311656
crossref_primary_10_5909_JBE_2014_19_3_316
crossref_primary_10_1109_TPDS_2024_3486219
crossref_primary_10_1007_s11227_015_1382_3
crossref_primary_10_7763_IJMLC_2015_V5_483
crossref_primary_10_1016_j_jvcir_2016_07_003
crossref_primary_10_1007_s10278_018_0110_y
crossref_primary_10_1007_s11432_015_5516_2
Cites_doi 10.1145/1459359.1459578
10.1155/asp/2006/42568
10.1016/0146-664X(80)90049-0
10.1109/ICIP.2009.5414207
10.1109/TCSVT.2005.846440
10.1109/TPAMI.1986.4767851
10.1109/34.192463
10.1093/oso/9780195094237.003.0002
10.1016/j.rti.2005.04.002
10.1145/1101149.1101334
10.1007/s11554-008-0085-x
10.1137/S0036141095289051
10.1145/1281500.1281576
10.1109/78.992147
10.1109/83.847830
10.1109/ICCV.1998.710815
10.1109/TCSVT.2007.903789
10.1111/j.1467-8659.2007.01012.x
10.1145/1345206.1345220
10.1109/MCG.2003.1210867
10.1145/882262.882362
10.1109/CVPR.2007.383246
10.1109/JPROC.2008.917757
10.1109/ICIP.2000.899223
10.1145/280814.280951
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Jan 2011
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Jan 2011
DBID 97E
RIA
RIE
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
F28
FR3
DOI 10.1109/TPDS.2010.115
DatabaseName IEEE All-Society Periodicals Package (ASPP) 2005–Present
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Electronic Library (IEL)
CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
DatabaseTitle CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
Engineering Research Database
ANTE: Abstracts in New Technology & Engineering
DatabaseTitleList Technology Research Database

Technology Research Database
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISSN 1558-2183
EndPage 104
ExternalDocumentID 2724262901
10_1109_TPDS_2010_115
5477417
Genre orig-research
GroupedDBID --Z
-~X
.DC
0R~
29I
4.4
5GY
5VS
6IK
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABFSI
ABQJQ
ABVLG
ACGFO
ACIWK
AENEX
AETIX
AGQYO
AGSQL
AHBIQ
AI.
AIBXA
AKJIK
AKQYR
ALLEH
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
E.L
EBS
EJD
HZ~
H~9
ICLAB
IEDLZ
IFIPE
IFJZH
IPLJI
JAVBF
LAI
M43
MS~
O9-
OCL
P2P
PQQKQ
RIA
RIE
RNI
RNS
RZB
TN5
TWZ
UHB
VH1
AAYOK
AAYXX
CITATION
RIG
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
F28
FR3
ID FETCH-LOGICAL-c383t-283a43b84112135cafded1c044ba6f536bfd1bde9d76a0ff8a18ff878af103563
IEDL.DBID RIE
ISSN 1045-9219
IngestDate Thu Sep 04 17:59:46 EDT 2025
Mon Jun 30 04:31:41 EDT 2025
Tue Jul 01 05:22:14 EDT 2025
Thu Apr 24 22:50:52 EDT 2025
Wed Aug 27 02:52:31 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c383t-283a43b84112135cafded1c044ba6f536bfd1bde9d76a0ff8a18ff878af103563
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
PQID 1030171785
PQPubID 85437
PageCount 14
ParticipantIDs proquest_miscellaneous_914625136
ieee_primary_5477417
crossref_citationtrail_10_1109_TPDS_2010_115
crossref_primary_10_1109_TPDS_2010_115
proquest_journals_1030171785
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2011-Jan.
2011-01-00
20110101
PublicationDateYYYYMMDD 2011-01-01
PublicationDate_xml – month: 01
  year: 2011
  text: 2011-Jan.
PublicationDecade 2010
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationTitle IEEE transactions on parallel and distributed systems
PublicationTitleAbbrev TPDS
PublicationYear 2011
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References ref13
Frahm (ref16) 2008
ref35
Rost (ref10) 2006
ref12
ref34
ref15
ref37
ref14
Halffill (ref21) 2008
ref36
ref31
ref11
ref33
ref32
(ref20) 2009
Grama (ref22) 2003
ref2
ref17
ref38
ref19
(ref1) 2010
(ref6) 2010
(ref18) 2010
(ref4) 2010
ref24
ref23
Huang (ref30)
ref25
Rabbani (ref27) 2002; 17
ref28
ref29
ref8
ref7
ref9
ref3
(ref26) 2000
(ref5) 2010
References_xml – ident: ref13
  doi: 10.1145/1459359.1459578
– year: 2000
  ident: ref26
  publication-title: Information Technology—JPEG2000 Image Coding System, ISO/IEC Int’l Standard 15444-1, ITU Recommendation T.800
– ident: ref32
  doi: 10.1155/asp/2006/42568
– ident: ref24
  doi: 10.1016/0146-664X(80)90049-0
– volume-title: Compute Unified Device Architecture (CUDA)
  year: 2010
  ident: ref4
– ident: ref7
  doi: 10.1109/ICIP.2009.5414207
– ident: ref11
  doi: 10.1109/TCSVT.2005.846440
– ident: ref25
  doi: 10.1109/TPAMI.1986.4767851
– ident: ref33
  doi: 10.1109/34.192463
– ident: ref35
  doi: 10.1093/oso/9780195094237.003.0002
– volume-title: Open Computing Language (OpenCL)
  year: 2010
  ident: ref5
– ident: ref12
  doi: 10.1016/j.rti.2005.04.002
– ident: ref15
  doi: 10.1145/1101149.1101334
– ident: ref14
  doi: 10.1007/s11554-008-0085-x
– ident: ref34
  doi: 10.1137/S0036141095289051
– ident: ref9
  doi: 10.1145/1281500.1281576
– ident: ref29
  doi: 10.1109/78.992147
– ident: ref28
  doi: 10.1109/83.847830
– ident: ref37
  doi: 10.1109/ICCV.1998.710815
– ident: ref31
  doi: 10.1109/TCSVT.2007.903789
– volume-title: CUJ2K: Jpeg2000 Encoder on Cuda
  year: 2010
  ident: ref18
– ident: ref2
  doi: 10.1111/j.1467-8659.2007.01012.x
– volume-title: NVIDIA CUDA Programming Guide 2.3
  year: 2009
  ident: ref20
– ident: ref17
  doi: 10.1145/1345206.1345220
– volume-title: OpenMP Website
  year: 2010
  ident: ref6
– ident: ref36
  doi: 10.1109/MCG.2003.1210867
– volume-title: General Purpose GPU Programming (GPGPU) Website
  year: 2010
  ident: ref1
– ident: ref8
  doi: 10.1145/882262.882362
– ident: ref23
  doi: 10.1109/CVPR.2007.383246
– start-page: 571
  volume-title: Proc. IEEE Int’l Conf. Image Processing
  ident: ref30
  article-title: Hardware Implementation of Shape-Adaptive Discrete Wavelet Transform with the JPEG Defaulted (9, 7) Filter Bank
– volume-title: MicroProcessor Report Online
  year: 2008
  ident: ref21
  article-title: Parallel Processing with CUDA
– volume-title: Proc. CVPR Workshop Visual Computer Vision on GPUs (CVGPU)
  year: 2008
  ident: ref16
– ident: ref3
  doi: 10.1109/JPROC.2008.917757
– volume-title: OpenGL(R) Shading Language
  year: 2006
  ident: ref10
– volume-title: Introduction to Parallel Computing
  year: 2003
  ident: ref22
– volume: 17
  start-page: 3
  issue: 1
  year: 2002
  ident: ref27
  article-title: An Overview of the JPEG 2000 Still Image Compression Standard
  publication-title: Signal Processing: Image Comm.
– ident: ref19
  doi: 10.1109/ICIP.2000.899223
– ident: ref38
  doi: 10.1145/280814.280951
SSID ssj0014504
Score 2.3283815
Snippet In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using...
SourceID proquest
crossref
ieee
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 91
SubjectTerms Algorithm design and analysis
Algorithms
Computer architecture
Computer vision
Concurrent computing
CUDA
Design engineering
Document management
GPGPU
GPU
Graphics processing unit
Image processing
Mathematical models
parallel implementation
Parallel processing
Parallel programming
Performance evaluation
Programming
Rendering
Scattering
Studies
Title Design and Performance Evaluation of Image Processing Algorithms on GPUs
URI https://ieeexplore.ieee.org/document/5477417
https://www.proquest.com/docview/1030171785
https://www.proquest.com/docview/914625136
Volume 22
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3dS8MwED90T_rgx1ScTsmD-GTnYpsueRxOncJkoAPfSpoPFbUV173413tJuyp-gFBKSy4hJLnkLnf3O4ADKyXDrV8HoYkVKijCBsKmIrAoavSUw0vx3u6j63g4ia7u2N0CHNWxMMYY73xmOu7T2_J1rmbuquyYRdgC7S3CIi6zMlarthhEzKcKRO2CBQLZ8BNP8_h2PLgpnbioy3775fzxCVV-7ML-aDlfhdG8U6VHyVNnVqQd9f4Nr_G_vV6DlUrGJP1yUazDgsmasDrP30Aqdm7C8hcwwg0YDrwzB5GZJuPPcAJyVuOBk9ySyxfcgEgVXoD1SP_5Pn97LB5epgQpLsaT6SZMzs9uT4dBlWchUKifFgFKGDIKUx5Rh-_GlLTaaKq6UZTK2LIwTq2mqTZC92LZtZZLyvHd49LSbsjicAsaWZ6ZbSBYHKOKxAU1PGJMpieMK6s5VUZYa2QLjuajn6gKhNzlwnhOvDLSFYmbrMRNFv6yFhzW5K8l-sZfhBtu4Guiasxb0J5PbVLx5jRxidUoarEca5G6GLnKmUpkZvLZNBF4gKDkF8Y7vze8C0vl5bJ72tAo3mZmD6WTIt33y_IDKlnhiA
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT-MwEB6xcGD3wBttWRZ8QJxIqYmd2kfEqzyKKtFK3CLHD3YFJIimF379jp00IFgkpChK5LFl2R7PjD3zDcCOU4rj1m-i2CYaDRTpIukyGTlUNbra46UEb_f-ddIbsYtbfjsDe00sjLU2OJ_Ztv8Md_mm0BN_VLbPGbZAu99gDuU-41W0VnNnwHhIFoj2BY8kMuIroub-cHB8U7lxUZ__9o0ECilVPuzDQbicLkJ_2q3Kp-S-PSmztn55h9j41X4vwUKtZZLDalksw4zNV2BxmsGB1Ay9Aj_ewBGuQu84uHMQlRsyeA0oICcNIjgpHDl_xC2I1AEGWI8cPtwVz3_LP49jghRng9F4DUanJ8OjXlRnWog0WqhlhDqGYnEmGPUIb1wrZ6yhusNYphLH4yRzhmbGStNNVMc5oajAd1coRzsxT-J1mM2L3P4EgsUJGklCUisY5yo74EI7I6i20jmrWrA3Hf1U1zDkPhvGQxrMkY5M_WSlfrLwl7dgtyF_qvA3PiNc9QPfENVj3oLN6dSmNXeOU59ajaIdK7AWaYqRr_xlicptMRmnEkUI6n5xsvH_hrdhvjfsX6VX59eXv-B7ddTsn02YLZ8n9jfqKmW2FZboPx-f5NU
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Design+and+Performance+Evaluation+of+Image+Processing+Algorithms+on+GPUs&rft.jtitle=IEEE+transactions+on+parallel+and+distributed+systems&rft.au=Park%2C+In+Kyu&rft.au=Singhal%2C+Nitin&rft.au=Lee%2C+Man+Hee&rft.au=Cho%2C+Sungdae&rft.date=2011-01-01&rft.issn=1045-9219&rft.volume=22&rft.issue=1&rft.spage=91&rft.epage=104&rft_id=info:doi/10.1109%2FTPDS.2010.115&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1045-9219&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1045-9219&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1045-9219&client=summon