HIDE: Hierarchical iterative decoding enhancement for multi‐view 3D human parameter regression

Bibliographic Details
Published in Computer animation and virtual worlds Vol. 35; no. 3
Main Authors Lin, Weitao; Zhang, Jiguang; Meng, Weiliang; Liu, Xianglong; Zhang, Xiaopeng
Format Journal Article
Language English
Published Chichester Wiley Subscription Services, Inc 01.05.2024
Abstract Parametric human modeling methods are limited to either single‐view frameworks or simple multi‐view frameworks, failing to fully leverage the advantages of easily trainable single‐view networks and the occlusion‐resistant capabilities of multi‐view images. The prevalence of object occlusion and self‐occlusion in real‐world scenarios undermines the robustness and accuracy of human body parameter prediction. Additionally, many methods overlook the spatial connectivity of human joints in the global estimation of model pose parameters, resulting in cumulative errors in continuous joint parameters. To address these challenges, we propose a flexible and efficient iterative decoding strategy. By extending from single‐view images to multi‐view video inputs, we achieve local‐to‐global optimization. We use attention mechanisms to capture the rotational dependencies between any node in the human body and all of its ancestor nodes, thereby enhancing pose decoding capability. We employ a parameter‐level iterative fusion of multi‐view image data to achieve flexible integration of global pose information, rapidly obtaining appropriate projection features from different viewpoints and ultimately producing precise parameter estimates. Through experiments, we validate the effectiveness of the HIDE method on the Human3.6M and 3DPW datasets, demonstrating significantly improved visualization results compared to previous methods.
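The abstract's idea of letting each joint attend to itself and all of its ancestor joints can be illustrated with a toy example. What follows is a minimal sketch under stated assumptions, not the authors' implementation: the 6-joint kinematic tree, the function names ancestor_mask and masked_softmax, and the uniform attention scores are all hypothetical and chosen only for illustration.

```python
import math


def ancestor_mask(parents):
    """Return mask where mask[j][k] is True if k is j itself or an ancestor of j."""
    n = len(parents)
    mask = [[False] * n for _ in range(n)]
    for j in range(n):
        k = j
        while k != -1:  # walk up the tree until we pass the root
            mask[j][k] = True
            k = parents[k]
    return mask


def masked_softmax(scores, allowed):
    """Softmax over one row of scores; disallowed entries get weight 0."""
    kept = [s for s, a in zip(scores, allowed) if a]
    peak = max(kept)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) if a else 0.0 for s, a in zip(scores, allowed)]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical toy kinematic tree with 6 joints; -1 marks the root.
PARENTS = [-1, 0, 0, 1, 2, 3]
MASK = ancestor_mask(PARENTS)

# Attention weights for joint 5 with uniform (all-zero) scores:
# the weight is spread only over joint 5 and its ancestors 3, 1, and 0.
WEIGHTS = masked_softmax([0.0] * len(PARENTS), MASK[5])
```

In a real model the scores would come from learned query and key projections of per-joint features; the mask simply restricts which joints may contribute to each joint's rotation estimate.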
Author Zhang, Xiaopeng
Lin, Weitao
Zhang, Jiguang
Liu, Xianglong
Meng, Weiliang
Author_xml – sequence: 1
  givenname: Weitao
  orcidid: 0000-0003-1177-9809
  surname: Lin
  fullname: Lin, Weitao
  organization: Institute of Automation, Chinese Academy of Sciences
– sequence: 2
  givenname: Jiguang
  orcidid: 0000-0002-8212-1361
  surname: Zhang
  fullname: Zhang, Jiguang
  email: jiguang.zhang@ia.ac.cn
  organization: Institute of Automation, Chinese Academy of Sciences
– sequence: 3
  givenname: Weiliang
  orcidid: 0000-0002-3221-4981
  surname: Meng
  fullname: Meng, Weiliang
  email: weiliang.meng@ia.ac.cn
  organization: Institute of Automation, Chinese Academy of Sciences
– sequence: 4
  givenname: Xianglong
  orcidid: 0009-0003-6962-8322
  surname: Liu
  fullname: Liu, Xianglong
  organization: Institute of Automation, Chinese Academy of Sciences
– sequence: 5
  givenname: Xiaopeng
  orcidid: 0000-0002-0092-6474
  surname: Zhang
  fullname: Zhang, Xiaopeng
  organization: Institute of Automation, Chinese Academy of Sciences
CitedBy_id crossref_primary_10_1007_s11227_025_07012_4
crossref_primary_10_1007_s00371_025_03815_x
Cites_doi 10.1109/TPAMI.2013.248
10.1007/978-3-319-10602-1_48
10.1007/978-3-319-46493-0_38
10.1109/3DV.2017.00064
10.1007/978-3-319-46454-1_34
10.1109/3DV.2017.00055
10.1145/2816795.2818013
10.1109/3DV53792.2021.00015
ContentType Journal Article
Copyright 2024 John Wiley & Sons Ltd.
2024 John Wiley & Sons, Ltd.
Copyright_xml – notice: 2024 John Wiley & Sons Ltd.
– notice: 2024 John Wiley & Sons, Ltd.
DBID AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1002/cav.2266
DatabaseName CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList CrossRef
Computer and Information Systems Abstracts

DeliveryMethod fulltext_linktorsrc
Discipline Visual Arts
EISSN 1546-427X
EndPage n/a
ExternalDocumentID 10_1002_cav_2266
CAV2266
Genre article
GrantInformation_xml – fundername: Beihang University
  funderid: VRLAB2023B01
– fundername: Beijing Natural Science Foundation
  funderid: L231013
– fundername: Wenzhou Business School 2024 Talent launch program
  funderid: RC202401
– fundername: National Natural Science Foundation of China
  funderid: 52175493; 62162044; 62171321; 62365014; 62376271; U21A20515
IEDL.DBID DR2
ISSN 1546-4261
IngestDate Sat Jul 26 03:40:53 EDT 2025
Thu Apr 24 22:59:44 EDT 2025
Tue Jul 01 02:42:24 EDT 2025
Wed Aug 20 07:26:33 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 3
Language English
LinkModel DirectLink
ORCID 0000-0003-1177-9809
0000-0002-0092-6474
0000-0002-8212-1361
0000-0002-3221-4981
0009-0003-6962-8322
PQID 3071608051
PQPubID 2034909
PageCount 13
PublicationCentury 2000
PublicationDate May/June 2024
2024-05-00
20240501
PublicationDateYYYYMMDD 2024-05-01
PublicationDate_xml – month: 05
  year: 2024
  text: May/June 2024
PublicationDecade 2020
PublicationPlace Chichester
PublicationPlace_xml – name: Chichester
PublicationTitle Computer animation and virtual worlds
PublicationYear 2024
Publisher Wiley Subscription Services, Inc
Publisher_xml – name: Wiley Subscription Services, Inc
SourceID proquest
crossref
wiley
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
SubjectTerms 3D human mesh recovery
body modeling
computer vision
deep learning
Global optimization
Human body
Occlusion
Parameter estimation
Title HIDE: Hierarchical iterative decoding enhancement for multi‐view 3D human parameter regression
URI https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fcav.2266
https://www.proquest.com/docview/3071608051
Volume 35