HIDE: Hierarchical iterative decoding enhancement for multi‐view 3D human parameter regression
Parametric human modeling are limited to either single‐view frameworks or simple multi‐view frameworks, failing to fully leverage the advantages of easily trainable single‐view networks and the occlusion‐resistant capabilities of multi‐view images. The prevalent presence of object occlusion and self...
Saved in:
Published in | Computer animation and virtual worlds Vol. 35; no. 3 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Chichester
Wiley Subscription Services, Inc
01.05.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Parametric human modeling are limited to either single‐view frameworks or simple multi‐view frameworks, failing to fully leverage the advantages of easily trainable single‐view networks and the occlusion‐resistant capabilities of multi‐view images. The prevalent presence of object occlusion and self‐occlusion in real‐world scenarios leads to issues of robustness and accuracy in predicting human body parameters. Additionally, many methods overlook the spatial connectivity of human joints in the global estimation of model pose parameters, resulting in cumulative errors in continuous joint parameters.To address these challenges, we propose a flexible and efficient iterative decoding strategy. By extending from single‐view images to multi‐view video inputs, we achieve local‐to‐global optimization. We utilize attention mechanisms to capture the rotational dependencies between any node in the human body and all its ancestor nodes, thereby enhancing pose decoding capability. We employ a parameter‐level iterative fusion of multi‐view image data to achieve flexible integration of global pose information, rapidly obtaining appropriate projection features from different viewpoints, ultimately resulting in precise parameter estimation. Through experiments, we validate the effectiveness of the HIDE method on the Human3.6M and 3DPW datasets, demonstrating significantly improved visualization results compared to previous methods. |
---|---|
AbstractList | Parametric human modeling are limited to either single‐view frameworks or simple multi‐view frameworks, failing to fully leverage the advantages of easily trainable single‐view networks and the occlusion‐resistant capabilities of multi‐view images. The prevalent presence of object occlusion and self‐occlusion in real‐world scenarios leads to issues of robustness and accuracy in predicting human body parameters. Additionally, many methods overlook the spatial connectivity of human joints in the global estimation of model pose parameters, resulting in cumulative errors in continuous joint parameters.To address these challenges, we propose a flexible and efficient iterative decoding strategy. By extending from single‐view images to multi‐view video inputs, we achieve local‐to‐global optimization. We utilize attention mechanisms to capture the rotational dependencies between any node in the human body and all its ancestor nodes, thereby enhancing pose decoding capability. We employ a parameter‐level iterative fusion of multi‐view image data to achieve flexible integration of global pose information, rapidly obtaining appropriate projection features from different viewpoints, ultimately resulting in precise parameter estimation. Through experiments, we validate the effectiveness of the HIDE method on the Human3.6M and 3DPW datasets, demonstrating significantly improved visualization results compared to previous methods. |
Author | Zhang, Xiaopeng Lin, Weitao Zhang, Jiguang Liu, Xianglong Meng, Weiliang |
Author_xml | – sequence: 1 givenname: Weitao orcidid: 0000-0003-1177-9809 surname: Lin fullname: Lin, Weitao organization: Institute of Automation, Chinese Academy of Sciences – sequence: 2 givenname: Jiguang orcidid: 0000-0002-8212-1361 surname: Zhang fullname: Zhang, Jiguang email: jiguang.zhang@ia.ac.cn organization: Institute of Automation, Chinese Academy of Sciences – sequence: 3 givenname: Weiliang orcidid: 0000-0002-3221-4981 surname: Meng fullname: Meng, Weiliang email: weiliang.meng@ia.ac.cn organization: Institute of Automation, Chinese Academy of Sciences – sequence: 4 givenname: Xianglong orcidid: 0009-0003-6962-8322 surname: Liu fullname: Liu, Xianglong organization: Institute of Automation, Chinese Academy of Sciences – sequence: 5 givenname: Xiaopeng orcidid: 0000-0002-0092-6474 surname: Zhang fullname: Zhang, Xiaopeng organization: Institute of Automation, Chinese Academy of Sciences |
BookMark | eNp1kMFOAjEQhhuDiYAmPkITL14W2-62u3ojgEJC4kWNt7W0UyjZ7WK7QLj5CD6jT-IixoPR08wk3_9P8nVQy1UOEDqnpEcJYVdKbnqMCXGE2pQnIkpY-tz62QU9QZ0Qlg0pGCVt9DKeDEc3eGzBS68WVskC27o5arsBrEFV2ro5BreQTkEJrsam8rhcF7X9eHvfWNjieIgX61I6vJJeltCksYe5hxBs5U7RsZFFgLPv2UWPt6OHwTia3t9NBv1ppNh1LKLYZEYlwnBD2WxGORCdCZbGZJZllOjYACfAQHMKhiVSSq25UsxwrlPOEh130cWhd-Wr1zWEOl9Wa--al3lMUipIRjhtqMsDpXwVggeTr7wtpd_llOR7f3njL9_7a9DeL1TZutFSudpLW_wViA6BrS1g929xPug_ffGfxpWEgA |
CitedBy_id | crossref_primary_10_1007_s11227_025_07012_4 crossref_primary_10_1007_s00371_025_03815_x |
Cites_doi | 10.1109/TPAMI.2013.248 10.1007/978-3-319-10602-1_48 10.1007/978-3-319-46493-0_38 10.1109/3DV.2017.00064 10.1007/978-3-319-46454-1_34 10.1109/3DV.2017.00055 10.1145/2816795.2818013 10.1109/3DV53792.2021.00015 |
ContentType | Journal Article |
Copyright | 2024 John Wiley & Sons Ltd. 2024 John Wiley & Sons, Ltd. |
Copyright_xml | – notice: 2024 John Wiley & Sons Ltd. – notice: 2024 John Wiley & Sons, Ltd. |
DBID | AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D |
DOI | 10.1002/cav.2266 |
DatabaseName | CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
DatabaseTitleList | CrossRef Computer and Information Systems Abstracts |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Visual Arts |
EISSN | 1546-427X |
EndPage | n/a |
ExternalDocumentID | 10_1002_cav_2266 CAV2266 |
Genre | article |
GrantInformation_xml | – fundername: Beihang University funderid: VRLAB2023B01 – fundername: Beijing Natural Science Foundation funderid: L231013 – fundername: Wenzhou Business School 2024 Talent launch program funderid: RC202401 – fundername: National Natural Science Foundation of China funderid: 52175493; 62162044; 62171321; 62365014; 62376271; U21A20515 |
GroupedDBID | .3N .4S .DC .GA .Y3 05W 0R~ 10A 1L6 1OB 1OC 29F 31~ 33P 3SF 3WU 4.4 50Y 50Z 51W 51X 52M 52N 52O 52P 52S 52T 52U 52W 52X 5GY 5VS 66C 6J9 702 7PT 8-0 8-1 8-3 8-4 8-5 930 A03 AAESR AAEVG AAHQN AAMMB AAMNL AANHP AANLZ AAONW AASGY AAXRX AAYCA AAZKR ABCQN ABCUV ABEML ABIJN ABPVW ACAHQ ACBWZ ACCZN ACGFS ACPOU ACRPL ACSCC ACXBN ACXQS ACYXJ ADBBV ADEOM ADIZJ ADKYN ADMGS ADMLS ADNMO ADOZA ADXAS ADZMN AEFGJ AEIGN AEIMD AENEX AEUYR AFBPY AFFPM AFGKR AFWVQ AFZJQ AGHNM AGQPQ AGXDD AGYGG AHBTC AIDQK AIDYY AITYG AIURR AJXKR ALMA_UNASSIGNED_HOLDINGS ALUQN ALVPJ AMBMR AMYDB ARCSS ASPBG ATUGU AUFTA AVWKF AZBYB AZFZN AZVAB BAFTC BDRZF BFHJK BHBCM BMNLL BROTX BRXPI BY8 CS3 D-E D-F DCZOG DPXWK DR2 DRFUL DRSTM DU5 EBS EDO EJD F00 F01 F04 F5P FEDTE G-S G.N GNP GODZA HF~ HGLYW HHY HVGLF HZ~ I-F ITG ITH IX1 J0M JPC KQQ LATKE LAW LC2 LC3 LEEKS LH4 LITHE LOXES LP6 LP7 LUTES LW6 LYRES MEWTI MK4 MRFUL MRSTM MSFUL MSSTM MXFUL MXSTM N9A NF~ O66 O9- OIG P2W P4D PQQKQ Q.N Q11 QB0 QRW R.K ROL RX1 RYL SUPJJ TN5 TUS UB1 V2E V8K W8V W99 WBKPD WIH WIK WQJ WXSBR WYISQ WZISG XG1 XV2 ~IA ~WT AAHHS AAYXX ACCFJ ADZOD AEEZP AEQDE AIWBW AJBDE CITATION 7SC 8FD JQ2 L7M L~C L~D |
ID | FETCH-LOGICAL-c2936-3f8fc46f5f12bb15e0d862730b8810d3fe50e2ed51ef24aaadd5cc2f55d7524d3 |
IEDL.DBID | DR2 |
ISSN | 1546-4261 |
IngestDate | Sat Jul 26 03:40:53 EDT 2025 Thu Apr 24 22:59:44 EDT 2025 Tue Jul 01 02:42:24 EDT 2025 Wed Aug 20 07:26:33 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 3 |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c2936-3f8fc46f5f12bb15e0d862730b8810d3fe50e2ed51ef24aaadd5cc2f55d7524d3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ORCID | 0000-0003-1177-9809 0000-0002-0092-6474 0000-0002-8212-1361 0000-0002-3221-4981 0009-0003-6962-8322 |
PQID | 3071608051 |
PQPubID | 2034909 |
PageCount | 13 |
ParticipantIDs | proquest_journals_3071608051 crossref_primary_10_1002_cav_2266 crossref_citationtrail_10_1002_cav_2266 wiley_primary_10_1002_cav_2266_CAV2266 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | May/June 2024 2024-05-00 20240501 |
PublicationDateYYYYMMDD | 2024-05-01 |
PublicationDate_xml | – month: 05 year: 2024 text: May/June 2024 |
PublicationDecade | 2020 |
PublicationPlace | Chichester |
PublicationPlace_xml | – name: Chichester |
PublicationTitle | Computer animation and virtual worlds |
PublicationYear | 2024 |
Publisher | Wiley Subscription Services, Inc |
Publisher_xml | – name: Wiley Subscription Services, Inc |
References | 2015; 34 2013; 36 2021; 34 2023 2022 2021 2020 2019 2018 2017 2016 2014 2017; 30:5998–6008 e_1_2_10_23_1 e_1_2_10_24_1 e_1_2_10_21_1 e_1_2_10_22_1 e_1_2_10_20_1 Vaswani A (e_1_2_10_19_1) 2017; 30 Zhang J (e_1_2_10_12_1) 2021; 34 e_1_2_10_2_1 e_1_2_10_4_1 e_1_2_10_18_1 e_1_2_10_3_1 e_1_2_10_6_1 e_1_2_10_16_1 e_1_2_10_5_1 e_1_2_10_17_1 e_1_2_10_8_1 e_1_2_10_14_1 e_1_2_10_7_1 e_1_2_10_15_1 e_1_2_10_9_1 e_1_2_10_13_1 e_1_2_10_34_1 e_1_2_10_10_1 e_1_2_10_33_1 e_1_2_10_11_1 e_1_2_10_32_1 e_1_2_10_31_1 e_1_2_10_30_1 e_1_2_10_29_1 e_1_2_10_27_1 e_1_2_10_28_1 e_1_2_10_25_1 e_1_2_10_26_1 |
References_xml | – volume: 30:5998–6008 year: 2017 article-title: Attention is all you need publication-title: Adv Neural Inf Process Syst – year: 2022 – year: 2021 – year: 2020 – year: 2023 – volume: 36 start-page: 1325 issue: 7 year: 2013 end-page: 1339 article-title: Human3.6m: large scale datasets and predictive methods for 3d human sensing in natural environments publication-title: IEEE Trans Pattern Anal Mach Intell – year: 2017 – year: 2016 – year: 2018 – volume: 34 start-page: 1 issue: 6 year: 2015 end-page: 16 article-title: SMPL: a skinned multi‐person linear model publication-title: ACM Trans Graph – year: 2019 – year: 2014 – volume: 34 start-page: 13153 year: 2021 end-page: 13164 article-title: Direct multi‐view multi‐person 3d pose estimation publication-title: Adv Neural Inf Process Syst – ident: e_1_2_10_21_1 doi: 10.1109/TPAMI.2013.248 – ident: e_1_2_10_5_1 – ident: e_1_2_10_30_1 – ident: e_1_2_10_16_1 doi: 10.1007/978-3-319-10602-1_48 – ident: e_1_2_10_10_1 – ident: e_1_2_10_14_1 doi: 10.1007/978-3-319-46493-0_38 – ident: e_1_2_10_28_1 – ident: e_1_2_10_24_1 doi: 10.1109/3DV.2017.00064 – ident: e_1_2_10_2_1 doi: 10.1007/978-3-319-46454-1_34 – ident: e_1_2_10_32_1 – ident: e_1_2_10_33_1 doi: 10.1109/3DV.2017.00055 – volume: 34 start-page: 13153 year: 2021 ident: e_1_2_10_12_1 article-title: Direct multi‐view multi‐person 3d pose estimation publication-title: Adv Neural Inf Process Syst – ident: e_1_2_10_13_1 doi: 10.1145/2816795.2818013 – ident: e_1_2_10_27_1 – volume: 30 year: 2017 ident: e_1_2_10_19_1 article-title: Attention is all you need publication-title: Adv Neural Inf Process Syst – ident: e_1_2_10_8_1 – ident: e_1_2_10_23_1 – ident: e_1_2_10_20_1 – ident: e_1_2_10_29_1 – ident: e_1_2_10_34_1 – ident: e_1_2_10_9_1 – ident: e_1_2_10_18_1 – ident: e_1_2_10_31_1 – ident: e_1_2_10_26_1 doi: 10.1109/3DV53792.2021.00015 – ident: e_1_2_10_15_1 – ident: e_1_2_10_4_1 – ident: e_1_2_10_7_1 – ident: e_1_2_10_25_1 – ident: e_1_2_10_6_1 – ident: e_1_2_10_17_1 – ident: e_1_2_10_22_1 – ident: e_1_2_10_3_1 – ident: e_1_2_10_11_1 |
SSID | ssj0026210 |
Score | 2.3810735 |
Snippet | Parametric human modeling are limited to either single‐view frameworks or simple multi‐view frameworks, failing to fully leverage the advantages of easily... |
SourceID | proquest crossref wiley |
SourceType | Aggregation Database Enrichment Source Index Database Publisher |
SubjectTerms | 3D human mesh recovery body modeling computer vision deep learning Global optimization Human body Occlusion Parameter estimation |
Title | HIDE: Hierarchical iterative decoding enhancement for multi‐view 3D human parameter regression |
URI | https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fcav.2266 https://www.proquest.com/docview/3071608051 |
Volume | 35 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS-RAEG5EL3pQd1UcV6UXRE8Zk05XZsbb4Cizoh58IXiI_YqKGhdnxoMnf4K_cX_JVqWT8YGCeAqEbpL0o-qrytdfMbbiwlAlNlIBGGgFEpqtQKNjC-KWAZmELaeK1MDeftI9ljuncFqyKuksjNeHGCbcaGcU9po2uNK99RfRUKMe6ogdSG2bqFqEhw6GylEiEV6IAB8YUJRQ6c6GYr3q-NYTvcDL1yC18DLbU-ysej9PLrmuD_q6bh7fSTd-7wOm2WQJPnnbr5YfbMTlP9nEyVVv4O_2Zth5909na4N3r-hgclEn5YZ75WU0i9xisErOjrv8kpYLpRY5wl5e8BL_PT3TjwYed3hR-o-Trvgt8W34vbvwhNt8lh1vbx1tdoOyCkNgEAokQZw1MyOTDLJIaB2BCy1GQWgYdLMZhTbOHIROOAuRy4RUCg0mGCMyANsAIW08x0bzu9zNMy4gFsqquIEYUmpw2mA0JkHYRiRV0rA1tlbNSGpKiXKqlHGTenFlkeKYpTRmNfZ72PKvl-X4oM1iNalpuTF7KZq0KEGUDFGNrRaz82n_dLN9QteFrzb8xcYFQh5Ph1xko_37gVtCyNLXy2ys3dnbPVwuFul_U9bqSw |
linkProvider | Wiley-Blackwell |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NTtwwEB5ROBQOBVoQ21IwEoJTdhPH492lJ8SCwu8BAeJQKTi2AwgIFbvbQ088As_IkzCON8uPqFT1FCkaK4ntmflmMv4GYNmGoZImUgFqbAcCW-0gI8cWxG2NQoZtq8rUwP6BTI7FzimejsCP6iyM54cYJtycZpT22im4S0g3nllDtfpdJ_AgP8CYa-jtiPM7h0PuKC65pyKgRwYuTqiYZ0PeqEa-9kXPAPMlTC39zNYk_Kze0JeXXNX7vayu_7whb_zPT5iCTwP8ydb9hpmGEVt8homTy27f3-1-gbNku7O5xpJLdza5bJVyzTz5MllGZihedf6O2eLC7RiXXWSEfFlZmvh4_-D-NbC4w8ruf8xRi9-4kht2Z899zW0xA8dbm0cbSTBoxBBoQgMyiPNWroXMMY94lkVoQ0OBENmGrNWKQhPnFkPLrcHI5lwoRTYTteY5omkiFyaehdHitrBzwDjGXBkVNwlGigxtpikgE8hNMxJKNk0NVqslSfWApdw1y7hOPb8yT2nOUjdnNVgaSv7yzBzvyMxXq5oOdLObklWLJAFljGqwUi7PX8enG-sn7vr1XwUX4WNytL-X7m0f7H6DcU4IyFdHzsNo765vvxOC6WUL5U59Ao7F7NM |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bT9swFD4aTJrgYVcQZd3wpGl7Sps4Pm67t4pSlV3QNEFViYfg2A5UlFDRdg972k_Yb9wv2XGclA2BNPEUKbKT2D6X7zjH3wF4a8NQSROpADV2AoHtTpCSYwvijkYhw45VxdbAlwM5OBIfRzgqsyrdWRjPD7HccHOaUdhrp-BTkzWvSUO1-t4g7CBX4KF7qCvb0Pu2pI7iknsmAnpj4MKEing25M2q57-u6Bpf_o1SCzfTfwLH1Qf67JLzxmKeNvSPG9yN9xvBU3hcok_W9eLyDB7Y_DmsD8ezhb87ewEng_3e3gc2GLuTyUWhlAnz1MtkF5mhaNV5O2bzMycvbm-REe5lRWLi75-_3J8GFvdYUfuPOWLxC5dww67sqc-4zTfgqL93uDsIyjIMgSYsIIM4a2dayAyziKdphDY0FAaRZUjb7Sg0cWYxtNwajGzGhVJkMVFrniGaFnJh4k1YzS9zuwWMY8yVUXGLQKRI0aaawjGB3LQioWTL1OB9tSKJLjnKXamMSeLZlXlCc5a4OavBm2XLqefluKVNvVrUpNTMWUI2LZIEkzGqwbtide7sn-x2h-66_b8Nd-DR114_-bx_8OklrHGCPz41sg6r86uFfUXwZZ6-LuT0D-Ih64I |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=HIDE%3A+Hierarchical+iterative+decoding+enhancement+for+multi%E2%80%90view+3D+human+parameter+regression&rft.jtitle=Computer+animation+and+virtual+worlds&rft.au=Lin%2C+Weitao&rft.au=Zhang%2C+Jiguang&rft.au=Meng%2C+Weiliang&rft.au=Liu%2C+Xianglong&rft.date=2024-05-01&rft.issn=1546-4261&rft.eissn=1546-427X&rft.volume=35&rft.issue=3&rft_id=info:doi/10.1002%2Fcav.2266&rft.externalDBID=n%2Fa&rft.externalDocID=10_1002_cav_2266 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1546-4261&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1546-4261&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1546-4261&client=summon |