VQ-VAE-2를 활용한 향상된 동적 영상 생성 인공지능 (Enhanced Dynamic Video Generation AI Using VQ-VAE-2)

Bibliographic Details
Published in	Journal of the Korea Institute of Information and Communication Engineering (한국정보통신학회논문지), Vol. 28, No. 10, pp. 1168-1173
Main Authors	Hyun-Jun Lee (이현준), Ji-Woo Kim (김지우), Sang-Mi Chae (채상미)
Format	Journal Article
Language	Korean
Published	Korea Institute of Information and Communication Engineering (한국정보통신학회), 2024-10-01
Subjects	Electronics/Information and Communication Engineering; Image Generation; Video Generation; Artificial Intelligence; Deep Learning
ISSN	2234-4772 (print), 2288-4165 (electronic)
DOI	10.6109/jkiice.2024.28.10.1168

Abstract	This study addresses AI-based content production, focusing on the rapidly developing field of video generation. In response to steadily rising demand for video generation models, we propose a new video generation model built on the VQ-VAE-2 framework. The model is designed to convert a still image into dynamic video content and suggests potential for significant progress in AI-based video synthesis. By training a variant of the VQ-VAE-2 model on the TGIF dataset, we developed a model that generates high-quality video from a single photograph. This result demonstrates the feasibility of AI-driven video production and opens new possibilities for creative content generation. As future work, we plan to study the structure of natural images by analyzing the distribution of pixels generated during VQ-VAE-2 training. This analysis is intended to give deeper insight into the basic mechanisms of image synthesis and to improve our understanding of how AI can be used in innovative applications.
KCI Citation Count	0
DocumentTitleAlternate	From Static to Dynamic: Enhancing Video Generation Using VQ-VAE-2
IsPeerReviewed	true
IsScholarly	true
Notes	http://jkiice.org
URI	https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE11955433 https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART003131876