Mask Guided Gated Convolution for Amodal Content Completion

We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated convolutions to assign more weight to the visible pixels of the occluded instance compared to the background, while ignoring the features of the...

Full description

Saved in:

Bibliographic Details
Main Authors	Saleh, Kaziwa, Szénási, Sándor, Vámossy, Zoltán
Format	Journal Article
Language	English
Published	21.07.2024
Subjects	Computer Science - Computer Vision and Pattern Recognition
Online Access	Get full text

Cover

Loading…

Abstract	We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated convolutions to assign more weight to the visible pixels of the occluded instance compared to the background, while ignoring the features of the invisible pixels. By drawing more attention from the visible region, our model can predict the invisible patch more effectively than the baseline models, especially in instances with uniform texture. The model is trained on COCOA dataset and two subsets of it in a self-supervised manner. The results demonstrate that our model generates higher quality and more texture-rich outputs compared to baseline models. Code is available at: https://github.com/KaziwaSaleh/mask-guided.
AbstractList	We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated convolutions to assign more weight to the visible pixels of the occluded instance compared to the background, while ignoring the features of the invisible pixels. By drawing more attention from the visible region, our model can predict the invisible patch more effectively than the baseline models, especially in instances with uniform texture. The model is trained on COCOA dataset and two subsets of it in a self-supervised manner. The results demonstrate that our model generates higher quality and more texture-rich outputs compared to baseline models. Code is available at: https://github.com/KaziwaSaleh/mask-guided.
Author	Saleh, Kaziwa Szénási, Sándor Vámossy, Zoltán
Author_xml	– sequence: 1 givenname: Kaziwa surname: Saleh fullname: Saleh, Kaziwa – sequence: 2 givenname: Sándor surname: Szénási fullname: Szénási, Sándor – sequence: 3 givenname: Zoltán surname: Vámossy fullname: Vámossy, Zoltán
BackLink	https://doi.org/10.48550/arXiv.2407.15203$$DView paper in arXiv
BookMark	eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIxMNczNDUyMOZksPZNLM5WcC_NTElNUXBPLAGSzvl5Zfk5pSWZ-XkKaflFCo65-SmJOSDhktS8EiCdW5CTCpLlYWBNS8wpTuWF0twM8m6uIc4eumBr4guKMnMTiyrjQdbFg60zJqwCAMkfNkI
ContentType	Journal Article
Copyright	http://creativecommons.org/licenses/by-nc-nd/4.0
Copyright_xml	– notice: http://creativecommons.org/licenses/by-nc-nd/4.0
DBID	AKY GOX
DOI	10.48550/arxiv.2407.15203
DatabaseName	arXiv Computer Science arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	2407_15203
GroupedDBID	AKY GOX
ID	FETCH-arxiv_primary_2407_152033
IEDL.DBID	GOX
IngestDate	Wed Jul 24 12:11:18 EDT 2024
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-arxiv_primary_2407_152033
OpenAccessLink	https://arxiv.org/abs/2407.15203
ParticipantIDs	arxiv_primary_2407_15203
PublicationCentury	2000
PublicationDate	2024-07-21
PublicationDateYYYYMMDD	2024-07-21
PublicationDate_xml	– month: 07 year: 2024 text: 2024-07-21 day: 21
PublicationDecade	2020
PublicationYear	2024
Score	3.8546503
SecondaryResourceType	preprint
Snippet	We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Computer Vision and Pattern Recognition
Title	Mask Guided Gated Convolution for Amodal Content Completion
URI	https://arxiv.org/abs/2407.15203
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQSbNMNUlMSzTWTQTW3aArzJJ1E40N0nQNzUzNUy1SDM1SzUGbk339zDxCTbwiTCOYGBRge2ESiyoyyyDnAycV64O6G6AbekDHeTIbGYGWbLn7R0AmJ8FHcUHVI9QB25hgIaRKwk2QgR_aulNwhESHEANTap4Ig7VvYnG2gntpZkpqigJotCpFwTk_rwwa5wrAVqOCY25-ClAf-KiovBIFUCYFHYqdnyfKIO_mGuLsoQu2Lr4AcjZEPMgl8WCXGIsxsAB78KkSDAqmlonJaWlpJuaWFmkmhommlgbGySmGKSZppsamqRYGiZIMErhMkcItJc3AZQSsYUEDjUaGMgwsJUWlqbLAGrIkSQ4cTAAzfGqH
link.rule.ids	228,230,783,888
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Mask+Guided+Gated+Convolution+for+Amodal+Content+Completion&rft.au=Saleh%2C+Kaziwa&rft.au=Sz%C3%A9n%C3%A1si%2C+S%C3%A1ndor&rft.au=V%C3%A1mossy%2C+Zolt%C3%A1n&rft.date=2024-07-21&rft_id=info:doi/10.48550%2Farxiv.2407.15203&rft.externalDocID=2407_15203