Mask Guided Gated Convolution for Amodal Content Completion

We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated convolutions to assign more weight to the visible pixels of the occluded instance compared to the background, while ignoring the features of the...

Full description

Saved in:
Bibliographic Details
Main Authors Saleh, Kaziwa, Szénási, Sándor, Vámossy, Zoltán
Format Journal Article
LanguageEnglish
Published 21.07.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated convolutions to assign more weight to the visible pixels of the occluded instance compared to the background, while ignoring the features of the invisible pixels. By drawing more attention from the visible region, our model can predict the invisible patch more effectively than the baseline models, especially in instances with uniform texture. The model is trained on COCOA dataset and two subsets of it in a self-supervised manner. The results demonstrate that our model generates higher quality and more texture-rich outputs compared to baseline models. Code is available at: https://github.com/KaziwaSaleh/mask-guided.
AbstractList We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated convolutions to assign more weight to the visible pixels of the occluded instance compared to the background, while ignoring the features of the invisible pixels. By drawing more attention from the visible region, our model can predict the invisible patch more effectively than the baseline models, especially in instances with uniform texture. The model is trained on COCOA dataset and two subsets of it in a self-supervised manner. The results demonstrate that our model generates higher quality and more texture-rich outputs compared to baseline models. Code is available at: https://github.com/KaziwaSaleh/mask-guided.
Author Saleh, Kaziwa
Szénási, Sándor
Vámossy, Zoltán
Author_xml – sequence: 1
  givenname: Kaziwa
  surname: Saleh
  fullname: Saleh, Kaziwa
– sequence: 2
  givenname: Sándor
  surname: Szénási
  fullname: Szénási, Sándor
– sequence: 3
  givenname: Zoltán
  surname: Vámossy
  fullname: Vámossy, Zoltán
BackLink https://doi.org/10.48550/arXiv.2407.15203$$DView paper in arXiv
BookMark eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIxMNczNDUyMOZksPZNLM5WcC_NTElNUXBPLAGSzvl5Zfk5pSWZ-XkKaflFCo65-SmJOSDhktS8EiCdW5CTCpLlYWBNS8wpTuWF0twM8m6uIc4eumBr4guKMnMTiyrjQdbFg60zJqwCAMkfNkI
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by-nc-nd/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by-nc-nd/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2407.15203
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2407_15203
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2407_152033
IEDL.DBID GOX
IngestDate Wed Jul 24 12:11:18 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2407_152033
OpenAccessLink https://arxiv.org/abs/2407.15203
ParticipantIDs arxiv_primary_2407_15203
PublicationCentury 2000
PublicationDate 2024-07-21
PublicationDateYYYYMMDD 2024-07-21
PublicationDate_xml – month: 07
  year: 2024
  text: 2024-07-21
  day: 21
PublicationDecade 2020
PublicationYear 2024
Score 3.8546503
SecondaryResourceType preprint
Snippet We present a model to reconstruct partially visible objects. The model takes a mask as an input, which we call weighted mask. The mask is utilized by gated...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Title Mask Guided Gated Convolution for Amodal Content Completion
URI https://arxiv.org/abs/2407.15203
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQSbNMNUlMSzTWTQTW3aArzJJ1E40N0nQNzUzNUy1SDM1SzUGbk339zDxCTbwiTCOYGBRge2ESiyoyyyDnAycV64O6G6AbekDHeTIbGYGWbLn7R0AmJ8FHcUHVI9QB25hgIaRKwk2QgR_aulNwhESHEANTap4Ig7VvYnG2gntpZkpqigJotCpFwTk_rwwa5wrAVqOCY25-ClAf-KiovBIFUCYFHYqdnyfKIO_mGuLsoQu2Lr4AcjZEPMgl8WCXGIsxsAB78KkSDAqmlonJaWlpJuaWFmkmhommlgbGySmGKSZppsamqRYGiZIMErhMkcItJc3AZQSsYUEDjUaGMgwsJUWlqbLAGrIkSQ4cTAAzfGqH
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Mask+Guided+Gated+Convolution+for+Amodal+Content+Completion&rft.au=Saleh%2C+Kaziwa&rft.au=Sz%C3%A9n%C3%A1si%2C+S%C3%A1ndor&rft.au=V%C3%A1mossy%2C+Zolt%C3%A1n&rft.date=2024-07-21&rft_id=info:doi/10.48550%2Farxiv.2407.15203&rft.externalDocID=2407_15203