Gibson Env: Real-World Perception for Embodied Agents

Developing visual perception models for active agents and sensorimotor control in the physical world are cumbersome as existing algorithms are too slow to efficiently learn in real-time and robots are fragile and costly. This has given rise to learning-in-simulation which consequently casts a questi...

Full description

Saved in:
Bibliographic Details
Published in2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition pp. 9068 - 9079
Main Authors Xia, Fei, Zamir, Amir R., He, Zhiyang, Sax, Alexander, Malik, Jitendra, Savarese, Silvio
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.06.2018
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Developing visual perception models for active agents and sensorimotor control in the physical world are cumbersome as existing algorithms are too slow to efficiently learn in real-time and robots are fragile and costly. This has given rise to learning-in-simulation which consequently casts a question on whether the results transfer to real-world. In this paper, we investigate developing real-world perception for active agents, propose Gibson Environment for this purpose, and showcase a set of perceptual tasks learned therein. Gibson is based upon virtualizing real spaces, rather than artificially designed ones, and currently includes over 1400 floor spaces from 572 full buildings. The main characteristics of Gibson are: I. being from the real-world and reflecting its semantic complexity, II. having an internal synthesis mechanism "Goggles" enabling deploying the trained models in real-world without needing domain adaptation, III. embodiment of agents and making them subject to constraints of physics and space.
AbstractList Developing visual perception models for active agents and sensorimotor control in the physical world are cumbersome as existing algorithms are too slow to efficiently learn in real-time and robots are fragile and costly. This has given rise to learning-in-simulation which consequently casts a question on whether the results transfer to real-world. In this paper, we investigate developing real-world perception for active agents, propose Gibson Environment for this purpose, and showcase a set of perceptual tasks learned therein. Gibson is based upon virtualizing real spaces, rather than artificially designed ones, and currently includes over 1400 floor spaces from 572 full buildings. The main characteristics of Gibson are: I. being from the real-world and reflecting its semantic complexity, II. having an internal synthesis mechanism "Goggles" enabling deploying the trained models in real-world without needing domain adaptation, III. embodiment of agents and making them subject to constraints of physics and space.
Author Xia, Fei
Zamir, Amir R.
Malik, Jitendra
Savarese, Silvio
Sax, Alexander
He, Zhiyang
Author_xml – sequence: 1
  givenname: Fei
  surname: Xia
  fullname: Xia, Fei
– sequence: 2
  givenname: Amir R.
  surname: Zamir
  fullname: Zamir, Amir R.
– sequence: 3
  givenname: Zhiyang
  surname: He
  fullname: He, Zhiyang
– sequence: 4
  givenname: Alexander
  surname: Sax
  fullname: Sax, Alexander
– sequence: 5
  givenname: Jitendra
  surname: Malik
  fullname: Malik, Jitendra
– sequence: 6
  givenname: Silvio
  surname: Savarese
  fullname: Savarese, Silvio
BookMark eNotjMtKw0AUQEdRsNasXbjJDyTeeeaOuxJiFQqW4mNZJjM3MpJmShIE_96CchZnceBcs4shDcTYLYeSc7D39ft2VwrgWAJYpc9YZivkWqIxSoA9ZwsORhbGcnvFsmn6AgBhUKLSC6bXsZ3SkDfD90O-I9cXH2nsQ76l0dNxjqfUpTFvDm0KkUK--qRhnm7YZef6ibJ_L9nbY_NaPxWbl_VzvdoUUSg-F7wVKlRGefLKCnnCBCTF0ekgvQPfWU9dQIGyghDAo1Oy9WiwtVp1JJfs7u8biWh_HOPBjT971JUFJeUvwpJG9g
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/CVPR.2018.00945
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
Physics
EISBN 9781538664209
1538664208
EISSN 1063-6919
EndPage 9079
ExternalDocumentID 8579043
Genre orig-research
GroupedDBID 6IE
6IH
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
OCL
RIE
RIL
RIO
ID FETCH-LOGICAL-i241t-1b24d764cec49232326d8e418a5d3ca0cf9cefd828370dd0c8a43bc868b954fe3
IEDL.DBID RIE
IngestDate Wed Aug 27 02:52:16 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i241t-1b24d764cec49232326d8e418a5d3ca0cf9cefd828370dd0c8a43bc868b954fe3
PageCount 12
ParticipantIDs ieee_primary_8579043
PublicationCentury 2000
PublicationDate 2018-06
PublicationDateYYYYMMDD 2018-06-01
PublicationDate_xml – month: 06
  year: 2018
  text: 2018-06
PublicationDecade 2010
PublicationTitle 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
PublicationTitleAbbrev CVPR
PublicationYear 2018
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0002683845
ssj0003211698
Score 2.599554
Snippet Developing visual perception models for active agents and sensorimotor control in the physical world are cumbersome as existing algorithms are too slow to...
SourceID ieee
SourceType Publisher
StartPage 9068
SubjectTerms Cameras
Neural networks
Physics
Rendering (computer graphics)
Robot sensing systems
Three-dimensional displays
Visualization
Title Gibson Env: Real-World Perception for Embodied Agents
URI https://ieeexplore.ieee.org/document/8579043
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwGA3bQPA03Sb-pgePZmubH_3qTcbGECZjONltNMlXEHET13nwrzdJaxXxILkkOYSQEL6X5L33EXIVoUlSgYoyliDlUnOqRMSoAZM7fygIvdv-9F5OFvxuKZYNcl1rYRDRk8-w76r-L99s9M49lQ1AJGnIWZM07cWt1GrV7ymxBAbVD5lrM3uzkSlUbj5RmA6Gj7O543I58mTq5Es_0qn4aDJuk-nXPEoSyXN_V6i-_vhl0fjfiR6Q3rduL5jVEemQNHDdIe0KaAbVMd52yJ7nfeptl4gygXgwWr_fBHMLGqln1wSzmu8SWFQbjF7Uxrgxbp0Sa9sji_HoYTihVSYF-mQjdEEjFXOTSK5RO0M2W6QB5BFkwjCdhTpPNeYGvBWOMaGGjDOlQYJKBc-RHZHWerPGYxIoiQqSTGgZa27yWKEFQCYCQLBIAPCEdN16rF5Ls4xVtRSnf3efkX23IyX36py0ircdXtgoX6hLv72fMq-khQ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV3LSgMxFL34QHTloxXfZqHLqfNIMncEF6KVVm0pRcVdbZI7IGIrtlX0W_wV_81kZqwibgWZzUwWgdyEuSfJOecC7ARk4kSQ8qIoJo9LzT0lgsgzaFLnD4V-5rbfaMraJT-9FtcT8DbWwhBRRj6jinvN7vJNX4_cUdkeijjx-Wep6jN6ebYbtMFB_djO5m4YnlQvjmpeUUPAu7W5aegFKuQmllyTdlZk9pEGiQfYFSbSXV-niabUYGYCY4yvscsjpVGiSgRPKbL9TsK0xRkizNVh4xOcUGKExZ2c-47sXkomWPgHBX6yd3TVajv2mKNrJk4w9a2AS5a_Tubh_XPkOW3lrjIaqop-_WEK-V9DswDlL2Uia41z7iJMUG8J5gsozYof1WAJZjJmqx6UQOQl0lm197TP2hYWexl_iLXGjB5mcTur3qu-cX0cOq3ZoAyXfzKYZZjq9Xu0AkxJUhh3hZah5iYNFVmIZwJEQot1kFah5OLfecjtQDpF6Nd-b96G2dpF47xzXm-ercOcWw0502wDpoaPI9q0mGaotrKlxeDmryfsA2VdAvc
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+IEEE%2FCVF+Conference+on+Computer+Vision+and+Pattern+Recognition&rft.atitle=Gibson+Env%3A+Real-World+Perception+for+Embodied+Agents&rft.au=Xia%2C+Fei&rft.au=Zamir%2C+Amir+R.&rft.au=He%2C+Zhiyang&rft.au=Sax%2C+Alexander&rft.date=2018-06-01&rft.pub=IEEE&rft.eissn=1063-6919&rft.spage=9068&rft.epage=9079&rft_id=info:doi/10.1109%2FCVPR.2018.00945&rft.externalDocID=8579043