GPS-Net: Graph Property Sensing Network for Scene Graph Generation

Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships. There are three key properties of scene graph that have been underexplored in recent works: namely, the edge direction information, the difference in priority between nodes, and the long-tailed d...

Full description

Saved in:
Bibliographic Details
Main Authors Lin, Xin, Ding, Changxing, Zeng, Jinquan, Tao, Dacheng
Format Journal Article
LanguageEnglish
Published 29.03.2020
Subjects
Online AccessGet full text
DOI10.48550/arxiv.2003.12962

Cover

Loading…
Abstract Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships. There are three key properties of scene graph that have been underexplored in recent works: namely, the edge direction information, the difference in priority between nodes, and the long-tailed distribution of relationships. Accordingly, in this paper, we propose a Graph Property Sensing Network (GPS-Net) that fully explores these three properties for SGG. First, we propose a novel message passing module that augments the node feature with node-specific contextual information and encodes the edge direction information via a tri-linear model. Second, we introduce a node priority sensitive loss to reflect the difference in priority between nodes during training. This is achieved by designing a mapping function that adjusts the focusing parameter in the focal loss. Third, since the frequency of relationships is affected by the long-tailed distribution problem, we mitigate this issue by first softening the distribution and then enabling it to be adjusted for each subject-object pair according to their visual appearance. Systematic experiments demonstrate the effectiveness of the proposed techniques. Moreover, GPS-Net achieves state-of-the-art performance on three popular databases: VG, OI, and VRD by significant gains under various settings and metrics. The code and models are available at \url{https://github.com/taksau/GPS-Net}.
AbstractList Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships. There are three key properties of scene graph that have been underexplored in recent works: namely, the edge direction information, the difference in priority between nodes, and the long-tailed distribution of relationships. Accordingly, in this paper, we propose a Graph Property Sensing Network (GPS-Net) that fully explores these three properties for SGG. First, we propose a novel message passing module that augments the node feature with node-specific contextual information and encodes the edge direction information via a tri-linear model. Second, we introduce a node priority sensitive loss to reflect the difference in priority between nodes during training. This is achieved by designing a mapping function that adjusts the focusing parameter in the focal loss. Third, since the frequency of relationships is affected by the long-tailed distribution problem, we mitigate this issue by first softening the distribution and then enabling it to be adjusted for each subject-object pair according to their visual appearance. Systematic experiments demonstrate the effectiveness of the proposed techniques. Moreover, GPS-Net achieves state-of-the-art performance on three popular databases: VG, OI, and VRD by significant gains under various settings and metrics. The code and models are available at \url{https://github.com/taksau/GPS-Net}.
Author Lin, Xin
Tao, Dacheng
Zeng, Jinquan
Ding, Changxing
Author_xml – sequence: 1
  givenname: Xin
  surname: Lin
  fullname: Lin, Xin
– sequence: 2
  givenname: Changxing
  surname: Ding
  fullname: Ding, Changxing
– sequence: 3
  givenname: Jinquan
  surname: Zeng
  fullname: Zeng, Jinquan
– sequence: 4
  givenname: Dacheng
  surname: Tao
  fullname: Tao, Dacheng
BackLink https://doi.org/10.48550/arXiv.2003.12962$$DView paper in arXiv
BookMark eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIwMNYzNLI0M-JkcHIPCNb1Sy2xUnAvSizIUAgoyi9ILSqpVAhOzSvOzEtXAMqV5xdlK6TlFykEJ6fmpUIVugOZRYklmfl5PAysaYk5xam8UJqbQd7NNcTZQxdsW3xBUWZuYlFlPMjWeLCtxoRVAAAyPjhC
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by-sa/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by-sa/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2003.12962
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2003_12962
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2003_129623
IEDL.DBID GOX
IngestDate Tue Jul 22 23:04:28 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2003_129623
OpenAccessLink https://arxiv.org/abs/2003.12962
ParticipantIDs arxiv_primary_2003_12962
PublicationCentury 2000
PublicationDate 2020-03-29
PublicationDateYYYYMMDD 2020-03-29
PublicationDate_xml – month: 03
  year: 2020
  text: 2020-03-29
  day: 29
PublicationDecade 2020
PublicationYear 2020
Score 3.4407525
SecondaryResourceType preprint
Snippet Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships. There are three key properties of scene graph that...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Title GPS-Net: Graph Property Sensing Network for Scene Graph Generation
URI https://arxiv.org/abs/2003.12962
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQMU0zTDZPTjPQNTJMsdQ1SUpO0bVMTAJyTZLNDFJMTZMMwZf2-fqZeYSaeEWYRjAxKMD2wiQWVWSWQc4HTirWB62c0gPWSKBCltnICNS5cvePgExOgo_igqpHqAO2McFCSJWEmyADP7R1p-AIiQ4hBqbUPBEGJ_eAYF2_1BIrBXfQ6dAKAaDx76KSSoVg0OLxvHQFP8hSbAVg-1EhOBlY-EAVQo6EBoWcKIO8m2uIs4cu2Nb4AsgREaDLHY3jwQ4yFmNgAXbkUyUYFFKSgZ0NCwvjRHOjJBNLCzMLYFfFwDTNLBFYZ5ubWZpKMkjgMkUKt5Q0A5cRqA9oALqTTYaBpaSoNFUWWFGWJMmBQwsAb2FrIw
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=GPS-Net%3A+Graph+Property+Sensing+Network+for+Scene+Graph+Generation&rft.au=Lin%2C+Xin&rft.au=Ding%2C+Changxing&rft.au=Zeng%2C+Jinquan&rft.au=Tao%2C+Dacheng&rft.date=2020-03-29&rft_id=info:doi/10.48550%2Farxiv.2003.12962&rft.externalDocID=2003_12962