GPS-Net: Graph Property Sensing Network for Scene Graph Generation
Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships. There are three key properties of scene graph that have been underexplored in recent works: namely, the edge direction information, the difference in priority between nodes, and the long-tailed d...
Saved in:
Main Authors | , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
29.03.2020
|
Subjects | |
Online Access | Get full text |
DOI | 10.48550/arxiv.2003.12962 |
Cover
Loading…
Abstract | Scene graph generation (SGG) aims to detect objects in an image along with
their pairwise relationships. There are three key properties of scene graph
that have been underexplored in recent works: namely, the edge direction
information, the difference in priority between nodes, and the long-tailed
distribution of relationships. Accordingly, in this paper, we propose a Graph
Property Sensing Network (GPS-Net) that fully explores these three properties
for SGG. First, we propose a novel message passing module that augments the
node feature with node-specific contextual information and encodes the edge
direction information via a tri-linear model. Second, we introduce a node
priority sensitive loss to reflect the difference in priority between nodes
during training. This is achieved by designing a mapping function that adjusts
the focusing parameter in the focal loss. Third, since the frequency of
relationships is affected by the long-tailed distribution problem, we mitigate
this issue by first softening the distribution and then enabling it to be
adjusted for each subject-object pair according to their visual appearance.
Systematic experiments demonstrate the effectiveness of the proposed
techniques. Moreover, GPS-Net achieves state-of-the-art performance on three
popular databases: VG, OI, and VRD by significant gains under various settings
and metrics. The code and models are available at
\url{https://github.com/taksau/GPS-Net}. |
---|---|
AbstractList | Scene graph generation (SGG) aims to detect objects in an image along with
their pairwise relationships. There are three key properties of scene graph
that have been underexplored in recent works: namely, the edge direction
information, the difference in priority between nodes, and the long-tailed
distribution of relationships. Accordingly, in this paper, we propose a Graph
Property Sensing Network (GPS-Net) that fully explores these three properties
for SGG. First, we propose a novel message passing module that augments the
node feature with node-specific contextual information and encodes the edge
direction information via a tri-linear model. Second, we introduce a node
priority sensitive loss to reflect the difference in priority between nodes
during training. This is achieved by designing a mapping function that adjusts
the focusing parameter in the focal loss. Third, since the frequency of
relationships is affected by the long-tailed distribution problem, we mitigate
this issue by first softening the distribution and then enabling it to be
adjusted for each subject-object pair according to their visual appearance.
Systematic experiments demonstrate the effectiveness of the proposed
techniques. Moreover, GPS-Net achieves state-of-the-art performance on three
popular databases: VG, OI, and VRD by significant gains under various settings
and metrics. The code and models are available at
\url{https://github.com/taksau/GPS-Net}. |
Author | Lin, Xin Tao, Dacheng Zeng, Jinquan Ding, Changxing |
Author_xml | – sequence: 1 givenname: Xin surname: Lin fullname: Lin, Xin – sequence: 2 givenname: Changxing surname: Ding fullname: Ding, Changxing – sequence: 3 givenname: Jinquan surname: Zeng fullname: Zeng, Jinquan – sequence: 4 givenname: Dacheng surname: Tao fullname: Tao, Dacheng |
BackLink | https://doi.org/10.48550/arXiv.2003.12962$$DView paper in arXiv |
BookMark | eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIwMNYzNLI0M-JkcHIPCNb1Sy2xUnAvSizIUAgoyi9ILSqpVAhOzSvOzEtXAMqV5xdlK6TlFykEJ6fmpUIVugOZRYklmfl5PAysaYk5xam8UJqbQd7NNcTZQxdsW3xBUWZuYlFlPMjWeLCtxoRVAAAyPjhC |
ContentType | Journal Article |
Copyright | http://creativecommons.org/licenses/by-sa/4.0 |
Copyright_xml | – notice: http://creativecommons.org/licenses/by-sa/4.0 |
DBID | AKY GOX |
DOI | 10.48550/arxiv.2003.12962 |
DatabaseName | arXiv Computer Science arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 2003_12962 |
GroupedDBID | AKY GOX |
ID | FETCH-arxiv_primary_2003_129623 |
IEDL.DBID | GOX |
IngestDate | Tue Jul 22 23:04:28 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-arxiv_primary_2003_129623 |
OpenAccessLink | https://arxiv.org/abs/2003.12962 |
ParticipantIDs | arxiv_primary_2003_12962 |
PublicationCentury | 2000 |
PublicationDate | 2020-03-29 |
PublicationDateYYYYMMDD | 2020-03-29 |
PublicationDate_xml | – month: 03 year: 2020 text: 2020-03-29 day: 29 |
PublicationDecade | 2020 |
PublicationYear | 2020 |
Score | 3.4407525 |
SecondaryResourceType | preprint |
Snippet | Scene graph generation (SGG) aims to detect objects in an image along with
their pairwise relationships. There are three key properties of scene graph
that... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Computer Vision and Pattern Recognition |
Title | GPS-Net: Graph Property Sensing Network for Scene Graph Generation |
URI | https://arxiv.org/abs/2003.12962 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQMU0zTDZPTjPQNTJMsdQ1SUpO0bVMTAJyTZLNDFJMTZMMwZf2-fqZeYSaeEWYRjAxKMD2wiQWVWSWQc4HTirWB62c0gPWSKBCltnICNS5cvePgExOgo_igqpHqAO2McFCSJWEmyADP7R1p-AIiQ4hBqbUPBEGJ_eAYF2_1BIrBXfQ6dAKAaDx76KSSoVg0OLxvHQFP8hSbAVg-1EhOBlY-EAVQo6EBoWcKIO8m2uIs4cu2Nb4AsgREaDLHY3jwQ4yFmNgAXbkUyUYFFKSgZ0NCwvjRHOjJBNLCzMLYFfFwDTNLBFYZ5ubWZpKMkjgMkUKt5Q0A5cRqA9oALqTTYaBpaSoNFUWWFGWJMmBQwsAb2FrIw |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=GPS-Net%3A+Graph+Property+Sensing+Network+for+Scene+Graph+Generation&rft.au=Lin%2C+Xin&rft.au=Ding%2C+Changxing&rft.au=Zeng%2C+Jinquan&rft.au=Tao%2C+Dacheng&rft.date=2020-03-29&rft_id=info:doi/10.48550%2Farxiv.2003.12962&rft.externalDocID=2003_12962 |