Discovery of Collocation Patterns: from Visual Words to Visual Phrases

Bibliographic Details
Published in 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8
Main Authors Junsong Yuan, Ying Wu, Ming Yang
Format Conference Proceeding
Language English
Published IEEE 01.06.2007
ISBN 9781424411795
1424411793
ISSN 1063-6919
DOI 10.1109/CVPR.2007.383222

Abstract A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
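The first contribution named in the abstract, discovering spatially co-occurrent patterns of visual words with frequent itemset mining, can be illustrated with a minimal sketch. This record does not say how the paper builds its transactions or which mining algorithm it uses, so everything below is an assumption made for illustration: each transaction is taken to be the visual word of one feature plus the words of its `k` nearest spatial neighbors, and the mining is a plain level-wise (Apriori-style) search. The names `build_transactions`, `frequent_itemsets`, `min_support`, and `max_size` are hypothetical.

```python
from collections import Counter
from itertools import combinations
import math


def build_transactions(points, k=2):
    """Turn localized visual words into transactions (an assumed scheme).

    points: list of (x, y, word_id) tuples.
    Each transaction holds the word of one point together with the
    words of its k nearest spatial neighbors.
    """
    transactions = []
    for i, (xi, yi, wi) in enumerate(points):
        # Distances from point i to every other point, nearest first.
        others = sorted(
            (math.hypot(xi - xj, yi - yj), j)
            for j, (xj, yj, _) in enumerate(points)
            if j != i
        )
        group = {wi} | {points[j][2] for _, j in others[:k]}
        transactions.append(frozenset(group))
    return transactions


def frequent_itemsets(transactions, min_support, max_size=3):
    """Level-wise search for word sets occurring in >= min_support transactions."""
    frequent = {}
    # Level 1: count individual visual words.
    counts = Counter(item for t in transactions for item in t)
    current = set()
    for item, count in counts.items():
        if count >= min_support:
            itemset = frozenset([item])
            frequent[itemset] = count
            current.add(itemset)
    size = 1
    while current and size < max_size:
        size += 1
        items = sorted({i for s in current for i in s})
        # Keep a candidate only if all of its (size-1)-subsets are frequent.
        candidates = [
            frozenset(c)
            for c in combinations(items, size)
            if all(frozenset(sub) in current for sub in combinations(c, size - 1))
        ]
        current = set()
        for cand in candidates:
            support = sum(1 for t in transactions if cand <= t)
            if support >= min_support:
                frequent[cand] = support
                current.add(cand)
    return frequent
```

In this sketch, an itemset with more than one word that survives the support threshold plays the role of a candidate visual phrase. The summarization and metric-learning steps listed in the abstract are not covered here.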
Affiliation Northwestern Univ., Evanston (all three authors)
EISBN 1424411807
9781424411801
Subjects Computer vision
Data mining
Impedance
Information retrieval
Itemsets
Object recognition
Spatial resolution
Text recognition
Uncertainty
Vector quantization
URI https://ieeexplore.ieee.org/document/4270247