Discovery of Collocation Patterns: from Visual Words to Visual Phrases
Published in | 2007 IEEE Conference on Computer Vision and Pattern Recognition pp. 1 - 8 |
---|---|
Main Authors | Junsong Yuan, Ying Wu, Ming Yang |
Format | Conference Proceeding |
Language | English |
Published | IEEE, June 2007 |
ISBN | 9781424411795 1424411793 |
ISSN | 1063-6919 |
DOI | 10.1109/CVPR.2007.383222 |
Abstract | A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning. |
---|---|
Author | Junsong Yuan; Ying Wu; Ming Yang (all Northwestern Univ., Evanston) |
Discipline | Applied Sciences Computer Science |
EISBN | 1424411807 9781424411801 |
Subjects | Computer vision; Data mining; Impedance; Information retrieval; Itemsets; Object recognition; Spatial resolution; Text recognition; Uncertainty; Vector quantization |
URI | https://ieeexplore.ieee.org/document/4270247 |
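
The abstract's first contribution, discovering spatially co-occurrent patterns of visual words with frequent itemset mining, can be illustrated with a small sketch. This record does not include the paper's algorithm, so everything below is an assumption made for illustration: the function names `build_transactions` and `frequent_itemsets` are hypothetical, each "transaction" is taken to be a feature's visual word plus the words of its k nearest spatial neighbours, and the miner is a brute-force subset count rather than an optimized method such as Apriori or FP-growth.

```python
import math
from collections import Counter
from itertools import combinations


def build_transactions(features, k=1):
    """Turn located visual words into transactions.

    features: list of (x, y, word_id) tuples.
    Each feature yields one transaction: the set containing its own word id
    and the word ids of its k nearest spatial neighbours (an assumed grouping).
    """
    transactions = []
    for i, (xi, yi, wi) in enumerate(features):
        # Distance from feature i to every other feature, nearest first.
        neighbours = sorted(
            (math.hypot(xi - xj, yi - yj), wj)
            for j, (xj, yj, wj) in enumerate(features)
            if j != i
        )
        words = [wi] + [w for _, w in neighbours[:k]]
        transactions.append(frozenset(words))
    return transactions


def frequent_itemsets(transactions, min_support, max_size=3):
    """Count every word subset of size 2..max_size and keep the frequent ones.

    Brute-force enumeration: fine for a toy example, too slow for real lexicons.
    """
    counts = Counter()
    for transaction in transactions:
        items = sorted(transaction)
        for size in range(2, max_size + 1):
            for combo in combinations(items, size):
                counts[frozenset(combo)] += 1
    return {s: c for s, c in counts.items() if c >= min_support}


# Toy image: words 1 and 2 appear side by side three times; word 3 is isolated.
toy_features = [
    (0, 0, 1), (1, 0, 2),
    (10, 0, 1), (11, 0, 2),
    (20, 0, 1), (21, 0, 2),
    (30, 0, 3),
]
toy_transactions = build_transactions(toy_features, k=1)
candidate_phrases = frequent_itemsets(toy_transactions, min_support=3, max_size=2)
print(candidate_phrases)
```

In this toy setting the pair of words 1 and 2 is the only itemset that reaches the support threshold, so it would be the sole candidate "visual phrase". The summarization and metric-learning steps mentioned in the abstract are not covered by this sketch.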