Discovery of Collocation Patterns: from Visual Words to Visual Phrases

A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization....

Full description

Saved in:

Bibliographic Details
Published in	2007 IEEE Conference on Computer Vision and Pattern Recognition pp. 1 - 8
Main Authors	Junsong Yuan, Ying Wu, Ming Yang
Format	Conference Proceeding
Language	English
Published	IEEE 01.06.2007
Subjects	Computer vision Data mining Impedance Information retrieval Itemsets Object recognition Spatial resolution Text recognition Uncertainty Vector quantization
Online Access	Get full text
ISBN	9781424411795 1424411793
ISSN	1063-6919 1063-6919
DOI	10.1109/CVPR.2007.383222

Cover

Abstract	A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
AbstractList	A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
Author	Ying Wu Junsong Yuan Ming Yang
Author_xml	– sequence: 1 surname: Junsong Yuan fullname: Junsong Yuan organization: Northwestern Univ., Evanston – sequence: 2 surname: Ying Wu fullname: Ying Wu organization: Northwestern Univ., Evanston – sequence: 3 surname: Ming Yang fullname: Ming Yang organization: Northwestern Univ., Evanston
BookMark	eNpNjFFLwzAUhaNOcJt9F3zJH2jNTdIm8U2qU2FgEZ2PI2lvsdI1klRh_96CEzwvH4ePcxZkNvgBCbkAlgEwc1VuqueMM6YyoQXn_IgsQHIpATRTx2QOrBBpYcCckMQofXDK5LN_7owkMX6wKXqa5XpOVrddrP03hj31LS193_vajp0faGXHEcMQr2kb_I5uuvhle_rmQxPp6P969R5sxHhOTlvbR0wOXJLX1d1L-ZCun-4fy5t12nEJY9qYgqOwDdTOooBcF0wwJawoeMPBuNohKO1yqYUUlpnaNS2iAy6krGHCklz-_naIuP0M3c6G_VZyxbhU4gfoAlF7
ContentType	Conference Proceeding
DBID	6IE 6IH CBEJK RIE RIO
DOI	10.1109/CVPR.2007.383222
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Xplore Digital Library url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Applied Sciences Computer Science
EISBN	1424411807 9781424411801
EISSN	1063-6919
EndPage	8
ExternalDocumentID	4270247
Genre	orig-research
GroupedDBID	23M 29F 29O 6IE 6IH 6IK ABDPE ACGFS ALMA_UNASSIGNED_HOLDINGS CBEJK IPLJI M43 RIE RIO RNS
ID	FETCH-LOGICAL-i241t-d962e3ad1cbae3158603073a362d219bcbe178b548343a09cbdfeeb12344c1123
IEDL.DBID	RIE
ISBN	9781424411795 1424411793
ISSN	1063-6919
IngestDate	Wed Aug 27 01:48:26 EDT 2025
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i241t-d962e3ad1cbae3158603073a362d219bcbe178b548343a09cbdfeeb12344c1123
PageCount	8
ParticipantIDs	ieee_primary_4270247
PublicationCentury	2000
PublicationDate	2007-06
PublicationDateYYYYMMDD	2007-06-01
PublicationDate_xml	– month: 06 year: 2007 text: 2007-06
PublicationDecade	2000
PublicationTitle	2007 IEEE Conference on Computer Vision and Pattern Recognition
PublicationTitleAbbrev	CVPR
PublicationYear	2007
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0000818058 ssj0023720 ssj0003211698
Score	2.1399214
Snippet	A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a...
SourceID	ieee
SourceType	Publisher
StartPage	1
SubjectTerms	Computer vision Data mining Impedance Information retrieval Itemsets Object recognition Spatial resolution Text recognition Uncertainty Vector quantization
Title	Discovery of Collocation Patterns: from Visual Words to Visual Phrases
URI	https://ieeexplore.ieee.org/document/4270247
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELbaTkwFWsRbHhhJGsd5mbVQVUhFEaKlW-VXRAVKUJMM8OuxnTgIxMCWuymxLr7n9x0AVyzkvpScObFAKkGhPnYoTzxHkowmRAoaG8abxUM0Xwb363DdA9cdFkZKaYbPpKsfTS9fFLzWpbJJoMFTQdwHfWVmDVarq6doajbb4dMyVplNRLqOgq-3sZjOZ4SdiCBiQV6aEg1b7qdWDm0_0yOT6Sp9bJgOsTZ-_8cWFuOEZkOwsK_fzJ68unXFXP75i9nxv9-3D8bfcD-Ydo7sAPRkfgiGbXwK27-_VCq7AsLqRmB2uy25ngL9gEUGdRWiaGqAMDXEnXl5AzWCBa62ZU3f4LPKdUtYFVZOX3bKjZZjsJzdPU3nTruawdkql185gkS-xFQgzqjEKEwic1lQ5Q6FugMZZxLFCQt1qRJTj3AmMqncgo-DgKsQDx-BQV7k8hjAMOBJzHgsaKRSGxpQxDjKOFd5D6XMYydgpE9q896wb2zaQzr9W30G9uxEn4fOwaDa1fJChQ0VuzT28gV4MLlX
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8IwGG4QD3pCBeO3PXh0sK3dR72iBBXIYgC5kbbrItFshm0H_fW23YrRePC2t7c2XZ_363leAK6Yx10hOLOC2JEBCnWRRXloW4IkNCQipoFWvBlP_OEMPyy8RQNcb7gwQgjdfCa66lPX8uOMlypV1sOKPIWDLbAtcR97FVtrk1FR4mymxqdsJGMbn2xqCq6ax6Jrnz6yfOIQQ_NSomjIqD_Vtmcqmjbp9efRU6V1iNT1d3_MYdEwNGiBsdlA1X3y2i0L1uWfv7Qd_7vDPdD5JvzBaANl-6Ah0gPQqj1UWP__uVwyQyDMWhsMblc5V32gHzBLoMpDZFUWEEZaujPNb6DisMD5Ki_pG3yW0W4Oi8zY0ctaAmneAbPB3bQ_tOrhDNZKgn5hxcR3BaKxwxkVyPFCXz8XVAJiLF9BxplwgpB5KlmJqE04ixMhgcFFGHPp5KFD0EyzVBwB6GEeBowHMfVlcEMxdRh3Es5l5EMps9kxaKuTWr5X-hvL-pBO_l6-BDvD6Xi0HN1PHk_Brunvs50z0CzWpTiXTkTBLvTd-QJkX7yk
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2007+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition&rft.atitle=Discovery+of+Collocation+Patterns%3A+from+Visual+Words+to+Visual+Phrases&rft.au=Junsong+Yuan&rft.au=Ying+Wu&rft.au=Ming+Yang&rft.date=2007-06-01&rft.pub=IEEE&rft.isbn=9781424411795&rft.issn=1063-6919&rft.eissn=1063-6919&rft.spage=1&rft.epage=8&rft_id=info:doi/10.1109%2FCVPR.2007.383222&rft.externalDocID=4270247
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6919&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6919&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6919&client=summon