Text detection in video frames using hybrid features

Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid f...

Full description

Saved in:

Bibliographic Details
Published in	2009 International Conference on Machine Learning and Cybernetics Vol. 1; pp. 318 - 322
Main Authors	Zhong Ji, Jian Wang, Yu-Ting Su
Format	Conference Proceeding
Language	English
Published	IEEE 01.07.2009
Subjects	Feature extraction Image edge detection Indexing Layout Overlay text Robustness Scene text Speech analysis Support vector machines SVM Text detection Transform coding Video annotation Video compression Videoconference
Online Access	Get full text

Cover

Loading…

Abstract	Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid features are extracted. And then SVM classifier is employed to distinguish the text from background. At last, vote mechanism and morphological filter are performed to precisely locate the text region. The new method is expected to outperform the existing strategies based on the following two improvements. One is selecting robust features to distinguish both the scene and overlay text from the complex backgrounds. The other is addressing the multilingual capability over the whole processing. The proposed algorithm has been evaluated by four different kinds of videos and the experiments show its high performance.
AbstractList	Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid features are extracted. And then SVM classifier is employed to distinguish the text from background. At last, vote mechanism and morphological filter are performed to precisely locate the text region. The new method is expected to outperform the existing strategies based on the following two improvements. One is selecting robust features to distinguish both the scene and overlay text from the complex backgrounds. The other is addressing the multilingual capability over the whole processing. The proposed algorithm has been evaluated by four different kinds of videos and the experiments show its high performance.
Author	Zhong Ji Yu-Ting Su Jian Wang
Author_xml	– sequence: 1 surname: Zhong Ji fullname: Zhong Ji organization: Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China – sequence: 2 surname: Jian Wang fullname: Jian Wang organization: Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China – sequence: 3 surname: Yu-Ting Su fullname: Yu-Ting Su organization: Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China
BookMark	eNo1kL1OwzAUhY1oJZqSF4DFL5Dg65_YHlEEpVIQSwa2yomvwYimKE4RfXsiUc7y6QzfGU5GFsNhQEJugJUAzN5t6-emLjljtlQcuJL6gmQguZRCM8EvSW61-e9cLMiKQ8UKEOJ1SbLZMxbAKn5F8pQ-2BypuK7EisgWfybqccJ-ioeBxoF-R48HGka3x0SPKQ5v9P3UjdHTgG46jpiuyTK4z4T5mWvSPj609VPRvGy29X1TRNBqKowH0_VOo1WOSd8zr5zpjVauCr5jnbed5wC9l4wFHewsaWkkmMpLazuxJrd_sxERd19j3LvxtDsfIH4BMvlL5Q
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/ICMLC.2009.5212547
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISBN	1424437032 9781424437030
EndPage	322
ExternalDocumentID	5212547
Genre	orig-research
GroupedDBID	6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IPLJI M43 OCL RIE RIL
ID	FETCH-LOGICAL-i175t-8d18bca7e95a04dc0d5a8c875a6fdb0bd9bd211cd400f7f91757484186d499b3
IEDL.DBID	RIE
ISBN	9781424437023 1424437024
ISSN	2160-133X
IngestDate	Wed Aug 27 02:21:16 EDT 2025
IsPeerReviewed	false
IsScholarly	false
LCCN	2008911952
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i175t-8d18bca7e95a04dc0d5a8c875a6fdb0bd9bd211cd400f7f91757484186d499b3
PageCount	5
ParticipantIDs	ieee_primary_5212547
PublicationCentury	2000
PublicationDate	2009-July
PublicationDateYYYYMMDD	2009-07-01
PublicationDate_xml	– month: 07 year: 2009 text: 2009-July
PublicationDecade	2000
PublicationTitle	2009 International Conference on Machine Learning and Cybernetics
PublicationTitleAbbrev	ICMLC
PublicationYear	2009
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0000452763 ssj0000744891
Score	1.4869375
Snippet	Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text...
SourceID	ieee
SourceType	Publisher
StartPage	318
SubjectTerms	Feature extraction Image edge detection Indexing Layout Overlay text Robustness Scene text Speech analysis Support vector machines SVM Text detection Transform coding Video annotation Video compression Videoconference
Title	Text detection in video frames using hybrid features
URI	https://ieeexplore.ieee.org/document/5212547
Volume	1
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED21nZgKtIhveWAkbdo4cTJXVAVRxFCkbpU_zlAhpYgmA_x6zk5SBGIgU5wlsS3n7tnvvQO4iseImuJIgBGlb1xrHigt44ALHqY2sVKgZ1s8JLMnfreMly243mlhENGTz3Dgbv1Zvtno0m2VDZ3ONOaiDW0CbpVWa7ef4qzBRW0l5duCgIcvmDceJWFAUGzZ6LoiQYGpsXuq21EjqAmz4e1kfj-prCzrN_4oveIjz7QL8-abK8LJ66As1EB__rJz_G-n9qH_rfFjj7vodQAtzA-h2xR5YPWa7wFf0O-bGSw8Zytn65w56d6GWUfr2jJHnH9mLx9O-cUsep_QbR8W05vFZBbUpRaCNeUPRZCaUUpzJDCLZciNDk0sU01YRibWqFCZTBmCitrQkrfCEsaLnQnpKE0MQSYVHUEn3-R4DMzoNEJtlYpMxqVBpaxMdChTbunC8Qn03Bis3iozjVXd_dO_H5_BXnV84_ix59Ap3ku8oCygUJd--r8AXc2rRQ
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV09T8MwED1BGWAqUBDfeIAxJU2cjw5MhaqFtmIoUrcqts9QIaWIpkLlr_BX-HGcnaQIxIpEJjuDI5-t3J393juAs8BDlORHHPQpfONSckfIJHB4xN1YhzqJ0KItBmHnnt-MgtEKvC-5MIhowWdYN017l6-mcm6Oyi4MzzTgUQGhvMXFKyVos8vuFa3muee1r4etjlPUEHAm5BgzJ1aNmD4eYTNIXK6kq4IklhSkJ6FWwhWqKRTlQFLRXtaRpuQlMOqajThUlAsIn4ZdhTUKMwIvJ4ctD3CMFnlUaFfZfkSZjq3Q5zVC16Hcb1QSyfyIPGGpL1X0_ZLB4zYvuq1-r5VrZxZT_Fbrxbq6dhU-SiPlCJen-jwTdfn2Qz_yn1pxE3a-OIzsbumdt2AF022olkUsWPFPqwEfkntiCjOLSUvZJGWGmjhl2sDWZswQAx7Y48Iw25hGq4M624HhX0xgFyrpNMU9YErGPkothK-aPFEohE5C6SYx1_Sgtw81Y_Lxcy4WMi6sffD761NY7wz7vXGvO7g9hI38qspggY-gkr3M8Zginkyc2J3HYPzHa_QJnVYIpg
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2009+International+Conference+on+Machine+Learning+and+Cybernetics&rft.atitle=Text+detection+in+video+frames+using+hybrid+features&rft.au=Zhong+Ji&rft.au=Jian+Wang&rft.au=Yu-Ting+Su&rft.date=2009-07-01&rft.pub=IEEE&rft.isbn=9781424437023&rft.issn=2160-133X&rft.volume=1&rft.spage=318&rft.epage=322&rft_id=info:doi/10.1109%2FICMLC.2009.5212547&rft.externalDocID=5212547
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2160-133X&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2160-133X&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2160-133X&client=summon