Text detection in video frames using hybrid features

Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid f...

Full description

Saved in:
Bibliographic Details
Published in2009 International Conference on Machine Learning and Cybernetics Vol. 1; pp. 318 - 322
Main Authors Zhong Ji, Jian Wang, Yu-Ting Su
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.07.2009
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid features are extracted. And then SVM classifier is employed to distinguish the text from background. At last, vote mechanism and morphological filter are performed to precisely locate the text region. The new method is expected to outperform the existing strategies based on the following two improvements. One is selecting robust features to distinguish both the scene and overlay text from the complex backgrounds. The other is addressing the multilingual capability over the whole processing. The proposed algorithm has been evaluated by four different kinds of videos and the experiments show its high performance.
AbstractList Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid features are extracted. And then SVM classifier is employed to distinguish the text from background. At last, vote mechanism and morphological filter are performed to precisely locate the text region. The new method is expected to outperform the existing strategies based on the following two improvements. One is selecting robust features to distinguish both the scene and overlay text from the complex backgrounds. The other is addressing the multilingual capability over the whole processing. The proposed algorithm has been evaluated by four different kinds of videos and the experiments show its high performance.
Author Zhong Ji
Yu-Ting Su
Jian Wang
Author_xml – sequence: 1
  surname: Zhong Ji
  fullname: Zhong Ji
  organization: Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China
– sequence: 2
  surname: Jian Wang
  fullname: Jian Wang
  organization: Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China
– sequence: 3
  surname: Yu-Ting Su
  fullname: Yu-Ting Su
  organization: Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China
BookMark eNo1kL1OwzAUhY1oJZqSF4DFL5Dg65_YHlEEpVIQSwa2yomvwYimKE4RfXsiUc7y6QzfGU5GFsNhQEJugJUAzN5t6-emLjljtlQcuJL6gmQguZRCM8EvSW61-e9cLMiKQ8UKEOJ1SbLZMxbAKn5F8pQ-2BypuK7EisgWfybqccJ-ioeBxoF-R48HGka3x0SPKQ5v9P3UjdHTgG46jpiuyTK4z4T5mWvSPj609VPRvGy29X1TRNBqKowH0_VOo1WOSd8zr5zpjVauCr5jnbed5wC9l4wFHewsaWkkmMpLazuxJrd_sxERd19j3LvxtDsfIH4BMvlL5Q
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICMLC.2009.5212547
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 1424437032
9781424437030
EndPage 322
ExternalDocumentID 5212547
Genre orig-research
GroupedDBID 6IE
6IF
6IH
6IK
6IL
6IM
6IN
AAJGR
AAWTH
ACGFS
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IPLJI
M43
OCL
RIE
RIL
ID FETCH-LOGICAL-i175t-8d18bca7e95a04dc0d5a8c875a6fdb0bd9bd211cd400f7f91757484186d499b3
IEDL.DBID RIE
ISBN 9781424437023
1424437024
ISSN 2160-133X
IngestDate Wed Aug 27 02:21:16 EDT 2025
IsPeerReviewed false
IsScholarly false
LCCN 2008911952
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-8d18bca7e95a04dc0d5a8c875a6fdb0bd9bd211cd400f7f91757484186d499b3
PageCount 5
ParticipantIDs ieee_primary_5212547
PublicationCentury 2000
PublicationDate 2009-July
PublicationDateYYYYMMDD 2009-07-01
PublicationDate_xml – month: 07
  year: 2009
  text: 2009-July
PublicationDecade 2000
PublicationTitle 2009 International Conference on Machine Learning and Cybernetics
PublicationTitleAbbrev ICMLC
PublicationYear 2009
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000452763
ssj0000744891
Score 1.4869375
Snippet Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text...
SourceID ieee
SourceType Publisher
StartPage 318
SubjectTerms Feature extraction
Image edge detection
Indexing
Layout
Overlay text
Robustness
Scene text
Speech analysis
Support vector machines
SVM
Text detection
Transform coding
Video annotation
Video compression
Videoconference
Title Text detection in video frames using hybrid features
URI https://ieeexplore.ieee.org/document/5212547
Volume 1
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED21nZgKtIhveWAkbdo4cTJXVAVRxFCkbpU_zlAhpYgmA_x6zk5SBGIgU5wlsS3n7tnvvQO4iseImuJIgBGlb1xrHigt44ALHqY2sVKgZ1s8JLMnfreMly243mlhENGTz3Dgbv1Zvtno0m2VDZ3ONOaiDW0CbpVWa7ef4qzBRW0l5duCgIcvmDceJWFAUGzZ6LoiQYGpsXuq21EjqAmz4e1kfj-prCzrN_4oveIjz7QL8-abK8LJ66As1EB__rJz_G-n9qH_rfFjj7vodQAtzA-h2xR5YPWa7wFf0O-bGSw8Zytn65w56d6GWUfr2jJHnH9mLx9O-cUsep_QbR8W05vFZBbUpRaCNeUPRZCaUUpzJDCLZciNDk0sU01YRibWqFCZTBmCitrQkrfCEsaLnQnpKE0MQSYVHUEn3-R4DMzoNEJtlYpMxqVBpaxMdChTbunC8Qn03Bis3iozjVXd_dO_H5_BXnV84_ix59Ap3ku8oCygUJd--r8AXc2rRQ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV09T8MwED1BGWAqUBDfeIAxJU2cjw5MhaqFtmIoUrcqts9QIaWIpkLlr_BX-HGcnaQIxIpEJjuDI5-t3J393juAs8BDlORHHPQpfONSckfIJHB4xN1YhzqJ0KItBmHnnt-MgtEKvC-5MIhowWdYN017l6-mcm6Oyi4MzzTgUQGhvMXFKyVos8vuFa3muee1r4etjlPUEHAm5BgzJ1aNmD4eYTNIXK6kq4IklhSkJ6FWwhWqKRTlQFLRXtaRpuQlMOqajThUlAsIn4ZdhTUKMwIvJ4ctD3CMFnlUaFfZfkSZjq3Q5zVC16Hcb1QSyfyIPGGpL1X0_ZLB4zYvuq1-r5VrZxZT_Fbrxbq6dhU-SiPlCJen-jwTdfn2Qz_yn1pxE3a-OIzsbumdt2AF022olkUsWPFPqwEfkntiCjOLSUvZJGWGmjhl2sDWZswQAx7Y48Iw25hGq4M624HhX0xgFyrpNMU9YErGPkothK-aPFEohE5C6SYx1_Sgtw81Y_Lxcy4WMi6sffD761NY7wz7vXGvO7g9hI38qspggY-gkr3M8Zginkyc2J3HYPzHa_QJnVYIpg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2009+International+Conference+on+Machine+Learning+and+Cybernetics&rft.atitle=Text+detection+in+video+frames+using+hybrid+features&rft.au=Zhong+Ji&rft.au=Jian+Wang&rft.au=Yu-Ting+Su&rft.date=2009-07-01&rft.pub=IEEE&rft.isbn=9781424437023&rft.issn=2160-133X&rft.volume=1&rft.spage=318&rft.epage=322&rft_id=info:doi/10.1109%2FICMLC.2009.5212547&rft.externalDocID=5212547
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2160-133X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2160-133X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2160-133X&client=summon