Text detection in video frames using hybrid features
Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid f...
Published in | 2009 International Conference on Machine Learning and Cybernetics Vol. 1; pp. 318 - 322 |
---|---|
Main Authors | Zhong Ji, Jian Wang, Yu-Ting Su |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.07.2009 |
Subjects | |
Abstract | Text in video frames provides concise and important content information that is helpful for video scene understanding, annotation, and search. This paper proposes a new method for detecting text in video frames. First, a small overlapping sliding window is scanned over the frame, and hybrid features are extracted from each window. An SVM classifier is then used to distinguish text from background. Finally, a voting mechanism and a morphological filter are applied to locate the text region precisely. The new method is expected to outperform existing strategies because of two improvements: it selects robust features that distinguish both scene text and overlay text from complex backgrounds, and it supports multilingual text throughout the whole processing pipeline. The proposed algorithm was evaluated on four different kinds of videos, and the experiments show high performance. |
---|---|
Author | Zhong Ji; Yu-Ting Su; Jian Wang |
Author_xml | 1. Zhong Ji (Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China); 2. Jian Wang (Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China); 3. Yu-Ting Su (Sch. of Electron. Inf. Eng., Tianjin Univ., Tianjin, China) |
ContentType | Conference Proceeding |
DOI | 10.1109/ICMLC.2009.5212547 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
Discipline | Computer Science |
EISBN | 1424437032; 9781424437030 |
EndPage | 322 |
ExternalDocumentID | 5212547 |
Genre | orig-research |
ISBN | 9781424437023; 1424437024 |
ISSN | 2160-133X |
IsPeerReviewed | false |
IsScholarly | false |
LCCN | 2008911952 |
Language | English |
PageCount | 5 |
ParticipantIDs | ieee_primary_5212547 |
PublicationCentury | 2000 |
PublicationDate | 2009-July |
PublicationDateYYYYMMDD | 2009-07-01 |
PublicationDate_xml | – month: 07 year: 2009 text: 2009-July |
PublicationDecade | 2000 |
PublicationTitle | 2009 International Conference on Machine Learning and Cybernetics |
PublicationTitleAbbrev | ICMLC |
PublicationYear | 2009 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SourceID | ieee |
SourceType | Publisher |
StartPage | 318 |
SubjectTerms | Feature extraction; Image edge detection; Indexing; Layout; Overlay text; Robustness; Scene text; Speech analysis; Support vector machines; SVM; Text detection; Transform coding; Video annotation; Video compression; Videoconference |
Title | Text detection in video frames using hybrid features |
URI | https://ieeexplore.ieee.org/document/5212547 |
Volume | 1 |
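
The pipeline summarized in the abstract (overlapping sliding window, per-window classification, voting, morphological filtering) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the record does not specify the hybrid features, the SVM configuration, the window size, or the voting rule, so everything below is an assumption made for illustration. In particular, `bright_window` is a hypothetical stand-in for the trained SVM, and the names `vote_map`, `detect_text_mask`, `size`, `step`, and `min_votes` are invented here.

```python
from typing import Callable, Iterator, List, Tuple

Frame = List[List[float]]  # grayscale frame as rows of pixel intensities
Mask = List[List[int]]     # binary mask, 1 = text candidate


def sliding_windows(height: int, width: int, size: int, step: int) -> Iterator[Tuple[int, int]]:
    """Yield top-left corners of size x size windows; step < size makes them overlap."""
    for top in range(0, height - size + 1, step):
        for left in range(0, width - size + 1, step):
            yield top, left


def bright_window(window: Frame) -> bool:
    """Toy classifier (assumption): stands in for an SVM applied to hybrid features."""
    total = sum(sum(row) for row in window)
    return total / (len(window) * len(window[0])) > 0.5


def vote_map(frame: Frame, classify: Callable[[Frame], bool], size: int, step: int) -> List[List[int]]:
    """For each pixel, count how many covering windows were classified as text."""
    h, w = len(frame), len(frame[0])
    votes = [[0] * w for _ in range(h)]
    for top, left in sliding_windows(h, w, size, step):
        window = [row[left:left + size] for row in frame[top:top + size]]
        if classify(window):
            for y in range(top, top + size):
                for x in range(left, left + size):
                    votes[y][x] += 1
    return votes


def _window3x3(mask: Mask, y: int, x: int) -> List[int]:
    """Values in the 3x3 neighbourhood of (y, x); out-of-bounds counts as background."""
    h, w = len(mask), len(mask[0])
    return [
        mask[ny][nx] if 0 <= ny < h and 0 <= nx < w else 0
        for ny in (y - 1, y, y + 1)
        for nx in (x - 1, x, x + 1)
    ]


def erode(mask: Mask) -> Mask:
    """Keep a pixel only if its whole 3x3 neighbourhood is foreground."""
    return [[int(all(_window3x3(mask, y, x))) for x in range(len(mask[0]))]
            for y in range(len(mask))]


def dilate(mask: Mask) -> Mask:
    """Set a pixel if any pixel in its 3x3 neighbourhood is foreground."""
    return [[int(any(_window3x3(mask, y, x))) for x in range(len(mask[0]))]
            for y in range(len(mask))]


def detect_text_mask(frame: Frame, classify: Callable[[Frame], bool],
                     size: int = 4, step: int = 2, min_votes: int = 1) -> Mask:
    """Vote over overlapping windows, threshold, then apply a morphological opening."""
    votes = vote_map(frame, classify, size, step)
    mask = [[int(v >= min_votes) for v in row] for row in votes]
    return dilate(erode(mask))  # opening removes isolated false positives


# Usage: an 8x8 frame with a bright 4x4 block standing in for a text region.
frame = [[1.0 if 2 <= y <= 5 and 2 <= x <= 5 else 0.0 for x in range(8)] for y in range(8)]
mask = detect_text_mask(frame, bright_window)
```

In a real system the `classify` callable would wrap a trained SVM over features extracted from each window, and the window size, step, and vote threshold would be tuned on labeled frames. The opening (erosion followed by dilation) is one common choice of morphological filter; the record does not say which operator the paper uses.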