LVTIA: A new method for keyphrase extraction from scientific video lectures

Due to the growth of technology, the expansion of communication infrastructure and crises of COVID-19 pandemic, e-learning and virtual education is expanding. One of the best ways to access and organize these information is indexing using automatic intelligent methods. Indexing requires assigning ke...

Full description

Saved in:

Bibliographic Details
Published in	Information processing & management Vol. 59; no. 2; p. 102802
Main Authors	Hassani, Hamid, Ershadi, Mohammad Javad, Mohebi, Azadeh
Format	Journal Article
Language	English
Published	Oxford Elsevier Ltd 01.03.2022 Elsevier Science Ltd
Subjects	Algorithms Audio signals COVID-19 Data mining Distance learning Frames (data processing) Indexing Information processing Keyphrase extraction Keyword extraction Multimedia indexing Recall Statistical tests Text mining Video Video lecture indexing Multimedia indexing Text mining Keyword extraction Keyphrase extraction Video lecture indexing
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Due to the growth of technology, the expansion of communication infrastructure and crises of COVID-19 pandemic, e-learning and virtual education is expanding. One of the best ways to access and organize these information is indexing using automatic intelligent methods. Indexing requires assigning keywords or keyphrases to each video, to represent its content. The main focus of this research is to propose an approach by which appropriate keyphrases are assigned to scientific video lectures. For this purpose, a new algorithm called LVTIA, Lecture Video Text mining-base Indexing Algorithm, is proposed in which the textual content of video frames along with the text extracted from audio signal are merged together, and a new keyphrase extraction method is proposed. The proposed method considers new local and global features for each candidate phrases, along with a new feature reflecting the occurrence of each phrase in the audio signals or video frames. The method is implemented using five distinct data sets in English and Persian. The results are evaluated based on precision, recall, F1-measure and MAP@K metrics and compared with some of the well-known keyphrase extraction algorithms. Based on the results, the best MAP@K for English videos is related to LVTIA algorithm with the values of, 0.7912, 0.8069, 0.8069 for k=5,10,15, respectively. In addition, LVTIA is able to provide best MAP@K for Persian videos which are 0.6367, 0.6866, 0.6874 for k=5,10,15, respectively. According to Friedman nonparametric statistical test, the performance of different algorithms in precision, recall, F1-measure metrics, are statistically different from LVTIA as well. •LVTIA is a new method for video lecture indexing using statistical features.•The text extracted from audio and video frames are considered to extract keyphrases.•LVTIA uses global and local features for candidate phrases, using video time intervals.•The proposed algorithm is evaluated based on precision, recall, f-measure and MAP@K.•LVTIA is tested on five datasets with different languages (English and Persian).
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0306-4573 1873-5371
DOI:	10.1016/j.ipm.2021.102802