Extractive Text-Based Summarization of Arabic Videos: Issues, Approaches and Evaluations

In this paper, we present and evaluate a method for extractive text-based summarization of Arabic videos. The algorithm is proposed in the scope of the AMIS project that aims at helping a user to understand videos given in a foreign language (Arabic). For that, the project proposes several strategie...

Full description

Saved in:

Bibliographic Details
Published in	Arabic Language Processing: From Theory to Practice pp. 65 - 78
Main Authors	Menacer, Mohamed Amine, González-Gallardo, Carlos-Emiliano, Abidi, Karima, Fohr, Dominique, Jouvet, Denis, Langlois, David, Mella, Odile, Sadat, Fatiha, Torres-Moreno, Juan-Manuel, Smaïli, Kamel
Format	Book Chapter
Language	English
Published	Cham Springer International Publishing
Series	Communications in Computer and Information Science
Subjects	Automatic speech recognition Segmentation Text summarization Video summarization
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In this paper, we present and evaluate a method for extractive text-based summarization of Arabic videos. The algorithm is proposed in the scope of the AMIS project that aims at helping a user to understand videos given in a foreign language (Arabic). For that, the project proposes several strategies to translate and summarize the videos. One of them consists in transcribing the Arabic videos, summarizing the transcriptions, and translating the summary. In this paper we describe the video corpus that was collected from YouTube and present and evaluate the transcription-summarization part of this strategy. Moreover, we present the Automatic Speech Recognition (ASR) system used to transcribe the videos, and show how we adapted this system to the Algerian dialect. Then, we describe how we automatically segment into sentences the sequence of words provided by the ASR system, and how we summarize the obtained sequence of sentences. We evaluate objectively and subjectively our approach. Results show that the ASR system performs well in terms of Word Error Rate on MSA, but needs to be adapted for dealing with Algerian dialect data. The subjective evaluation shows the same behaviour than ASR: transcriptions for videos containing dialectal data were better scored than videos containing only MSA data. However, summaries based on transcriptions are not as well rated, even when transcriptions are better rated. Last, the study shows that features, such as the lengths of transcriptions and summaries, and the subjective score of transcriptions, explain only 31% of the subjective score of summaries.
ISBN:	3030329585 9783030329587
ISSN:	1865-0929 1865-0937
DOI:	10.1007/978-3-030-32959-4_5