Vyaktitv: A Multimodal Peer-to-Peer Hindi Conversations based Dataset for Personality Assessment

Automatically detecting personality traits can aid several applications, such as mental health recognition and human resource management. Most datasets introduced for personality detection so far have analyzed these traits for each individual in isolation. However, personality is intimately linked t...

Full description

Saved in:
Bibliographic Details
Published in2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM) pp. 103 - 111
Main Authors Khan, Shahid Nawaz, Leekha, Maitree, Shukla, Jainendra, Shah, Rajiv Ratn
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.09.2020
Subjects
Online AccessGet full text
DOI10.1109/BigMM50055.2020.00024

Cover

Loading…
More Information
Summary:Automatically detecting personality traits can aid several applications, such as mental health recognition and human resource management. Most datasets introduced for personality detection so far have analyzed these traits for each individual in isolation. However, personality is intimately linked to our social behavior. Furthermore, surprisingly little research has focused on personality analysis using low resource languages. To this end, we present a novel peer-to-peer Hindi conversation dataset, Vyaktitv 1 . It consists of high-quality audio and video recordings of the participants, with Hinglish 2 textual transcriptions for each conversation. The dataset also contains a rich set of socio-demographic features, like income, cultural orientation, amongst several others, for all the participants. We release the dataset for public use, as well as perform preliminary statistical analysis along the different dimensions. Finally, we also discuss various other applications and tasks for which the dataset can be employed.
DOI:10.1109/BigMM50055.2020.00024