Multi-modal Score Fusion and Decision Trees for Explainable Automatic Job Candidate Screening from Video CVs

Bibliographic Details
Published in	IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 1651-1659
Main Authors	Kaya, Heysem; Gurpinar, Furkan; Salah, Albert Ali
Format	Conference Proceeding
Language	English
Published	IEEE 01.07.2017
Subjects	Decision trees; Face; Feature extraction; Interviews; Kernel; Training; Visualization
Summary:	We describe an end-to-end system for explainable automatic job candidate screening from video CVs. Audio, face and scene features are first computed from an input video CV using rich feature sets. Each modality is fed into a modality-specific regressor to predict apparent personality traits and a variable indicating whether the subject will be invited to the interview. The base learners are stacked with an ensemble of decision trees to produce the outputs of the quantitative stage, and a single decision tree, combined with a rule-based algorithm, produces interview decision explanations based on the quantitative results. The proposed system ranked first in both the quantitative and qualitative stages of the CVPR 2017 ChaLearn Job Candidate Screening Coopetition.
ISSN:	2160-7516
DOI:	10.1109/CVPRW.2017.210
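
Illustrative sketch: the two stages named in the summary (score fusion across modalities, then a single decision tree whose path is turned into an explanation) can be pictured roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation. Plain averaging stands in for the stacked ensemble of decision trees, and the trait names, scores, thresholds, tree shape, and the names fuse_scores, Node and explain are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# Assumed trait names, for illustration only.
TRAITS = ["agreeableness", "conscientiousness", "extraversion",
          "neuroticism", "openness"]


def fuse_scores(per_modality: Dict[str, Dict[str, float]]) -> Dict[str, float]:
    """Average each trait over the modalities.

    Assumption: mean fusion is a simple stand-in for the stacked
    ensemble of decision trees mentioned in the summary.
    """
    fused = {}
    for trait in TRAITS:
        values = [scores[trait] for scores in per_modality.values()]
        fused[trait] = sum(values) / len(values)
    return fused


@dataclass
class Node:
    """Binary tree node; a leaf carries a decision, an inner node a test."""
    trait: Optional[str] = None
    threshold: float = 0.0
    low: Optional["Node"] = None
    high: Optional["Node"] = None
    decision: Optional[str] = None


def explain(tree: Node, scores: Dict[str, float]) -> Tuple[str, List[str]]:
    """Walk the tree and record one readable reason per test on the path."""
    reasons = []
    node = tree
    while node.decision is None:
        value = scores[node.trait]
        if value >= node.threshold:
            reasons.append(f"{node.trait} is high ({value:.2f} >= {node.threshold:.2f})")
            node = node.high
        else:
            reasons.append(f"{node.trait} is low ({value:.2f} < {node.threshold:.2f})")
            node = node.low
    return node.decision, reasons


# Hypothetical hand-built tree; thresholds are made up, not learned.
TREE = Node(
    trait="conscientiousness", threshold=0.5,
    low=Node(decision="do not invite"),
    high=Node(
        trait="agreeableness", threshold=0.5,
        low=Node(decision="do not invite"),
        high=Node(decision="invite"),
    ),
)

# Hypothetical per-modality regressor outputs for one video CV.
predictions = {
    "audio": {"agreeableness": 0.6, "conscientiousness": 0.7,
              "extraversion": 0.5, "neuroticism": 0.4, "openness": 0.6},
    "face": {"agreeableness": 0.5, "conscientiousness": 0.6,
             "extraversion": 0.6, "neuroticism": 0.5, "openness": 0.7},
    "scene": {"agreeableness": 0.7, "conscientiousness": 0.5,
              "extraversion": 0.4, "neuroticism": 0.6, "openness": 0.5},
}

fused = fuse_scores(predictions)
decision, reasons = explain(TREE, fused)
print(decision)
for reason in reasons:
    print(" -", reason)
```

Tracing the path through one small tree keeps every explanation tied to explicit threshold tests on the fused scores, which is the general appeal of using a single tree for the explanation stage.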