Skip to content

ahmedemam2/presenter_evaluation_multimodality

Repository files navigation

Evaluation of Multimodality in Presentation Delivery

This project endeavored to quantitatively evaluate a presenter's performance by scrutinizing four integral aspects: vocal attributes, physical posture, ocular movement (eye gaze), and facial expressions.

Here is a demo for the project.

demo.mp4

Libraries used: Mediapipe, NLTK, Keras, Sklearn, pandas, numpy.

To try the system, run the python file integrate.py upload your video and you will receive a result like this Screenshot 2023-06-20 192844 , which will be saved locally as a text file.

In assessing vocal attributes: Both voice emotion and pronunciation were taken into consideration. The Ravdess Voice Emotion Dataset served as a valuable resource for emotional evaluation. For pronunciation analysis, Google Cloud's Speech-to-Text API was employed. The resultant output was tokenized and lemmatized before the Jaccard similarity index was applied to compare the presenter's script with the actual delivery.

For posture evaluation: A dataset was generously supplied by the Mass Communication faculty at MSA, comprising multiple videos categorized into 'good' and 'poor' postural habits. This allowed for model training and testing, even enabling the use of ensemble methods for optimally performative models.

Regarding eye gaze: This aspect was scrutinized by utilizing Euclidean distance calculations to derive the ratio between the iris size and one side of the eye. This method enabled an understanding of the presenter's focus, as the script's optimal position is centralized.

In relation to facial expression: The DeepFace library was employed for the face emotion detection.

Future prospects include the implementation of Convolutional Neural Networks (CNN) on a facial emotion dataset to enhance the evaluation process.

Results: image

In summary, this quantitative approach allows for a comprehensive evaluation of a presenter's performance, encapsulating voice, posture, eye gaze, and facial expressions. However, the presenter's appearance, a variable not yet explored in this project, also contribute to the overall evaluation.

About

In this project, I attempted to evaluate the presenter based on fourfeatures:voice, posture, eye gaze, and facial expressions.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages