GitHub - ahmedemam2/presenter_evaluation_multimodality: In this project, I attempted to evaluate the presenter based on fourfeatures:voice, posture, eye gaze, and facial expressions.

Evaluation of Multimodality in Presentation Delivery

This project endeavored to quantitatively evaluate a presenter's performance by scrutinizing four integral aspects: vocal attributes, physical posture, ocular movement (eye gaze), and facial expressions.

Here is a demo for the project.

demo.mp4

Libraries used: Mediapipe, NLTK, Keras, Sklearn, pandas, numpy.

To try the system, run the python file integrate.py upload your video and you will receive a result like this , which will be saved locally as a text file.

In assessing vocal attributes: Both voice emotion and pronunciation were taken into consideration. The Ravdess Voice Emotion Dataset served as a valuable resource for emotional evaluation. For pronunciation analysis, Google Cloud's Speech-to-Text API was employed. The resultant output was tokenized and lemmatized before the Jaccard similarity index was applied to compare the presenter's script with the actual delivery.

For posture evaluation: A dataset was generously supplied by the Mass Communication faculty at MSA, comprising multiple videos categorized into 'good' and 'poor' postural habits. This allowed for model training and testing, even enabling the use of ensemble methods for optimally performative models.

Regarding eye gaze: This aspect was scrutinized by utilizing Euclidean distance calculations to derive the ratio between the iris size and one side of the eye. This method enabled an understanding of the presenter's focus, as the script's optimal position is centralized.

In relation to facial expression: The DeepFace library was employed for the face emotion detection.

Future prospects include the implementation of Convolutional Neural Networks (CNN) on a facial emotion dataset to enhance the evaluation process.

Results:

In summary, this quantitative approach allows for a comprehensive evaluation of a presenter's performance, encapsulating voice, posture, eye gaze, and facial expressions. However, the presenter's appearance, a variable not yet explored in this project, also contribute to the overall evaluation.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.idea		.idea
__pycache__		__pycache__
models		models
00039.wav		00039.wav
EyeGaze.py		EyeGaze.py
MasscomDatasetLandmarks.csv		MasscomDatasetLandmarks.csv
MasscomDatasettemp2.csv		MasscomDatasettemp2.csv
README.md		README.md
Ravdess_standard_nofear.csv		Ravdess_standard_nofear.csv
Top.wav		Top.wav
demo.py		demo.py
faceEmotions.py		faceEmotions.py
ftw.py		ftw.py
integration.py		integration.py
posture.py		posture.py
template.py		template.py
testing.py		testing.py
testing_posture.py		testing_posture.py
textSimilarity.py		textSimilarity.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages