Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
22 changes: 22 additions & 0 deletions _gsocprojects/2026/project_AutoEIT.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,5 +6,27 @@ description: |
AutoEIT is an applied machine learning project focused on automating the scoring of the Elicited Imitation Task (EIT), a research tool used to measure global language proficiency. The EIT is widely respected and is available for free in several languages, but the current workflow requires manual audio transcription and human scoring—slow, tedious, and error‑prone. This project aims to build an end‑to‑end system that will: Process raw audio files, perform accurate voice‑to‑text transcription, and automatically evaluate responses using a standardized scoring rubric.

---
## Evaluation Strategy

To ensure the effectiveness of the AutoEIT system, the following evaluation approach can be used:

- **Transcription Accuracy**
- Measure Word Error Rate (WER) to evaluate speech-to-text performance

- **Scoring Accuracy**
- Compare automated scores with human-evaluated scores

- **Robustness**
- Test system performance on noisy and diverse audio inputs

- **Consistency**
- Ensure the system produces stable and repeatable results across multiple runs

---
## Dataset Considerations

- Use multilingual speech datasets
- Include diverse accents and speaking speeds
- Ensure balanced dataset for fair evaluation

{% include gsoc_project.ext %}