43 changes: 42 additions & 1 deletion _gsocprojects/2026/project_AutoEIT.md
@@ -3,7 +3,48 @@ project: AutoEIT
layout: default
logo: AutoEIT.png
description: |
AutoEIT is an applied machine learning project focused on automating the scoring of the Elicited Imitation Task (EIT), a research tool used to measure global language proficiency. The EIT is widely respected and is available for free in several languages, but the current workflow requires manual audio transcription and human scoring—slow, tedious, and error‑prone. This project aims to build an end‑to‑end system that will: Process raw audio files, perform accurate voice‑to‑text transcription, and automatically evaluate responses using a standardized scoring rubric.
AutoEIT is an applied machine learning project focused on automating the scoring of the Elicited Imitation Task (EIT), a research tool used to measure global language proficiency.

The EIT is widely used and available in multiple languages. However, the current workflow relies on manual audio transcription and human scoring, which is slow, tedious, and prone to errors.

This project aims to develop an end-to-end system that can:

- Process raw audio files
- Perform accurate voice-to-text transcription
- Automatically evaluate responses
- Apply a standardized scoring rubric

---

## System Workflow

The proposed AutoEIT system can follow this workflow:

1. **Audio Input**
   - User provides recorded speech response

2. **Preprocessing**
   - Noise reduction and audio normalization

3. **Speech-to-Text Conversion**
   - Convert audio into text using a transcription model

4. **Text Analysis**
   - Compare transcribed text with expected response

5. **Scoring Mechanism**
   - Apply EIT scoring rules to evaluate correctness

6. **Output**
   - Generate score and feedback
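
Steps 4 and 5 above could be prototyped roughly as follows. This is a minimal sketch under stated assumptions, not the EIT rubric itself: the function names (`normalize`, `word_similarity`, `score_response`), the 0 to 4 score range, and the `THRESHOLDS` values are all hypothetical placeholders. A real scorer would also need to judge whether meaning is preserved, which word overlap alone does not capture.

```python
import re
from difflib import SequenceMatcher


def normalize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens, dropping punctuation."""
    return re.findall(r"[^\W_]+", text.lower())


def word_similarity(expected: str, transcribed: str) -> float:
    """Return a 0.0-1.0 similarity ratio computed over word sequences."""
    a, b = normalize(expected), normalize(transcribed)
    if not a and not b:
        return 1.0
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


# Hypothetical (minimum similarity, score) pairs, highest first.
# These are illustrative placeholders, NOT the published EIT rubric.
THRESHOLDS = [(1.0, 4), (0.8, 3), (0.5, 2), (0.2, 1)]


def score_response(expected: str, transcribed: str) -> int:
    """Map word-level similarity onto an illustrative 0-4 score."""
    similarity = word_similarity(expected, transcribed)
    for minimum, score in THRESHOLDS:
        if similarity >= minimum:
            return score
    return 0


if __name__ == "__main__":
    stimulus = "The cat sat on the mat"
    print(score_response(stimulus, "the cat sat on the mat."))
    print(score_response(stimulus, "the cat sat"))
```

Keeping the thresholds in a single table makes it straightforward to swap in rubric-derived rules later without touching the comparison logic.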

---

## Future Improvements

- Support multiple languages
- Improve accuracy using advanced models (e.g., Whisper)
- Enable real-time transcription and scoring

---
