diff --git a/_gsocprojects/2026/project_AutoEIT.md b/_gsocprojects/2026/project_AutoEIT.md index 2b89871..34e5e37 100644 --- a/_gsocprojects/2026/project_AutoEIT.md +++ b/_gsocprojects/2026/project_AutoEIT.md @@ -3,7 +3,48 @@ project: AutoEIT layout: default logo: AutoEIT.png description: | - AutoEIT is an applied machine learning project focused on automating the scoring of the Elicited Imitation Task (EIT), a research tool used to measure global language proficiency. The EIT is widely respected and is available for free in several languages, but the current workflow requires manual audio transcription and human scoring—slow, tedious, and error‑prone. This project aims to build an end‑to‑end system that will: Process raw audio files, perform accurate voice‑to‑text transcription, and automatically evaluate responses using a standardized scoring rubric. + AutoEIT is an applied machine learning project focused on automating the scoring of the Elicited Imitation Task (EIT), a research tool used to measure global language proficiency. + + The EIT is widely used and available in multiple languages. However, the current workflow relies on manual audio transcription and human scoring, which is slow, tedious, and prone to errors. + + This project aims to develop an end-to-end system with the following features: + + - Process raw audio files + - Perform accurate voice-to-text transcription + - Automatically evaluate responses + - Apply a standardized scoring rubric + +--- + +## System Workflow + +The proposed AutoEIT system can follow this workflow: + +1. **Audio Input** + - User provides recorded speech response + +2. **Preprocessing** + - Noise reduction and audio normalization + +3. **Speech-to-Text Conversion** + - Convert audio into text using a transcription model + +4. **Text Analysis** + - Compare transcribed text with expected response + +5. **Scoring Mechanism** + - Apply EIT scoring rules to evaluate correctness + +6. **Output** + - Generate score and feedback + +--- + +## Future Improvements + +- Support multiple languages +- Improve accuracy using advanced models (e.g., Whisper) +- Enable real-time transcription and scoring ---