Skip to content

Conversation

@mayurdd
Copy link

@mayurdd mayurdd commented Dec 10, 2025

Adding a param to the config while connecting to the session

Please describe the changes in your PR. If it is addressing an issue, please reference that as well.

Currently, the ElevenLabsRealtimeSTTService doesn't allow us to pass include_language_detection, which can be used to get the language code in the response of the Scribe ve realtime model. After discussing with the Elevenlabs team that this is the parameter name that can be passed to get the language code, I'm creating this PR

Adding a param to the config while connecting to the session
@mayurdd mayurdd changed the title Adding include_language_detection to Elevenlabs Realtime STT Adding include_language_detection param to Elevenlabs Realtime STT Dec 10, 2025
@markbackman
Copy link
Contributor

markbackman commented Dec 10, 2025

I'm happy to include this, but I can't find it in the reference docs. Can you point to where this parameter is listed in the reference docs? I'm not seeing it here:
https://elevenlabs.io/docs/api-reference/speech-to-text/v-1-speech-to-text-realtime

@mayurdd
Copy link
Author

mayurdd commented Dec 10, 2025

I'm happy to include this, but I can't find it in the reference docs. Can you point to where this parameter is listed in the reference docs? I'm not seeing it here: https://elevenlabs.io/docs/api-reference/speech-to-text/v-1-speech-to-text-realtime

Thanks for pointing out, I will flag this to their team.
The 11labs team suggested this parameter to get language_code in the response else this property will always come as None.

Below is the current response when we connect to Scribe v2 Realtime.
ElevenLabs session started: {'message_type': 'session_started', 'session_id': '15b88595fcc1dba279eefb', 'config': {'sample_rate': 8000, 'audio_format': 'pcm_8000', 'language_code': None, 'timestamps_granularity': 'word', 'vad_commit_strategy': False, 'vad_silence_threshold_secs': 1.5, 'vad_threshold': 0.4, 'min_speech_duration_ms': 100, 'min_silence_duration_ms': 100, 'max_tokens_to_recompute': 5, 'model_id': 'scribe_v2_realtime', 'disable_logging': False, 'include_timestamps': False, 'include_language_detection': False}}

Since this was launched 10 days ago, I'm sure they will be updating the docs soon to include include_language_detection and timestamps_granularity

@mayurdd
Copy link
Author

mayurdd commented Dec 11, 2025

I'm happy to include this, but I can't find it in the reference docs. Can you point to where this parameter is listed in the reference docs? I'm not seeing it here: https://elevenlabs.io/docs/api-reference/speech-to-text/v-1-speech-to-text-realtime

@markbackman Met with the 11labs team, and the documentation is updated to have that param
https://elevenlabs.io/docs/api-reference/speech-to-text/v-1-speech-to-text-realtime

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants