TranscribeTools

Introduction

TranscribeTools is a collection of commandline tools for transcription and translation, which currently only includes TranscribeFolder. Transcribefolder is a Python application that transcribes all sound files in a configurable folder using a local version of the Whisper model. Whisper is an automatic speech recognition system created by OpenAI, which can transcribe audio files in multiple languages and translate those languages into English.

The model must be run locally to comply with the General Data Protection Regulation (GDPR). This is because, when using OpenAI’s transcription service (based on the Whisper model), OpenAI collects user data from prompts and files uploaded by the user. These audio files may contain personal data from which people can be identified. Therefore, using OpenAI’s service without a processing agreement is not allowed within organisations.

On the other hand, using TranscribeTools to run the Whisper model on your own device means that files containing personal data will not be collected. The program essentially downloads the model—released as open-source software in 2022—and uses the command line to select a folder, which it then transcribes, all locally.

It works with audio files under 25 MB in the following formats: mp3, wav, mp4, mpeg, mpga, m4a, and webm. It also allows the user to choose the model size. The larger models are more accurate but slower, while the smaller models are faster but less accurate. One exception is the turbo model, which is a optimized version of the large model that is relatively quick with a minimal decrease in accuracy.

Furthermore, the application utilizes the terminal, a text-based interface to interact with the computer, to install and use Whisper. This might sound intimidating but is hopefully manageable when following the instructions given below. The terminal is already installed in most cases.

Details

using Python 3.12.7, openai-whisper https://pypi.org/project/openai-whisper/ (current version 20250625,) does not support 3.13 yet.

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

Setup

Before installing TranscribeTools, you need to download a package manager to install dependencies—pieces of code that the application relies on. On macOS, we will use Homebrew and uv; on Windows, we will only use uv. Then, we will install TranscribeTools.

To run the following prompts, one must copy and paste the commands in the command line and press enter. During the setup, it might be necessary to restart the terminal after installing homebrew, uv, or transcribetools in order to be able to proceed.

Package manager

On Windows

Open Windows Powershell or the Command shell
Run prompt to install uv:

powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

On macOS:

Open Terminal
Run prompt to install brew:

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

Run prompt to install uv:

brew install uv

Install tools

Install the (commandline) tools in this project. For now it's only transcribefolder:

uv tool install transcribetools

Command-line usage

Getting started

To get started with transcribefolder, simply follow the instructions below. The first time you run it, a configuration file will be created with the current folder and model, which will be used from then on. If needed, you can update the configuration by running the command:

transcribefolder config create

Run the prompt

transcribefolder transcribe

Select which folder to transcribe
Enter the name of the Whisper model you'd like to use
Press enter to use the default configuration file name

Prompt list

Run prompt to see the possible commands and options:

transcribefolder --help

Run prompt to create a configuration file with the right folder to transcribe and the right whisper model to use:

transcribefolder config create

Run prompt to show the default configuration file (transcribefolder.toml):

transcribefolder config show

Run prompt to show the specified configuration file:

transcribefolder -c name of the config file.toml config show

Run prompt to transcribe all sound files in the selected folder using the default configuration file (transcribefolder.toml):

transcribefolder transcribe

Run prompt to transcribe all sound files in the selected folder using a specific configuration file:

transcribefolder -c name of the config file.toml transcribe

Known issues

The deepl_translate command is not yet working.
The duration and realtime factor are not available for processed files in the formats: mp4, mpeg, mpga, m4a, and webm.

Plans

Make it a local service, running in the background
Investigate options to let it run on a central computer, as a service
Create Docker image
Add speaker partitioning (see TranscribeWhisperX)
Adjust models using PyTorch (more control)

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
.idea		.idea
click_spinner		click_spinner
data		data
tests		tests
transcribetools.egg-info		transcribetools.egg-info
transcribetools		transcribetools
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
cloud_openai.py		cloud_openai.py
detect_language.py		detect_language.py
main.py		main.py
pyproject.toml		pyproject.toml
readme.md		readme.md
translate0.py		translate0.py
translate1.py		translate1.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TranscribeTools

Introduction

Details

License

Setup

Package manager

On Windows

On macOS:

Install tools

Command-line usage

Getting started

Prompt list

Known issues

Plans

Documentation about Whisper on the cloud and local

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

ndegroot/transcribetools

Folders and files

Latest commit

History

Repository files navigation

TranscribeTools

Introduction

Details

License

Setup

Package manager

On Windows

On macOS:

Install tools

Command-line usage

Getting started

Prompt list

Known issues

Plans

Documentation about Whisper on the cloud and local

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages