About • Installation • How To Use • Credits • License
## About

This repository contains an implementation of an intelligent voice assistant. The solution combines Automatic Speech Recognition (ASR), Text-to-Speech (TTS), and LLM models.

See the LauzHack Workshop for a discussion of how to create intelligent voice assistants.
## Installation

To install the assistant, follow these steps:
- (Optional) Create and activate a new environment using `conda` or `venv` (+ `pyenv`).

  a. `conda` version:

  ```bash
  # create env
  conda create -n project_env python=PYTHON_VERSION

  # activate env
  conda activate project_env
  ```

  b. `venv` (+ `pyenv`) version:

  ```bash
  # create env
  ~/.pyenv/versions/PYTHON_VERSION/bin/python3 -m venv project_env

  # alternatively, using default python version
  python3 -m venv project_env

  # activate env
  source project_env/bin/activate
  ```
- Install all required packages:

  ```bash
  pip install -r requirements.txt
  ```
- (Optional) Install `pre-commit`:

  ```bash
  pre-commit install
  ```
- Create an API key in Groq. Create a new file named `.env` in the root directory and copy-paste your API key into it (see the sketch below).
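The exact environment variable name the code reads is not stated here; `GROQ_API_KEY` is a common convention and an assumption in this sketch, so check the project's config or source for the actual name:

```bash
# .env — hypothetical variable name; verify against the project source
GROQ_API_KEY=your_groq_api_key_here
```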
## How To Use

To record and play sound, you need to define your hardware settings. See the PyTorch documentation (the information about `ffmpeg`, specifically) and this tutorial for more details. Usually, the format is `alsa` for Linux systems and `avfoundation` for macOS systems. One way to list the available devices is shown in the sketch below.
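As a minimal sketch, assuming `ffmpeg` and the standard ALSA utilities are installed, you can list the capture devices on your machine (the device names you see will differ):

```bash
# macOS: list avfoundation capture devices (standard ffmpeg flags)
ffmpeg -f avfoundation -list_devices true -i ""

# Linux: list ALSA capture devices
arecord -l
```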
When the hardware is known, you can start AI AudioBot using this command:

```bash
python3 run.py stream_reader.source=YOUR_MICROPHONE \
    stream_reader.format=YOUR_FORMAT \
    stream_writer.format=YOUR_FORMAT
```
You can also change other parameters via Hydra options. See `src/configs/audio_bot.yaml`.
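As a sketch of the standard Hydra command-line workflow (the override keys beyond those shown above live in `src/configs/audio_bot.yaml` and are not listed here):

```bash
# print the composed config without running the bot (built-in Hydra flag)
python3 run.py --cfg job

# any key from the printed config can be overridden the same way as above
python3 run.py stream_reader.source=YOUR_MICROPHONE stream_reader.format=alsa
```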
## Credits

HuggingFace was used for the ASR and TTS models (Spectrogram Generator and Vocoder). The Groq API with the `llama-3-8b-8192` model was used for the LLM. The KWS model is taken from the 2022 version of the HSE DLA Course.