Set Marvin free, an interactive speech-based game

Summary

Welcome to Marvin, a demon trapped inside a voodo doll that you need to set free through simple, voice enabled minigames and speech-based interactions.

Marvin currently supports the following intentions:

Intent	Voice command
SPEECH GAME	"I want to set you free ¹"
STORY	"tell me a story"
JOKE	"tell me a joke"
FACT	"I want to learn a fact"
PROVERB	"teach me a proverb"
RIDDLE	"tell me a riddle"
TONGUE-TWISTER	"tell me a tongue-twister"

Marvin is a python project, that uses poetry as a dependency manager. It was designed to run offline on a Raspberry Pi to more easily be embedded inside a plush toy.

Installation Guide

Marvin uses python 3.10.12 and poetry as a dependency manager. If you do not have poetry installed in your system, please refer to poetry's installation guide. In order to properly build the virtual environment, run the following commands (skip if already installed):

pyenv install 3.10.12
brew install portaudio ffmpeg
pipx install poetry

Run the following command to install the dependencies of this project

poetry install --no-root

Marvin relies on pre-synthesized audio files, that are retrieved following the user query. This speeds up inference, meaning that all content is already pre-generated and deterministic prior to any interactions. In order to generate the user content, please run the following script:

cd tts-gen/robot/ && poetry run python tts.py

This script uses a text-to-speech custom recipe to generate a robotic voice. This can take several minutes. Once the script finishes, and all content is properly generated, Marvin is ready to be launched. You only have to generate the text-to-speech audio once.

User Guide

Export the path to the config file:

export CONFIG_PATH=$(pwd)/config/config.yaml

Please refer to the following command to launch a terminal-based instance of Marvin:

poetry run python src/main.py

Alternatively, you can run the run.sh. Note that this bash script also launches the synthatic data generation script if no generated audio is detected.

bash run.sh

Once the system properly loads, we are finally ready to interact with Marvin. Say 'Marvin' to wake the system up and ask him to tell you any one of the supported intentions for a brief interaction.

e.g.

   - you: "Marvin"
   - Marvin: "Yes?"
   - You: "Tell me a joke."
   - Marvin: "Fine... Why was the math book so sad? It had too many problems."

Furthermore, you can tell Marvin that you want to set him free in order to start a brief game with many possible endings.

e.g.

   - you: "Marvin"
   - Marvin: "What now?"
   - You: "I want to set you free."
   - Marvin: "So you want to set me free hein? <game class starts>."

Minigames

Marvin contains 4 speech-based minigames that are prompted when someone attempts to free him. These games can have different difficulty levels based on user preference. The games are as follows:

Pitch-based game (North Section): In this game the user needs to match the pitch of a short series of musical notes
Animal guessing game (South Section): In this game the user needs to guess the animal sound that was played
Memory game (East Section): The user needs to say the sequence in whicha set of three different sounds were played (piano, drum and guitar)
Reverse game (West Section): The user needs to flip the words in a sentence (e.g. "cat sees bird" --> "bird sees cat")

The following table illustrates the difficulty levels for each game:

Game	easy	medium	hard
pitch game	2 notes	3 notes	5 notes
animal game	1 sound	1 sound	1 sound
memory game	2 sounds	3 sounds	5 sounds
reverse game	3 words	4 words	5 words

Once the four minigames are cleared, the user can issue the magic words to finally free Marvin. However, speaking them in the different orders may trigger unforeseen consequences, leading to different endings.

Microphone Setup

Marvin requires a microphone to function, whose ID can vary depending on device. Currently, the system is configured to default to one of the following device names: "sysdefault" or "default", which are common in unix based system. Nevertheless, a custom microphone name can be added on the config file (./config/config.yaml). Please refer to the microphone name variable in this config (defaulted to a Macbook Pro Microphone device name) and change it accordingly.

The following simple script can be used to list all of the available devices and their corresponding names.

poetry run python extras/mic_idx_finder.py

Tests

In order to run the tests please refer to the following command:

poetry run pytests tests/

Notes

Marvin relies on a generic pre-trained keyword spotter and speech recognizer. Therefore, Marvin's transcriptions and occasional false activations are a consequence of this aspect. Over-articulation can improve his understanding.

Marvin also relies on simple regular expressions (regex) for intent classification.

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

References

https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/Online_Offline_Speech_Commands_Demo.ipynb

Bold words correspond to the main regex expression being matched for each intent ↩

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
config		config
content		content
extras		extras
notebooks		notebooks
samples		samples
src		src
tests		tests
tts-gen/robot		tts-gen/robot
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
create_content.sh		create_content.sh
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Set Marvin free, an interactive speech-based game

Summary

Installation Guide

User Guide

Minigames

Microphone Setup

Tests

Notes

License

References

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Set Marvin free, an interactive speech-based game

Summary

Installation Guide

User Guide

Minigames

Microphone Setup

Tests

Notes

License

References

Footnotes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages