GMTalker

English | 中文

GMTalker is an interactive digital human rendered in Unreal Engine, developed by the Media Intelligence Team at Guangming Laboratory. The system integrates speech recognition, speech synthesis, natural language understanding, and lip-sync animation driving. It can be deployed on Windows, Linux, and Android, and the entire project needs only 2GB of VRAM to run. This project demonstrates 3D cartoon digital human avatars and is suitable for presentations, extension, and commercial integration.

System Architecture Diagram

🧱 Features

Cross-Platform: Deploy on Windows, Linux, and Android with a single codebase.
Offline & Real-Time: Fully offline streaming dialogue with millisecond response.
Smart Interaction: Wake-up, interrupt, and voice cloning support.
Model Flexibility: Compatible with LLMs, custom Agents, and local knowledge bases.
Avatar Customization: Custom characters with lip-sync and micro-expressions.
Quick Setup: Easy backend configuration, no extra dependencies needed.
Hardware Support: Runs on Huawei Ascend NPU or CPU-only mode.

🔥 NEWS

🗓️ 2025.10.15: The backend now supports Docker deployment; see the Docker Documentation for details.
🗓️ 2025.10.10: The project now offers comprehensive support for both GPU and NPU 910B servers. FunASR speech recognition runs on ONNX Runtime, while TTS speech synthesis uses torch_npu.
🗓️ 2025.9.12: The project now offers comprehensive support for Android, Linux, Web, and Windows platforms, with no GPU required on the client side.
🗓️ 2025.9.1: Upgraded the model with a lightweight lip-sync driver and packaged the complete UE project into a standalone executable (.exe), allowing it to run smoothly on ordinary laptops.
🗓️ 2025.8.25: Updated the Import UE avatar | Character Overview | Animation Overview documents.
🗓️ 2025.8.19: Released UE5 project files, including the GuangDUNDUN character (jointly developed by Guangming Lab and the Shenzhen Guangming District Government).
🗓️ 2025.8.12: Added WebUI usage guide for quick project deployment.

💬 Join Our Community

Scan QR code to join GMTalker technical exchange group

📦 Quick Start

After configuring the backend, download the installation package and launch the application. With the FunASR and MeloTTS packages, it starts with one click, with no additional environment setup or dependencies required.

Hardware Requirements

Operating System: Windows 10+ / Linux
Memory: 8GB+ RAM
GPU Support: Minimum 2GB VRAM (NVIDIA GPU with CUDA support recommended)
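
The minimums above can be checked before installing. The following is a minimal sketch for Linux, assuming `/proc/meminfo` is readable and, for the GPU check, that NVIDIA's `nvidia-smi` tool is on the PATH. The helper names (`meets_minimum`, `total_ram_mib`, `total_vram_mib`, `check_hardware`) are illustrative and not part of GMTalker.

```shell
#!/bin/sh
# Pre-flight check against the stated minimums: 8 GB RAM, 2 GB VRAM.
# All helper names here are illustrative; they are not part of GMTalker.

MIN_RAM_MIB=$((8 * 1024))
MIN_VRAM_MIB=$((2 * 1024))

# Succeeds when the measured value ($1, MiB) is at least the required value ($2, MiB).
# A non-numeric or empty measurement counts as a failure.
meets_minimum() {
  [ "$1" -ge "$2" ] 2>/dev/null
}

# Total system RAM in MiB, read from /proc/meminfo (Linux only; reported in kB).
total_ram_mib() {
  awk '/^MemTotal:/ { printf "%d\n", $2 / 1024 }' /proc/meminfo 2>/dev/null
}

# Total VRAM of the first NVIDIA GPU in MiB; prints nothing if nvidia-smi is absent.
total_vram_mib() {
  command -v nvidia-smi >/dev/null 2>&1 || return 0
  nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits 2>/dev/null | head -n 1
}

# Print a short report; warnings only, nothing is enforced.
check_hardware() {
  ram_mib="$(total_ram_mib)"
  if meets_minimum "$ram_mib" "$MIN_RAM_MIB"; then
    echo "RAM:  ${ram_mib} MiB (ok)"
  else
    echo "RAM:  ${ram_mib:-unknown} MiB (below the 8 GB guideline)"
  fi

  vram_mib="$(total_vram_mib)"
  if [ -z "$vram_mib" ]; then
    echo "VRAM: no NVIDIA GPU detected via nvidia-smi"
  elif meets_minimum "$vram_mib" "$MIN_VRAM_MIB"; then
    echo "VRAM: ${vram_mib} MiB (ok)"
  else
    echo "VRAM: ${vram_mib} MiB (below the 2 GB guideline)"
  fi
}
```

Source the file and call `check_hardware` to print the report. `MemTotal` excludes memory reserved by the kernel and firmware, so a machine with 8 GB installed can report slightly less than 8192 MiB; treat the RAM line as a hint rather than a hard failure.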

Cloning project

git clone https://github.com/feima09/GMTalker.git

One click start

Windows:

webui.bat

Linux:

chmod +x webui.sh
./webui.sh

Docker Deployment (Recommended):

# Using docker-compose
docker-compose up -d

# Or using docker cli
docker run -d \
  --name gmtalker \
  -p 5002:5002 -p 7860:7860 \
  -v "$(pwd)/configs":/app/configs \
  huiji2333/gmtalker:latest

👉 Docker Deployment Guide

Accessing Services

Main service: http://127.0.0.1:5002
Web configuration interface: http://127.0.0.1:7860
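
Both endpoints can be probed from a terminal once the backend has started. The following is a minimal sketch, assuming `curl` is installed and that both services answer plain HTTP on the ports listed above. `service_url` and `wait_for_service` are illustrative helper names, and the default retry count and delay are arbitrary choices.

```shell
#!/bin/sh
# Helpers for checking that the GMTalker backend is reachable.
# Names, retry count, and delay are illustrative assumptions.

# Build the base URL for a host ($1) and port ($2).
service_url() {
  printf 'http://%s:%s\n' "$1" "$2"
}

# Poll a URL until it answers or the attempts run out.
# Usage: wait_for_service URL [ATTEMPTS] [DELAY_SECONDS]
wait_for_service() {
  url="$1"
  attempts="${2:-10}"
  delay="${3:-2}"
  i=1
  while [ "$i" -le "$attempts" ]; do
    # Any HTTP response (even a 404) means the server is listening, so
    # --fail is not used; curl exits non-zero only on connection errors.
    if curl --silent --output /dev/null --max-time 5 "$url"; then
      echo "up: $url"
      return 0
    fi
    # Sleep only if another attempt remains.
    if [ "$i" -lt "$attempts" ]; then
      sleep "$delay"
    fi
    i=$((i + 1))
  done
  echo "not reachable: $url" >&2
  return 1
}
```

After sourcing the file, run `wait_for_service "$(service_url 127.0.0.1 5002)"` for the main service and `wait_for_service "$(service_url 127.0.0.1 7860)"` for the web configuration interface. A non-zero exit status means the port never answered, which usually points to the backend still starting or a port mapping problem.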

👉 Click here to view the WebUI User Guide

Download UE Executable

Download and launch GLM3.exe:

Windows version: Google Drive | Baidu Netdisk
Linux version: Google Drive | Baidu Netdisk

Deploy Essential Local AI Services

Download the FunASR speech recognition lazy package here, then run run_server_2pass.bat to start it with one click.
Download the MeloTTS speech synthesis lazy package here, then run start.bat to start it with one click.

👉 To develop from source code, click here to view the complete installation guide, and refer to the backend overall architecture.

📊 Comparison with Other Open-Source Solutions

Project Name	3D Avatar	UE5 Rendering	Voice Input	Voice Interruption	Lip Sync	Body Movements	Local Deployment (Win)	Star ⭐
LiveTalking	❌	❌	❌	❌	✅	❌	❌	6.1k
OpenAvatarChat	✅	❌	✅	❌	✅	❌	❌	1.6k
MNN	✅	❌	✅	❌	✅	✅	❌	12.6k
Fay	❌	✅	✅	✅	✅	✅	✅	11.6k
GMTalker	✅	✅	✅	✅	✅	✅	✅	🚀

✅ indicates full support for the feature, while ❌ indicates it is missing or unsupported.

📚 About Guangming Laboratory

The Guangdong Provincial Laboratory of Artificial Intelligence and Digital Economy (Shenzhen) (hereinafter referred to as Guangming Laboratory) is one of the third batch of Guangdong Provincial Laboratories approved for construction by the Guangdong Provincial Government. The laboratory focuses on cutting-edge theories and future technological trends in global artificial intelligence and the digital economy, dedicated to serving major national development strategies and significant needs.

Relying on Shenzhen's industrial, geographical, and policy advantages, Guangming Laboratory brings together global scientific research forces and fully unleashes the agglomeration effect of scientific and technological innovation resources. Centered around the core task of building a domestic AI computing power ecosystem, and driven by the development of multimodal AI technology and its application ecosystem, the laboratory strives to break through key technologies, produce original achievements, and continuously advance technological innovation and industrial empowerment.

The laboratory's goal is to accelerate the supply of diversified applications and full-scenario penetration of artificial intelligence technology, achieving mutual reinforcement of technological innovation and industrial driving forces, and continuously promoting the generation of new quality productivity powered by AI.

🌐 Contact Us (Project Collaboration)

Website: Guangming Laboratory Official Site
Email: [email protected]/[email protected]

Acknowledgements
Thanks to all team members and partners who participated in the development and support of the GMTalker project. (Fei Ma, Hongbo Xu, Minghui Li, Yiming Luo, Haijun Zhu, Yiyao Zhuo, Chao Song)

License

This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).

You are free to use, modify, and share the code and assets for non-commercial purposes, provided that you give appropriate credit.

🔗 Full License Text
🔍 Human-readable Summary
