Hi_LLM_Chat is a simple yet powerful multi-turn chatbot that runs entirely on your local machine. It's built upon the Llama 3.2 1B model and aims for natural conversations with users. Experience real-time interaction with its streaming text generation feature! 🚀
- 🖥️ 100% Local Execution: No internet connection or external API calls needed (after model download). Everything is processed on your PC.
- 🔄 Multi-Turn Conversation Support: Remembers previous parts of the conversation for contextual responses.
- 🦙 Utilizes Llama 3.2 1B Model: Leverages a capable yet relatively lightweight large language model.
- 💨 Real-time Streaming Output: Responses appear token-by-token, like someone typing in real-time, making interactions more engaging.
- 🔧 Simple Structure: Focuses on core chatbot functionality, making it easy to understand and modify.
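
To illustrate how the multi-turn context and streaming output above fit together, here is a minimal sketch assuming the Hugging Face `transformers` backend with HF-format Llama 3.2 1B Instruct weights in `./llama3.2-1B-instruct` (a hypothetical path); the project's actual loading code may differ, e.g. it may read Meta's native checkpoint format directly:

```python
# Minimal multi-turn + streaming chat loop (sketch, not the repo's actual code).
from threading import Thread

from transformers import AutoModelForCausalLM, AutoTokenizer, TextIteratorStreamer

MODEL_DIR = "./llama3.2-1B-instruct"  # hypothetical local model folder

tokenizer = AutoTokenizer.from_pretrained(MODEL_DIR)
model = AutoModelForCausalLM.from_pretrained(MODEL_DIR, torch_dtype="auto")

history = []  # multi-turn context: list of {"role": ..., "content": ...} dicts

while True:
    user_input = input("You: ").strip()
    if user_input.lower() in {"exit", "quit"}:
        break
    history.append({"role": "user", "content": user_input})

    # Feed the whole conversation so the model can use earlier turns as context.
    input_ids = tokenizer.apply_chat_template(
        history, add_generation_prompt=True, return_tensors="pt"
    )

    # Generate in a background thread; the streamer yields text chunks as soon
    # as the model produces them, giving the real-time "typing" effect.
    streamer = TextIteratorStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
    Thread(
        target=model.generate,
        kwargs={"input_ids": input_ids, "streamer": streamer, "max_new_tokens": 512},
    ).start()

    print("Bot: ", end="", flush=True)
    reply = ""
    for chunk in streamer:
        print(chunk, end="", flush=True)
        reply += chunk
    print()
    history.append({"role": "assistant", "content": reply})
```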
- Python: 3.10+ recommended
- Llama 3.2 1B Model Weights & Tokenizer: You need the official Llama 3.2 1B model weights and associated tokenizer files. Request access and download them from Meta's official channels: https://llama.meta.com/llama-downloads/ (Follow the instructions on the site).
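
If you have been granted access to the gated Llama 3.2 repository on the Hugging Face Hub, a sketch like the following can fetch the same files programmatically (an alternative assumption, not the official route above; it requires authentication, e.g. `huggingface-cli login` or the `HF_TOKEN` environment variable):

```python
# Optional alternative download via the Hugging Face Hub (sketch).
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="meta-llama/Llama-3.2-1B-Instruct",  # gated repo; access must be granted
    local_dir="./llama3.2-1B-instruct",          # matches the folder used below
)
```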
This section covers everything from setting up the environment to running the chatbot.
1. **Clone the Repository:**

   ```bash
   git clone https://github.com/bok3948/Hi_LLM_Chat.git
   cd Hi_LLM_Chat
   ```
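   If the repository ships a `requirements.txt` (see the directory structure in step 3), install the Python dependencies first, e.g. `pip install -r requirements.txt`.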
2. **Download Model & Tokenizer:**

   - Obtain the Llama 3.2 1B model weights and tokenizer (the Instruct version is recommended for chat) from Meta's official channels: https://llama.meta.com/llama-downloads/
3. **Create Directory & Place Model Files:**

   - Create a directory inside the cloned repository for the model files (e.g., a folder named `llama3.2-1B-instruct`).
   - Place all the downloaded model files (weights, tokenizer, params, etc.) into the directory you just created.
   - Example directory structure: after placing the files, your project folder should resemble this:

   ```
   Hi_LLM_Chat/
   │
   ├── llama3.2-1B-instruct/      <-- Directory for model files
   │   │
   │   ├── consolidated.00.pth    <-- Llama 3.2 weight file(s)
   │   ├── (or *.safetensors)     <-- Alternative weight file format
   │   ├── (or *.gguf)            <-- GGUF model file (if using llama.cpp)
   │   │
   │   ├── tokenizer.model        <-- Llama 3.2 tokenizer file
   │   ├── params.json            <-- Model parameters file
   │   └── ...                    <-- Any other files included with the download
   │
   ├── main.py                    <-- Main chatbot script
   ├── requirements.txt           <-- List of required Python libraries
   ├── config.py                  <-- Optional configuration file (example name)
   ├── README.md                  <-- This README file
   └── .git/                      <-- Git directory (hidden by default)
   ```
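   Before launching, a quick sanity check like the following (a hypothetical helper, not part of the repo) can confirm the directory contains the files named in the tree above; adjust the expected list if you use `*.safetensors` or `*.gguf` weights instead:

   ```python
   # Hypothetical sanity check for the model directory (sketch).
   from pathlib import Path

   model_dir = Path("./llama3.2-1B-instruct")
   expected = ["tokenizer.model", "params.json"]
   missing = [name for name in expected if not (model_dir / name).exists()]
   if missing:
       print(f"Missing from {model_dir}: {', '.join(missing)}")
   else:
       print("Model directory looks complete.")
   ```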
4. **Run the Chatbot:**

   - Open your terminal and execute the main script:

   ```bash
   python main.py
   ```

   - Example specifying the model folder via a command-line argument:

   ```bash
   python main.py --model_folder_path ./llama3.2-1B-instruct
   ```
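
For reference, the `--model_folder_path` flag could be wired up with `argparse` roughly as follows (a sketch; the actual `main.py` may parse its arguments differently):

```python
# Rough sketch of the command-line argument handling (hypothetical).
import argparse

parser = argparse.ArgumentParser(description="Hi_LLM_Chat: local Llama 3.2 1B chatbot")
parser.add_argument(
    "--model_folder_path",
    default="./llama3.2-1B-instruct",
    help="directory containing the Llama 3.2 1B weights and tokenizer",
)
args = parser.parse_args()
print(f"Loading model from {args.model_folder_path} ...")
```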
