Personal conjured assistant template for fellow onmyōji 🔖
A self-contained, learning, fully offline virtual assistant.
- Start by running `init.sh` after you create your own `config.yml` file.
- Run `git submodule update --init --recursive` to pull down beanstalkd, then `cd beanstalkd` and `make`.
- Run `start.sh` once `ecosystem.config.js` is created.
Requires pm2, nvm, and rvm under a dedicated user account.
```
┌────────────────────────────────────────┐
│               PM2 Daemon               │
└─────┬──────────────────────────┬───────┘
      │                          │
      ▼                          ▼
 ┌─────────┐                  ┌─────┐
 │ core.rb ├──────────────────┤     │
 └────┬────┘                  │     │
      │                       │     │
      ▼                       │     │
    ┌───┐                     │  B  │
    │   │     ┌───────────┐   │  e  │
    │   ├────►│ module 00 ├───┤  a  │
    │   │     └───────────┘   │  n  │
    │ M │                     │  s  │
    │ o │     ┌───────────┐   │  t  │
    │ d ├────►│ module 01 ├───┤  a  │
    │ u │     └───────────┘   │  l  │
    │ l │                     │  k  │
    │ e │     ┌───────────┐   │  d  │
    │ s ├────►│ module 02 ├───┤     │
    │   │     └───────────┘   │     │
    │   │                     │     │
    │   │     ┌───────────┐   │     │
    │   ├────►│ module NN ├───┤     │
    └───┘     └───────────┘   └─────┘
```
Events from external resources (chat clients, databases, filesystems) are processed by the appropriate module or queued into beanstalkd as raw lines of Ruby code. Each module is responsible for routing its events; an event can be sent to another module or to `core.rb`, which spawns a new thread and runs `eval()` on the message body.
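For illustration, here is a minimal sketch of what that consume-and-eval loop might look like, assuming the `beaneater` gem and a beanstalkd instance on its default port; the tube name `events` is a placeholder, not necessarily what `core.rb` actually uses:

```ruby
require 'beaneater'

# Hypothetical consumer: reserve a job, eval its body in a new thread.
# Assumes beanstalkd on localhost:11300; the tube name is made up.
beanstalk = Beaneater.new('localhost:11300')
tube      = beanstalk.tubes['events']

loop do
  job = tube.reserve              # block until a message arrives
  Thread.new(job.body) do |code|
    eval(code)                    # execute the raw line of Ruby
  end
  job.delete                      # acknowledge; eval continues in its thread
end
```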
Every directory under `modules` with a valid `wrapper.sh` file will automatically be detected by `core.rb` and sent to PM2 for startup and persistence.
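A rough sketch of what that discovery pass could look like (the glob pattern and the `pm2` invocation are assumptions, not the actual `core.rb` code):

```ruby
# Hypothetical discovery pass: every modules/<name>/wrapper.sh gets
# registered with PM2 under its directory name.
Dir.glob('modules/*/wrapper.sh').each do |wrapper|
  name = File.basename(File.dirname(wrapper))
  system('pm2', 'start', wrapper, '--name', name, '--interpreter', 'bash')
end
```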
- Model: Llama 3.1 8B Instruct
- Quantization: Q5_K_M
  - `llama_model_quantize_internal: model size = 30633.02 MB`
  - `llama_model_quantize_internal: quant size = 5459.93 MB`
- Context size: 1200
"Context Size" = defines the maximum sequence length the model can process during inference or training. The context size determines how much text the model can "see" at once when generating predictions or understanding the input.
Q4_K_S, Q4_K_M, Q4_K_L
In 4-bit quantization, each parameter now requires only 0.5 bytes. For a 70 billion parameter model, the memory footprint becomes:
Memory for model weights:
70B params × 0.5 bytes/param = 35 GB of VRAM
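The same arithmetic as a quick Ruby sanity check (weights only; the KV cache and activations add overhead on top):

```ruby
# Weight-only memory estimate: params (in billions) * bytes per param = GB.
def weight_gb(params_billions, bytes_per_param)
  params_billions * bytes_per_param
end

puts weight_gb(70, 0.5)  # => 35.0 GB, the 70B example above
puts weight_gb(8,  0.5)  # => 4.0 GB for this project's 8B model
```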
**Coming Soon**
The Large Language Model (LLM) used in this project is currently Llama 3.1. (Meta has not published a detailed data mix for 3.1; the breakdown below is the one reported for the original LLaMA.)
- 67.0% CommonCrawl
- 15.0% C4
- 4.5% GitHub
- 4.5% Wikipedia
- 4.5% Books
- 2.5% ArXiv
- 2.0% StackExchange
***VERY MUCH A WORK IN PROGRESS***