Skip to content

Add MetriLLM skill: benchmark local LLM models#141

Closed
TheBlueHouse75 wants to merge 1 commit intoopenclaw:mainfrom
TheBlueHouse75:add-metrillm
Closed

Add MetriLLM skill: benchmark local LLM models#141
TheBlueHouse75 wants to merge 1 commit intoopenclaw:mainfrom
TheBlueHouse75:add-metrillm

Conversation

@TheBlueHouse75
Copy link

Summary

  • Adds MetriLLM skill to the ClawHub registry
  • MetriLLM is a CLI benchmark tool for local Ollama LLM models
  • Measures performance (tokens/second, time to first token, memory usage) and quality (reasoning, math, coding, instruction following, structured output, multilingual)
  • Computes a hardware fitness verdict: EXCELLENT / GOOD / MARGINAL / NOT RECOMMENDED

Skill details

Field Value
Name metrillm
Author MetriLLM
License MIT
Source https://github.com/MetriLLM/metrillm
Required tools Bash, Read
Dependencies Ollama

What users can do with this skill

  • Run full benchmarks (npx metrillm bench --model <name>) measuring both performance and quality
  • Run quick performance-only benchmarks (--perf-only)
  • View previous results stored in ~/.metrillm/results/
  • Share results to the public leaderboard (--share)

Test plan

🤖 Generated with Claude Code

MetriLLM is a CLI benchmark tool for local Ollama LLM models.
It measures performance (tok/s, TTFT, memory) and quality
(reasoning, math, coding, instruction following, structured output,
multilingual), then computes a hardware fitness verdict.

Repository: https://github.com/MetriLLM/metrillm

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@openclaw-barnacle
Copy link

Thanks for the pull request! This repository is read-only and is automatically synced from https://clawhub.ai, so we can’t accept changes here. Please make updates on the website instead.

@openclaw-barnacle openclaw-barnacle bot closed this Mar 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant