Feature Request: POWER8 vs POWER9 performance comparison for llama.cpp

## Background\n\nThe llama-cpp-power8 project achieving 147 t/s on POWER8 is impressive! The AltiVec/VSX optimization work is clearly paying off.\n\n## Proposed Comparison\n\nWould be valuable to see a **POWER8 vs POWER9 vs POWER10** performance comparison:\n\n| System | Tokens/sec | Multiplier | Notes |\n|--------|------------|------------|-------|\n| POWER8 S824 | 147 t/s | 1.0x | Baseline |\n| POWER9 AC922 | ??? | ?x | With VSX3 |\n| POWER10 E1080 | ??? | ?x | With VSX4 |\n\n## Why This Matters\n\n1. **Quantifies generational improvements** - How much does each POWER gen gain?\n2. **Validates optimization strategy** - Are VSX improvements linear?\n3. **Guides hardware recommendations** - Best price/performance for LLM inference\n\n## Test Setup Suggestion\n\n- Same model (TinyLlama 1.1B Q4_K)\n- Same batch size\n- Same memory configuration\n- Compare: stock llama.cpp vs POWER-optimized\n\nThis data would be valuable for the retro-computing + AI community! ??

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature Request: POWER8 vs POWER9 performance comparison for llama.cpp #26

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Feature Request: POWER8 vs POWER9 performance comparison for llama.cpp #26

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions