
🚀 LLaMeSIMD - The Ultimate SIMD Intrinsic & Function Translation Benchmarking Suite


🔥 What is LLaMeSIMD?

LLaMeSIMD is the world's first benchmarking suite designed to evaluate how well large language models (LLMs) can translate between different SIMD (Single Instruction Multiple Data) instruction sets across various CPU architectures.

Think of it as a Rosetta Stone validator for SIMD intrinsics, powered by AI!


🌟 Key Features

  • Multi-Architecture Support:
    SSE4.2 (x86), NEON (ARM), VSX (PowerPC)

  • Dual Test Modes:

    • 1-to-1 Intrinsic Translation: "What's the NEON equivalent of _mm_add_ps?"
    • Full Function Translation: Convert complete SIMD functions between architectures (see the short example after this list)
  • Multi-Model Evaluation:
    Test local (Ollama), open (HuggingFace), and proprietary (OpenAI/Claude/DeepSeek) models

  • Scientific Metrics:

    • Levenshtein similarity
    • AST structural similarity
    • Token overlap analysis
  • Beautiful Visualizations:
    Automatic generation of comparison charts and CSV reports
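
As a concrete illustration of the two test modes, here is a hand-written SSE-to-NEON pair of the kind the suite asks models to produce (illustrative only; not taken from the suite's dataset). The 1-to-1 mode asks only for the intrinsic mapping (e.g. _mm_add_ps -> vaddq_f32), while the full-function mode asks for the whole body:

/* SSE (x86) source: add two float arrays four lanes at a time */
#include <immintrin.h>

void add_arrays_sse(const float *a, const float *b, float *out, int n) {
    for (int i = 0; i + 4 <= n; i += 4) {
        __m128 va = _mm_loadu_ps(a + i);
        __m128 vb = _mm_loadu_ps(b + i);
        _mm_storeu_ps(out + i, _mm_add_ps(va, vb));
    }
}

/* Expected NEON (ARM) translation of the same function
   (each half compiles only on its own architecture) */
#include <arm_neon.h>

void add_arrays_neon(const float *a, const float *b, float *out, int n) {
    for (int i = 0; i + 4 <= n; i += 4) {
        float32x4_t va = vld1q_f32(a + i);
        float32x4_t vb = vld1q_f32(b + i);
        vst1q_f32(out + i, vaddq_f32(va, vb));
    }
}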


🛠️ Installation & Setup

# Clone the repository
git clone https://github.com/VectorCamp/LLaMeSIMD.git
cd LLaMeSIMD

# Create and activate virtual environment
python -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Set up environment variables: copy the example file, then edit it with
# your API keys and preferred models (listed as comma-separated values)
cp .env.example .env
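
The exact variable names are defined in .env.example; as a purely hypothetical sketch of what the edited file might look like (placeholder names, not the repository's actual keys):

# Hypothetical .env contents -- check .env.example for the real variable names
OPENAI_API_KEY=sk-...
PREFERRED_MODELS=gpt-4o,llama3,deepseek-chat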

🏎️ Usage

1️⃣ Run Tests

# Default architectures
python run_suite.py --engines SSE4.2 NEON

# Or select specific architectures (minimum 2 required)
python run_suite.py --engines NEON VSX

2️⃣ Manually Clean the Produced Results

After running the tests, review and clean the generated results stored in the Suite-Results directory. This step ensures accuracy by removing any artifacts or irrelevant outputs before proceeding to evaluation.

3️⃣ Evaluate Results

python evaluate_results.py

📊 Sample Output

After evaluation, you'll get:

  • Interactive Plots:
    • Weighted score comparisons across models
    • Architecture-specific performance breakdowns
  • CSV Reports:
    • Detailed metrics for each test case

🧠 Why This Matters

SIMD optimization is crucial for:

  • High-performance computing
  • Game development
  • Scientific simulations
  • Computer vision
  • Cryptography

LLaMeSIMD gives researchers a way to benchmark how well current models handle these translation tasks.

🏆 Benchmarking Methodology

  • Dataset: Carefully curated intrinsic and function pairs (with significant help from our previously created tool, simd.info)
  • Metrics:
    • Levenshtein Similarity: Character-level accuracy
    • AST Similarity: Structural correctness
    • Token Overlap: Semantic similarity
  • Weighted Scoring: 50% Levenshtein + 30% AST + 20% Token
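
A minimal sketch of that composite score, assuming each metric has already been normalized to the range [0, 1] (the exact normalization used by evaluate_results.py is not reproduced here):

/* Composite score using the weights listed above; all three inputs
   are assumed to be similarities normalized to [0, 1]. */
double weighted_score(double levenshtein_sim, double ast_sim, double token_sim) {
    return 0.5 * levenshtein_sim + 0.3 * ast_sim + 0.2 * token_sim;
}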

🌐 Roadmap

  • Add AVX2 support
  • Add AVX-512 support
  • Add a pass@1 compilation metric

📜 License

BSD 2-Clause — Because performance optimization should be accessible to all!

✉️ Contact

Happy SIMD-ing! May your vectors always be aligned and your pipelines full! 🚀
