Open Source AI
(40 articles)GLM-5.1 Hits 95% of Claude's Coding Score, Open Source
Zhipu AI's GLM-5.1 scores 94.6% of Claude Opus 4.6's coding performance in testing. Built on GLM-5's open-source SWE-bench record of 77.8%, here's what this...
DGX Spark vs Mac Studio M3 Ultra: $10K AI Showdown
Both cost $10K. Both run Qwen3.5 397B locally. But a dual DGX Spark setup and a Mac Studio M3 Ultra 256GB deliver wildly different experiences — here's who...
OpenAI Open-Sources 5 Teen Safety Rules for AI Apps
OpenAI releases gpt-oss-safeguard, a free open-source toolkit with prompt-based teen safety policies covering five risk categories. Here's what it means for...
AI Benchmarks Are Broken — This Book Explains Why
A new book by Moritz Hardt argues that benchmark rankings — not scores — are what actually matter. We tested his thesis against every major 2026 AI benchmark.
Krasis vs llama.cpp: Is 10x Faster LLM Inference Real?
Krasis LLM Runtime claims dramatically faster inference than llama.cpp for large MoE models on a single NVIDIA GPU. We break down the real numbers, the...
A $500 GPU Just Beat Claude Sonnet at Coding Tasks
ATLAS, a source-available AI system built by a Virginia Tech student, scores 74.6% on LiveCodeBench using a single $500 consumer GPU — outperforming Claude...
Clarity-OMR vs Audiveris: 5 OMR Accuracy Tests
A deep-dive comparison of Clarity-OMR's machine learning approach against Audiveris's traditional computer vision for optical music recognition — with real...
ROCm 7 vs Vulkan on Mi50: 4-Model Benchmark Results
New benchmarks pit ROCm 7 nightly against Vulkan on an AMD Mi50 32GB running llama.cpp. Vulkan wins short-context dense inference, but ROCm dominates...
CRYSTAL Benchmark Exposes How AI Models Fake Reasoning
A new benchmark tested 20 multimodal AI models and found 19 of them cherry-pick reasoning steps while skipping actual thinking. The gap between accuracy and...
OpenAI Buys Astral: 5 Things Python Devs Must Know
OpenAI is acquiring Astral, the company behind uv and Ruff, to supercharge Codex. Here's what it means for the Python ecosystem, open source, and the AI coding...
Google Backs $12.5M Open Source Security Push with AI
Google, Microsoft, OpenAI, and Anthropic are pooling $12.5 million to secure open source software — and Google's AI tools Big Sleep and CodeMender are already...
6 Best Uncensored GGUF Models to Run Locally in 2026
The Qwen3.5-9B uncensored GGUF scene just got interesting. We ranked the top distilled, uncensored models you can actually run on consumer hardware — no cloud,...