September 24, 2025Product

Introducing, Celer 2.5

Announcing the first reasoning models in the SAGE series, combining extreme efficiency with advanced problem-solving capabilities in 3B, 8B, and 14B variants.

Introduction

We are introducing SAGE 2.5 Celer, a new series of hybrid models designed to bridge the gap between lightweight deployment and heavy reasoning tasks. Available in 3B, 8B, and 14B parameter sizes, these models feature a unique "Thinking" mode that allows them to allocate more compute time to complex queries before responding.

This release marks a significant step forward in making advanced tool-calling and mathematical reasoning accessible on consumer-grade hardware and edge devices.

Comparison: 14B Models

BenchmarksQwen2.5 14BSAGE 2.5 Celer (Std)SAGE 2.5 Celer (Think)Deepseek R1 14B
General77.87%86.67%88.27%81.00%
MMLU-67.13%70.91%76.47%69.20%
Math (GSM8K)94.31%94.31%95.68%93.33%
MATH79.20%73.49%87.37%89.78%
Multi-lingual62.29%72.50%73.43%63.86%

Comparison: Small Models (3B)

StatisticLlama 3 3BQwen Small (3B)SAGE 3B (Std)SAGE 3B (Reason)
Non-Reasoning58.67%55.42%67.20%74.90%
MMLU33.76%31.40%40.10%50.85%
MMLU-Pro74.25%70.90%80.10%86.75%
Math45.84%40.25%47.25%55.80%

Tool Calling Performance (BFCL)

CategorySAGE 3BSAGE 8BLlama 3B
Simple94.5%96.8%Not Supported
Parallel76.0%88.2%Not Supported
Multiple92.0%95.0%Not Supported
Hybrid ModelsReasoningTool Calling2025
SAGEA AI Research