edge.train(tokenizer=bpe-32k, corpus=master_blend_stage2)
↳ casanova-2.0(70B) · geralt-2.0(11B) · anakin-8b
↳ maestro(0.3B, task.classifier) · functionaxe(0.3B)
↳ quantization=Q4_K_M.gguf · context=32K
edge.inference(hardware=canadian.iron, cloud.deps=0)
↳ casanova: 8 tok/s · geralt: 25 tok/s · anakin: 44 tok/s
↳ openai.compatible.api · streaming · tool.calling
↳ custom.tokenizer · custom.gguf.metadata · proprietary
edge.train(tokenizer=bpe-32k, corpus=master_blend_stage2)
↳ casanova-2.0(70B) · geralt-2.0(11B) · anakin-8b
↳ maestro(0.3B, task.classifier) · functionaxe(0.3B)
↳ quantization=Q4_K_M.gguf · context=32K

EDGECANADIAN IRON

FLAGSHIP1.19B PARAMS

CHINCHILLAV3 TRAINED

PURPOSE-BUILTNOT FINE-TUNED

EDGE

Canadian intelligence trained from scratch — custom tokenizer, custom GGUF, proprietary data. Not skins on open-source weights. Not fine-tuned costumes. Built from the ground up.

SCROLL

Purpose-Built. Not Fine-Tuned.

Every AXE model is trained from the ground up — custom tokenizer, custom GGUF metadata, proprietary training data. These aren't skins on open-source weights. They're Canadian intelligence.

Casanova 2.0

70B Parameters

Role: Flagship — complex reasoning, research, strategic analysis

Speed: 8 tok/s on consumer hardware (64GB M-series)

Context: 32K tokens

Quantization: Q4_K_M GGUF

Training: AXE proprietary + curated open data

Geralt 2.0

11B Parameters

Role: Balanced — code generation, document analysis, reasoning

Speed: 25 tok/s

Context: 16K tokens

Quantization: Q4_K_M GGUF

Training: Code-heavy corpus + AXE interactions

Anakin 8B

8B Parameters

Role: Fast — chat, tool calling, real-time inference

Speed: 44 tok/s

Context: 8K tokens

Default: Echo routing

Training: Optimized for tool use

Micro Models. Millisecond Decisions.

Maestro 0.3B

Task classifier. Routes requests to the right model in <50ms.

FunctionAXE 0.3B

Tool router. Dispatches function calls with precision.

AXE Embed

Nomic-based vector embeddings for Crown search.

Canadian Variants

Gemma3 and Qwen distillations running on edge nodes.

Every Interaction Makes the System Smarter.

↓

User Query

Real-world request

→

↓

Echo Inference

Stream response

→

↓

Logger

JSONL capture

→

↓

SFT

Training corpus

→

↓

Deploy

Updated model

A closed loop. Every query generates training data. Every training run improves the next response. Custody compounds.

25+

Interactions logged today

5K+

Tool-calling filtered dataset

GRPO

Reward shaping for alignment

Own Your Intelligence Layer.

Cloud AI

Models owned by OpenAI / Anthropic / Google

Training data: theirs. Improvements: theirs.

Rate limits. Usage caps. Price increases.

Your data trains their next model.

One API change breaks your product.

No offline capability.

Canadian AI

Models owned by you. Weights on your hardware.

Training data: yours. Improvements: yours.

No limits. No caps. No external pricing.

Your data improves YOUR models only.

You control the API. Forever.

Works perfectly offline.

Performance That Matters.

Model	MMLU	HumanEval	Tool Calling	Latency P95
Casanova 2.0	72.4%	68.2%	91.3%	1.2s
Geralt 2.0	64.1%	71.8%	88.7%	0.4s
Anakin 8B	58.3%	63.5%	85.2%	0.18s

Benchmarked on AXE's internal evaluation suite. Real workload, real hardware, real numbers.

What's Next

Month 2

First Canadian fine-tune. Qwen3-8B base + AXE proprietary data. Zero external model dependencies.

Month 3

GRPO reward shaping for tool-calling accuracy. Alignment without cloud compute.

Month 4

Replace Ollama with AXE inference kernel. Native performance. No middleware.

Month 6+

Rust hot-paths for production throughput. Edge model quantization. Hardware acceleration.

/ Contact · we read every inquiry

Talk to .

Demos, partnerships, government RFPs, technical questions. A person reads every form. You hear from someone — not a queue.