GeniusPro Lab

Research, train, and ship models

Our lab turns benchmarks and fine-tunes into the Simon lineup you call in production — on state-of-the-art GPUs and servers we operate.

ResearchTrainEvalShip

Read the news Explore models

From benchmark to simon-says-* slug · often via auto-update

Research

Benchmark frontier and domain models — find what works for business workloads.

Train

Fine-tune experts, orchestrators, and base models on lab hardware.

Eval

Run latency, quality, and safety harnesses before anything ships.

Ship

Release to Simon — often through auto-update slugs, no migration projects.

Our mission

Models that work for real business workloads

Frontier models are impressive in demos. Our lab exists to turn them into slugs your product can depend on — with domain discipline, predictable latency, and upgrade paths that do not require migration projects.

We benchmark upstream providers, fine-tune experts for professional roles, and run eval harnesses before anything ships to Simon. When a model clears the bar, it lands behind a stable name — simon-says-chat, simon-says-finance, gp-auto-* — so your integration stays the same while quality improves.

The same pipeline powers voice, vision, and agent products. Research is not a separate track from production; it is how we keep the platform current without breaking customer apps.

Research focus

What we investigate

Lab work runs in focused tracks — each with benchmarks, training runs, and shipping criteria tied to a product surface.

Domain experts

Fine-tuned Simon Says slugs for finance, legal, architecture, and other professional workloads — tuned for vocabulary, reasoning style, and tool use.

Profession pairs: regular + -pro on stronger upstream
Shared specialist prompts across slug tiers
Benchmarked against frontier baselines

Recent

Finance, legal, and architecture experts →

Browse experts →

Agents & orchestration

Multi-step agents, CAT workflows, Tiger Mode, and Cursor-class coding agents — routing, memory, and tool loops validated before they ship.

Orchestrators that pick upstream per request
Long-running Fleet workers on private nodes
Fast agent paths for IDE integrations

Recent

Fleet + Compute on Einstein hardware →

See Tiger Mode →

Voice & vision

Realtime speech, transcription, image generation, and vision understanding — benchmarked on lab GPUs and wired into Simon API routes.

Whisper STT and Gemini Live voice sessions
Image model fallback chains for reliability
Latency targets for conversational UX

Recent

Einstein voice lab: local STT + live models →

Explore Voice →

Eval & reliability

Quality, latency, safety, and regression harnesses run on every candidate model — nothing ships without passing gates and audit trails.

Slug-level version telemetry in production
Auto-update with rollback when needed
Hallucination and jailbreak spot checks

Recent

Ship once, stay current with auto-update →

Auto-update slugs →

Lab pipeline

Research

Benchmark frontier and domain models — find what works for business workloads.

Train

Fine-tune experts, orchestrators, and base models on lab hardware.

Eval

Run latency, quality, and safety harnesses before anything ships.

Ship

Release to Simon — often through auto-update slugs, no migration projects.

Research index

Explore by model family

Browse Simon slugs by capability — experts, orchestrators, voice, vision, and lab hardware — so you can find the right surface fast.

Simon Says experts

Domain slugs for professional roles — finance, legal, architecture, and more. Each profession ships regular and -pro tiers.

simon-says-finance
simon-says-lawyer
simon-says-architect

Expert catalog

Orchestrators

Task routers like simon-says-chat classify prompts and pick the best upstream model per request.

simon-says-chat
simon-says-code
simon-says-research

Orchestrator slugs

Auto-update routes

Stable slug names that pick up better models behind the same API call — no quarterly migration projects.

gp-auto-chat
gp-auto-fast
simon-says-chat

How it works

Voice & realtime

Speech-to-text, text-to-speech, and live conversational models for Simon Voice and Einstein lab sessions.

Whisper STT
Gemini Live
Realtime API

Voice product

Vision & media

Image understanding, generation, and fallback chains when upstream models rate-limit or fail.

Vision slugs
Image fallback
SAM-class tools

Vision product

Fleet & compute

Private GPU nodes and distributed agent workers connected to the same Simon API your app already calls.

Einstein nodes
Fleet dispatch
Hybrid burst

Lab hardware

Full slug list and tiers at geniuspro.io/models

Infrastructure

Lab hardware we operate

Training and eval run on GPUs we control — Einstein-class nodes for sensitive workloads, shared lab capacity for benchmarks, and hybrid routes into cloud providers when burst makes sense.

Explore Compute See Fleet

Einstein lab nodes

Dedicated GPU servers for voice experiments, fine-tunes, and private inference previews — not multi-tenant shared pools.

Hybrid with Simon API

Route heavy jobs to your hardware or ours through the same Simon API endpoints — one integration, flexible placement.

Fleet control plane

Register workers, dispatch long-running agent jobs, and monitor runs from a single dashboard tied to production API keys.

How we ship

Principles behind every release

Every slug that reaches production follows the same rules — stable integrations, measurable quality, and clear version history.

Stable names, moving quality

Customers integrate once. Auto-update slugs absorb upstream improvements while request bodies and telemetry stay consistent.

Business-first benchmarks

We score models on professional tasks — citations, tool use, latency under load — not just academic leaderboard spots.

No silent regressions

Every slug upgrade is versioned. Teams can audit which model served a request and roll back if a release misses the bar.

US hosting by default

Production inference and customer data stay on infrastructure we operate in the United States unless a contract says otherwise.

Shipping now

Latest from the lab

Model releases, product updates, and research notes — the same feed as our News hub, curated for lab readers.

All news →

Models

New Simon Says experts: finance, legal, architecture

Call a domain slug — simon-says-finance, simon-says-lawyer, and more — without wiring your own prompts.

Jun 8, 2026

Read article Models

Auto-update slugs: ship once, stay current

simon-says-chat and gp-auto-* pick up better models behind the same name. No migration projects.

Jun 5, 2026

Read article Product

White Label AI — your logo, our stack

Launch Ainslie-class experiences under your brand with workspaces, billing, and US hosting included.

Jun 2, 2026

Read article Platform

Fleet + Compute: GPU nodes meet the cloud API

Agent workers on Einstein-class hardware, connected to the same Simon API your app already calls.

May 28, 2026

Read article Research

Einstein voice lab: local STT and live models

Whisper on lab GPUs for low-latency transcription, paired with Gemini Live sessions for conversational speech on hardware we operate.

May 22, 2026

Read article Research

Inside the Simon slug benchmark harness

How we score candidate models on professional tasks, latency under load, and regression checks before a slug upgrade ships.

May 15, 2026

Read article Research

Fast Cursor agents on simon-says-fast

IDE-integrated agents need sub-second first tokens. We benchmarked routing paths that keep coding loops responsive in Cursor and Ainslie.

May 8, 2026

Read article

All news →

Build with us

Use what the lab ships

Every slug on this page is callable today through Simon API. Start with the model catalog, wire a single endpoint, and let auto-update keep you current.

Open developer docs

Explore models See all products Back to home