Product Roadmap

A look at what’s coming next for DualMind Lab. Features are organized by priority and estimated timeline.

This roadmap is subject to change based on community feedback and development priorities. Have a feature request? Open an issue on GitHub.

In progress

UI Revamp

Priority: High

Complete redesign of the user interface with modern aesthetics, improved accessibility, and better mobile responsiveness.

New design system with consistent components
Dark/light mode improvements
Mobile-first responsive layout
Status: ~80% complete, launching soon

ELO Rating System

Priority: High

Implement a proper ELO rating algorithm for model rankings instead of simple win-rate percentages.

Bayesian ELO with confidence intervals
Rating decay for inactive models
Historical rating charts
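
To illustrate how a rating-based ranking differs from a raw win rate, here is a minimal sketch of the classic ELO update plus one possible form of rating decay. It is not the Bayesian variant named above, and it is not DualMind Lab's implementation. The K-factor of 32, the 1500 baseline, the 90-day half-life, and all function names are assumptions chosen for illustration.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard ELO logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return the new (rating_a, rating_b) after one comparison.

    score_a is 1.0 if A won the vote, 0.0 if B won, 0.5 for a tie.
    k is an assumed K-factor; the roadmap does not specify one.
    """
    delta = k * (score_a - expected_score(rating_a, rating_b))
    return rating_a + delta, rating_b - delta


def decay_toward_baseline(rating: float, days_inactive: float,
                          baseline: float = 1500.0,
                          half_life_days: float = 90.0) -> float:
    """Hypothetical decay rule: the gap to the baseline halves every half_life_days."""
    return baseline + (rating - baseline) * 0.5 ** (days_inactive / half_life_days)
```

Unlike a win-rate percentage, the update depends on the opponent's rating: beating a much stronger model moves the rating more than beating a weaker one.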

Planned — Q2 2026

Additional AI Providers

Integrate OpenAI, Anthropic, Google Gemini, and Mistral as first-class providers alongside Groq and Bytez.

Provider	Models	Status
Groq	Llama, Mixtral	Live
Bytez	Various	Live
OpenAI	GPT-4o, GPT-4o-mini	Planned
Anthropic	Claude 3.5 Sonnet	Planned
Google	Gemini 2.0	Planned

Public API Keys

Allow developers to generate API keys for programmatic access without requiring Supabase Auth session tokens.

API key management dashboard
Rate limiting per key
Usage analytics
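
As a rough sketch of what key generation and per-key rate limiting could look like, the example below issues a random key and applies a token bucket to each key. The `dml` prefix, the choice to store only a SHA-256 digest, and the token-bucket approach are all assumptions; the roadmap does not say how keys or limits will be implemented.

```python
import hashlib
import secrets
import time


def generate_api_key(prefix: str = "dml"):
    """Return (plaintext_key, sha256_hex_digest).

    The assumed design shows the plaintext key to the user once and
    stores only the digest server-side.
    """
    key = f"{prefix}_{secrets.token_urlsafe(32)}"
    digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
    return key, digest


class TokenBucket:
    """One bucket per API key: up to `capacity` requests in a burst,
    refilled at `refill_per_second` tokens per second."""

    def __init__(self, capacity: int, refill_per_second: float, now: float = None):
        self.capacity = capacity
        self.refill_per_second = refill_per_second
        self.tokens = float(capacity)
        self.updated_at = time.monotonic() if now is None else now

    def allow(self, now: float = None) -> bool:
        """Consume one token if available; `now` can be injected for testing."""
        now = time.monotonic() if now is None else now
        elapsed = max(0.0, now - self.updated_at)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_second)
        self.updated_at = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False
```

A server would keep one bucket per key digest and reject the request when `allow()` returns `False`.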

Batch Comparisons

Run automated batch comparisons across model pairs with predefined prompt sets for systematic evaluation.
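
One way to picture a batch run is as every unordered model pair crossed with every prompt in a set. The sketch below builds that job list; the function name and the job dictionary shape are hypothetical, not part of any announced API.

```python
from itertools import combinations


def build_batch(models, prompts):
    """Return one comparison job per (unordered model pair, prompt)."""
    return [
        {"model_a": a, "model_b": b, "prompt": prompt}
        for a, b in combinations(models, 2)
        for prompt in prompts
    ]
```

For n models and p prompts this yields n(n-1)/2 × p jobs, so three models and two prompts produce six comparisons.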

Export & Analytics

Export comparison data, vote history, and model statistics in CSV/JSON formats for external analysis.
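
A minimal sketch of dual-format export, assuming vote records are flat dictionaries that share the same keys. The field names in the usage example are invented for illustration, and the real export schema may differ.

```python
import csv
import io
import json


def export_votes(votes, fmt):
    """Serialize a list of flat dicts as a "csv" or "json" string."""
    if fmt == "json":
        return json.dumps(votes, indent=2)
    if fmt == "csv":
        buffer = io.StringIO()
        # Column order follows the first record; assumes all records share keys.
        fieldnames = list(votes[0].keys()) if votes else []
        writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
        writer.writeheader()
        writer.writerows(votes)
        return buffer.getvalue()
    raise ValueError(f"unsupported format: {fmt!r}")
```

For example, `export_votes([{"thread_id": "t1", "winner": "model_a"}], "csv")` produces a header row followed by one data row.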

Planned — Q3 2026

Custom Evaluation Criteria

Define custom rubrics for model evaluation that go beyond simple preference voting, covering criteria such as accuracy, creativity, helpfulness, and safety.
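
A rubric of this kind could be reduced to a single number with a weighted average. The sketch below assumes each criterion is scored on a shared numeric scale and carries a non-negative weight; both the scale and the weights are illustrative assumptions.

```python
def weighted_score(scores, weights):
    """Weighted average of per-criterion scores.

    scores:  criterion name -> numeric score (e.g. 1 to 5)
    weights: criterion name -> non-negative weight
    """
    missing = set(weights) - set(scores)
    if missing:
        raise ValueError(f"missing scores for: {sorted(missing)}")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(scores[name] * weight for name, weight in weights.items()) / total_weight
```

With accuracy weighted twice as heavily as the other three criteria, a response scoring 5 on accuracy and 3 elsewhere averages to 3.8.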

Team Workspaces

Collaborative workspaces where teams can run comparisons, share results, and build consensus on model selection.

Webhook Integrations

Real-time notifications via webhooks when comparisons complete, models are added, or rankings change.
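
Webhook receivers usually need a way to confirm that a notification is genuine. The sketch below shows one common approach, an HMAC-SHA256 signature over the JSON body. The roadmap does not say whether DualMind Lab will sign payloads, so the event name, payload shape, and signing scheme here are all assumptions.

```python
import hashlib
import hmac
import json


def build_webhook(event, data, secret: bytes):
    """Return (body_bytes, hex_signature) for an outgoing notification."""
    body = json.dumps(
        {"event": event, "data": data},
        separators=(",", ":"),
        sort_keys=True,
    ).encode("utf-8")
    signature = hmac.new(secret, body, hashlib.sha256).hexdigest()
    return body, signature


def verify_webhook(body: bytes, signature: str, secret: bytes) -> bool:
    """Receiver side: recompute the signature and compare in constant time."""
    expected = hmac.new(secret, body, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signature)
```

The receiver verifies the raw body bytes before parsing them, so any change to the payload in transit invalidates the signature.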

Plugin System

Extensible plugin architecture for custom providers, evaluation metrics, and UI components.
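
As a hedged illustration of what an extensible provider registry might look like, the sketch below uses a decorator to register provider functions by name. The registry, the `register_provider` decorator, and the provider signature (prompt in, text out) are hypothetical; the actual plugin interface has not been published.

```python
from typing import Callable, Dict

# Assumed provider signature: takes a prompt, returns the model's reply.
ProviderFn = Callable[[str], str]

_PROVIDERS: Dict[str, ProviderFn] = {}


def register_provider(name: str):
    """Decorator that adds a provider function to the registry under `name`."""
    def decorator(fn: ProviderFn) -> ProviderFn:
        if name in _PROVIDERS:
            raise ValueError(f"provider already registered: {name}")
        _PROVIDERS[name] = fn
        return fn
    return decorator


def get_provider(name: str) -> ProviderFn:
    """Look up a registered provider; raises KeyError if unknown."""
    return _PROVIDERS[name]


@register_provider("echo")
def echo_provider(prompt: str) -> str:
    """Toy provider used only to demonstrate registration."""
    return f"echo: {prompt}"
```

The core application would call `get_provider(name)` without knowing which plugin supplied the function, which is what lets third parties add providers without changing core code.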

Completed

v2.0 — Blind Comparison Arena

Arena battle mode with anonymous model responses
Random, topper, and manual model selection
Real-time SSE streaming
Thread management with public sharing
Admin dashboard with full CRUD
Supabase Auth integration

v1.5 — Groq Provider Support

Added Groq as primary AI provider
Llama 3.3 70B and Mixtral 8x7B models
Model status monitoring
Provider API key rotation

v1.0 — Initial Release

Single chat mode
Basic thread management
User authentication
PostgreSQL database schema

Request a feature

We prioritize features based on community demand. If you’d like to see something on this roadmap, open a GitHub issue or contribute directly.

Feature Request

Suggest a new feature on GitHub

Contributing

Contribute code directly
