Product Roadmap

A look at what’s coming next for DualMind Lab. Features are organized by priority and estimated timeline.
This roadmap is subject to change based on community feedback and development priorities. Have a feature request? Open an issue on GitHub.

In progress

UI Revamp

Priority: High
Complete redesign of the user interface with modern aesthetics, improved accessibility, and better mobile responsiveness.
  • New design system with consistent components
  • Dark/light mode improvements
  • Mobile-first responsive layout
  • Status: ~80% complete, launching soon

ELO Rating System

Priority: High
Implement an ELO-based rating algorithm for model rankings, replacing simple win-rate percentages.
  • Bayesian ELO with confidence intervals
  • Rating decay for inactive models
  • Historical rating charts
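
To illustrate how a rating update differs from a raw win-rate percentage, here is a minimal sketch of the classic Elo update in Python. It is not the Bayesian variant listed above and not necessarily how DualMind Lab will implement rankings; the function names, the K-factor of 32, and the starting rating of 1500 are assumptions chosen for illustration.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return updated (rating_a, rating_b) after one comparison.

    score_a is 1.0 if A won the vote, 0.0 if B won, 0.5 for a tie.
    k (assumed value) controls how far a single result moves the ratings.
    """
    exp_a = expected_score(rating_a, rating_b)
    delta = k * (score_a - exp_a)
    # Elo is zero-sum: whatever A gains, B loses.
    return rating_a + delta, rating_b - delta


if __name__ == "__main__":
    # Two models start at an assumed baseline of 1500; model A wins one vote.
    print(update_elo(1500.0, 1500.0, 1.0))
```

Unlike a win-rate percentage, the size of the adjustment depends on the opponent: beating a higher-rated model moves the rating more than beating a lower-rated one.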

Planned — Q2 2026

1. Additional AI Providers

Integrate OpenAI, Anthropic, Google Gemini, and Mistral as first-class providers alongside Groq and Bytez.
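| Provider  | Models              | Status  |
|-----------|---------------------|---------|
| Groq      | Llama, Mixtral      | Live    |
| Bytez     | Various             | Live    |
| OpenAI    | GPT-4o, GPT-4o-mini | Planned |
| Anthropic | Claude 3.5 Sonnet   | Planned |
| Google    | Gemini 2.0          | Planned |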

2. Public API Keys

Allow developers to generate API keys for programmatic access without requiring Supabase Auth session tokens.
  • API key management dashboard
  • Rate limiting per key
  • Usage analytics
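
The roadmap does not say how per-key rate limiting will work. As one common approach, the sketch below keeps a token bucket per API key in memory. The class name, the default capacity, and the refill rate are hypothetical, and a real deployment would likely need shared storage rather than a process-local dictionary.

```python
import time
from typing import Callable, Dict, Tuple


class KeyRateLimiter:
    """In-memory token bucket per API key (illustrative, not the planned design)."""

    def __init__(self, capacity: float = 5.0, refill_per_sec: float = 1.0,
                 clock: Callable[[], float] = time.monotonic):
        self.capacity = capacity              # maximum burst size per key (assumed)
        self.refill_per_sec = refill_per_sec  # sustained requests per second (assumed)
        self.clock = clock                    # injectable so the limiter can be tested
        self._buckets: Dict[str, Tuple[float, float]] = {}  # key -> (tokens, last_seen)

    def allow(self, api_key: str) -> bool:
        """Return True and consume a token if the key is under its limit."""
        now = self.clock()
        tokens, last = self._buckets.get(api_key, (self.capacity, now))
        # Refill in proportion to elapsed time, capped at the bucket capacity.
        tokens = min(self.capacity, tokens + (now - last) * self.refill_per_sec)
        allowed = tokens >= 1.0
        if allowed:
            tokens -= 1.0
        self._buckets[api_key] = (tokens, now)
        return allowed


if __name__ == "__main__":
    limiter = KeyRateLimiter(capacity=2, refill_per_sec=1.0)
    print([limiter.allow("key-123") for _ in range(3)])
```

Each key gets its own bucket, so one heavy client exhausting its quota does not affect requests made with other keys.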

3. Batch Comparisons

Run automated batch comparisons across model pairs with predefined prompt sets for systematic evaluation.

4. Export & Analytics

Export comparison data, vote history, and model statistics in CSV/JSON formats for external analysis.
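
As a rough idea of what a CSV/JSON export could look like, the sketch below serializes a list of vote records with the Python standard library. The field names (`comparison_id`, `model_a`, `model_b`, `winner`) are hypothetical placeholders, not DualMind Lab's actual schema.

```python
import csv
import io
import json
from typing import Dict, List

# Hypothetical column set; the real export schema is not published yet.
FIELDS = ["comparison_id", "model_a", "model_b", "winner"]


def export_votes(rows: List[Dict[str, str]], fmt: str = "json") -> str:
    """Serialize vote records to a JSON or CSV string."""
    if fmt == "json":
        return json.dumps(rows, indent=2)
    if fmt == "csv":
        buf = io.StringIO()
        writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
        writer.writeheader()
        writer.writerows(rows)
        return buf.getvalue()
    raise ValueError(f"unsupported format: {fmt!r}")


if __name__ == "__main__":
    sample = [{"comparison_id": "c1", "model_a": "llama",
               "model_b": "mixtral", "winner": "model_a"}]
    print(export_votes(sample, "csv"))
```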

Planned — Q3 2026

1. Custom Evaluation Criteria

Define custom rubrics for model evaluation beyond simple preference voting — accuracy, creativity, helpfulness, safety.

2. Team Workspaces

Collaborative workspaces where teams can run comparisons, share results, and build consensus on model selection.

3. Webhook Integrations

Real-time notifications via webhooks when comparisons complete, models are added, or rankings change.

4. Plugin System

Extensible plugin architecture for custom providers, evaluation metrics, and UI components.

Completed

  • Arena battle mode with anonymous model responses
  • Random, topper, and manual model selection
  • Real-time SSE streaming
  • Thread management with public sharing
  • Admin dashboard with full CRUD
  • Supabase Auth integration
  • Groq as primary AI provider
  • Llama 3.3 70B and Mixtral 8x7B models
  • Model status monitoring
  • Provider API key rotation
  • Single chat mode
  • Basic thread management
  • User authentication
  • PostgreSQL database schema

Request a feature

We prioritize features based on community demand. If you’d like to see something on this roadmap, open a GitHub issue or contribute directly.

  • Feature Request: Suggest a new feature on GitHub
  • Contributing: Contribute code directly