Product Roadmap
A look at what’s coming next for DualMind Lab. Features are organized by priority and estimated timeline.This roadmap is subject to change based on community feedback and development priorities. Have a feature request? Open an issue on GitHub.
In progress
UI Revamp
Priority: HighComplete redesign of the user interface with modern aesthetics, improved accessibility, and better mobile responsiveness.
- New design system with consistent components
- Dark/light mode improvements
- Mobile-first responsive layout
- Status: ~80% complete, launching soon
ELO Rating System
Priority: HighImplement a proper algorithm for model rankings instead of simple win-rate percentages.
- Bayesian ELO with confidence intervals
- Rating decay for inactive models
- Historical rating charts
Planned — Q2 2026
Additional AI Providers
Integrate OpenAI, Anthropic, Google Gemini, and Mistral as first-class providers alongside Groq and Bytez.
| Provider | Models | Status |
|---|---|---|
| Groq | Llama, Mixtral | Live |
| Bytez | Various | Live |
| OpenAI | GPT-4o, GPT-4o-mini | Planned |
| Anthropic | Claude 3.5 Sonnet | Planned |
| Gemini 2.0 | Planned |
Public API Keys
Allow developers to generate API keys for programmatic access without requiring Supabase Auth session tokens.
- API key management dashboard
- Rate limiting per key
- Usage analytics
Batch Comparisons
Run automated batch comparisons across model pairs with predefined prompt sets for systematic evaluation.
Planned — Q3 2026
Custom Evaluation Criteria
Define custom rubrics for model evaluation beyond simple preference voting — accuracy, creativity, helpfulness, safety.
Team Workspaces
Collaborative workspaces where teams can run comparisons, share results, and build consensus on model selection.
Webhook Integrations
Real-time notifications via webhooks when comparisons complete, models are added, or rankings change.
Completed
v2.0 — Blind Comparison Arena
v2.0 — Blind Comparison Arena
- Arena battle mode with anonymous model responses
- Random, topper, and manual model selection
- Real-time SSE streaming
- Thread management with public sharing
- Admin dashboard with full CRUD
- Supabase Auth integration
v1.5 — Groq Provider Support
v1.5 — Groq Provider Support
- Added Groq as primary AI provider
- Llama 3.3 70B and Mixtral 8x7B models
- Model status monitoring
- Provider API key rotation
v1.0 — Initial Release
v1.0 — Initial Release
- Single chat mode
- Basic thread management
- User authentication
- PostgreSQL database schema
Request a feature
Feature Request
Suggest a new feature on GitHub
Contributing
Contribute code directly