It would be really useful to have some sort of A/B testing chat windows.
I'm thinking of something similar to lmarena.ai or chathub.gg that would let users split-test different models, agents, or plugins using the same input.