One thing that I think needs improving is the number of clicks required to get to the audio output: https://www.loom.com/share/da959db09e36464ab94494a957d69f09
Basically, after generating text I right-click on the 3 dots, click on the play button, wait for the button to appear, click on it on the left side, wait for the audio to load, press play again, wait for the audio to play, press pause there... you can see it in the video.
Ideally: have an option to autoplay the audio (or at least an option to display a play button that plays the generated text).