Conversation mode / Hand free voice input | Voters

Conversation mode / Hand free voice input

planned

Marvellous Quail

Add the neccesary tools (with openAI whisper in both directions) to have a totally conversation mode, with no interruptions or pressing buttons. I think we are near to get it, but now you need elevenlabs or other methods to connect.

Ngoc Nguyen

Merged in a post:

Automatic audio TTS starts before the response has completed.

Tan Swallow

This would be very useful to reduce delay.

Ngoc Nguyen

Merged in a post:

Suggestion for faster text-to-speech.

Likely Grasshopper

Currently, the feature waits until the entire message is shared before reading it aloud. I propose implementing a system that breaks down the text into manageable chunks, allowing the text-to-speech to begin reading as soon as each chunk is available, similar to ChatWithGPT.
This enhancement could:
Improve user engagement through immediate auditory feedback
Increase accessibility for users who rely on auditory processing
Give TypingMind a competitive edge over other platforms
I believe this feature would greatly enhance the user experience on TypingMind. Thank you for considering my suggestion!

Tony Dinh

Merged in a post:

Additional options to manage voice control

Immediate Muskox

Some suggested additional options. The goal would be to reduce need to use mouse inputs to use voice inputs:

allow an additional option to terminate a voice input by saying a user definable key word e.g. "send"

Same but to allow a key word to clear message

Allow pressing a key ( e.g. hotkey) to turn on/off the voice control.

Press and hold a key for input (AKA push to talk mode)

Send message when the key is released option

allow option for switching off microphone when hearing TTS responses from typing mind

option for switching off voice input when switching to another tab/window

additional options for auto-send message after speaking (this doesn't work for me currently...could have option for threshold and/or length of silence)

===

These options would also allow a redesign of the current voice input control - the suggestion would be to have an options dialogue (which allows file upload) but have voice input done live - i.e. rather than separate dialogue box, a live input is indicated by the microphone icon turning red - which can be initiated (and stopped) by clicking on the icon, or by using the hotkey, or holding of the push to talk key.

Put all these together, this would allow a recreation of the ChatGPT mobile voice map - i.e. a hands free dialogue with a LLM. Once change to make this work better would be to start TTS of LLM response before the end of message.

Tony Dinh

marked this post as

planned

Ngoc Nguyen

Merged in a post:

automatically start and stop voice functionality

Yucca Reindeer

Any chance we can have a feature to automatically start and stop voice recording when user is done speaking? See requirements below.
1) A new setting is updated to allow for automatic voice recognition.
2) If this setting is enabled, system will recognize user voice and automatically start typing user input from speech and automatically submit to AI when user is done speaking
If this setting is enabled, system will be able to determine when user starts and stop speaking and automatically submit input to AI to have more of a fluid conversation with the AI.
The idea of the feature is to be able to talk to AI like you do in natural conversation so that you don't have to start/stop conversation or click "finish" button when done speaking.
Thank you,.

Ngoc Nguyen

Merged in a post:

Conversational Mode Activation Toggle

Symbolic Tapir

A toggle feature for activating and deactivating conversational mode would be beneficial. This would enable users to send voice inputs without continuously pressing the microphone button, similar to the feature OpenAI recently implemented.

Tony Dinh

marked this post as

under review