EASY AS PIE UPDATE: Support reasoning effort = NONE, low, medium, high for Gemini in accordance to new feature in Gemini Openai compatibility layer released Apr 27, 2025
Á
Ásgeir Thor
Unlike the Gemini API, the OpenAI API offers three levels of thinking control: "low", "medium", and "high", which behind the scenes we map to 1K, 8K, and 24K thinking token budgets.
If you want to disable thinking, you can set the reasoning effort to "none".
K
Ken Harris
i believe this is only for the
Flash
model if im correct