I spotted something interesting in the Python API library code: https://github.c...

msp26 · 2025-04-17T23:05:02 1744931102

The API won't give you the "thinking" tokens, those are only visible on AI studio. Probably to try to stop distillation, very disappointing. I find reading the cot to be incredibly informative to identify failure modes.

> Hey Everyone,

> Moving forward, our team has made a decision to only show thoughts in Google AI Studio. Meaning, we no longer return thoughts via the Gemini API. Here is the updated doc to reflect that.

https://discuss.ai.google.dev/t/thoughts-are-missing-cot-not...

---

After I wrote all of that I see that the API docs page looks different today and now says:

>Note that a summarized version of the thinking process is available through both the API and Google AI Studio.

https://ai.google.dev/gemini-api/docs/thinking

Maybe they just updated it? Or people aren't on the same page at Google idk

Previously it said

> Models with thinking capabilities are available in Google AI Studio and through the Gemini API. Note that the thinking process is visible within Google AI Studio but is not provided as part of the API output.

https://web.archive.org/web/20250409174840/https://ai.google...

phillypham · 2025-04-17T21:27:23 1744925243

They removed the docs and support for it https://github.com/googleapis/python-genai/commit/af3b339a9d....

You can see the thoughts in AI Studio UI as per https://ai.google.dev/gemini-api/docs/thinking#debugging-and....

lemming · 2025-04-17T22:03:52 1744927432

I maintain an alternative client which I build from the API definitions at https://github.com/googleapis/googleapis, which according to https://github.com/googleapis/python-genai/issues/345 should be the right place. But neither the AI Studio nor the Vertex definitions even have ThinkingConfig yet - very frustrating. In general it's amazing how much API munging is required to get a working client from the public API definitions.

Deathmax · 2025-04-17T22:26:22 1744928782

It is gated behind the GOOGLE_INTERNAL visibility flag, which only internal Google projects and Cursor have at the moment as far as I know.

qwertox · 2025-04-17T22:10:21 1744927821

In AI Studio the flash moddels has two toggles: Enable thinking and Set thinking budget. If thinking budget is enabled, you can set tue max number of tokens it can use to think, else it's Auto.