We added it to the docs. The downside of the OpenAI-compat endpoint is that we have to design the API twice: once for our own API, and then again through the OpenAI-compat layer. That sometimes slows down shipping certain features, especially where the two diverge at all.
BTW, I have noticed that when tested outside GCP, the OpenAI-compat endpoint has significantly lower latency for most requests than the genai library. Vertex AI is better than both.
> If you want to disable thinking, you can set the reasoning effort to "none".
For other APIs, you can set the thinking token budget to 0, which also works.
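A minimal sketch of the two ways to disable thinking described above. To keep it self-contained (no API key or network call), the requests are shown as plain payload dicts rather than live client calls; the model name is a placeholder, and the parameter names (`reasoning_effort`, `thinking_budget`) should be double-checked against the docs for your endpoint version.

```python
# Two ways to disable thinking, shown as request payloads only.
# No network call is made; model name and field names are assumptions
# to illustrate the shape of each request.

# 1) OpenAI-compat endpoint: set reasoning effort to "none".
openai_compat_request = {
    "model": "gemini-2.5-flash",  # placeholder model name
    "messages": [{"role": "user", "content": "Hello"}],
    "reasoning_effort": "none",  # disables thinking via the compat layer
}

# 2) Native API (e.g. the genai library): set the thinking
#    token budget to 0, which also turns thinking off.
native_request = {
    "model": "gemini-2.5-flash",
    "contents": "Hello",
    "config": {"thinking_config": {"thinking_budget": 0}},
}

print(openai_compat_request["reasoning_effort"])
print(native_request["config"]["thinking_config"]["thinking_budget"])
```

Either form should produce a response with no thinking tokens; which one applies depends on whether you go through the compat layer or the native API.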