The gateway is integrated directly into our chat (https://glama.ai/chat). So you can use most of the things that you are used to having with Claude. And if anything is missing, just let me know and I will prioritize it. If you check our Discord, I have a decent track record of being receptive to feedback and quickly turning around features.
Long term, Glama's focus is predominantly on MCPs, but chat, the gateway, and LLM routing are integral to the greater vision.
I would love feedback if you're going to give it a try: frank@glama.ai
The issue isn't API limits, but web UI limits. We can always get around the web interface's limits by using the Claude API directly, but then you need to have some other interface...
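For context, "using the Claude API directly" looks roughly like the minimal sketch below, using Anthropic's Python SDK (the model slug matches the one linked further down in this thread). The point stands: you still have to bring your own interface around it.

```python
import anthropic

# The client reads ANTHROPIC_API_KEY from the environment by default.
client = anthropic.Anthropic()

message = client.messages.create(
    model="claude-3-7-sonnet-20250219",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Review this module for me..."}],
)
print(message.content[0].text)
```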
The API still has limits. Even if you are on the highest tier, you will quickly run into those limits when using coding assistants.
The value proposition of Glama is that it combines UI and API.
While everyone else focuses on one or the other, I've been splitting my time equally between the two.
Glama's UI would not win against Anthropic's if we compared them by feature count. However, the components I developed were created with craft and love.
You have access to:
* Model switching between OpenAI, Anthropic, and other providers
* Side-by-side conversations
* Full-text search of all your conversations
* LaTeX, Mermaid, and rich-text editing integration
Ok, but that's not the issue the parent was raising. I've never hit API limits, but, like the original comment said, I too constantly hit the web interface limits, particularly when discussing relatively large modules.
Your chat idea is a little similar to Abacus AI. I wish you had a similarly affordable monthly plan for chat only, but your UI seems much better. I may give it a try!
Who is glama.ai, though? I could not find company info on the site, and the Frank name on the blog posts seems to be an alias for Popeye the sailor. Am I missing something there? How can a user vet the company?
As another commenter in this thread said, we are just a 'frontend wrapper' around other people's services. Therefore, it is not particularly difficult to add models that are already supported by other providers.
The benefit of using our wrapper is that you get a single API key and one bill for all your AI usage, and you don't need to hack together your own logic for routing requests between different providers, handling failovers, keeping track of costs, or worrying about what happens if a provider goes down.
The market at the moment is hugely fragmented, with many unstable providers, constantly shifting prices, and so on. The benefit of a router is that you don't need to worry about any of that.
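To make the "hack together your own logic" point concrete, here is a minimal sketch of the failover routing you would otherwise maintain yourself. The provider endpoints and model names are made-up placeholders, not real services:

```python
from openai import OpenAI

# Illustrative placeholders; any OpenAI-compatible providers would do.
PROVIDERS = [
    {"base_url": "https://provider-a.example/v1", "api_key": "KEY_A", "model": "model-a"},
    {"base_url": "https://provider-b.example/v1", "api_key": "KEY_B", "model": "model-b"},
]

def complete_with_failover(prompt: str) -> str:
    """Try each provider in order; fall through to the next on any error."""
    last_error = None
    for provider in PROVIDERS:
        client = OpenAI(base_url=provider["base_url"], api_key=provider["api_key"])
        try:
            response = client.chat.completions.create(
                model=provider["model"],
                messages=[{"role": "user", "content": prompt}],
            )
            return response.choices[0].message.content
        except Exception as error:  # real code would also track costs, retries, outages
            last_error = error
    raise RuntimeError("all providers failed") from last_error
```

A router collapses all of this into one endpoint, one key, and one bill.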
Scaling infrastructure to handle billions of tokens is no joke.
I believe they are approaching 1 trillion tokens per week.
Glama is way smaller. We only recently crossed 10bn tokens per day.
However, I have invested a lot more into the UX/UI of the chat itself. While OpenRouter is entirely focused on the API gateway (which is working for them), I am going for a hybrid approach.
The market is big enough for both projects to co-exist.
We currently serve ~10bn tokens per day (across all models). OpenAI-compatible API. No rate limits. Built-in logging and tracing.
I work with LLMs every day, so I am always on top of adding models. Claude 3.7 Sonnet is also already available:
https://glama.ai/models/claude-3-7-sonnet-20250219
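If you want to try the gateway with the standard OpenAI SDK, a minimal sketch looks like the following. The base URL below is an illustrative assumption, not a documented endpoint; check glama.ai for the current one:

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://glama.ai/api/gateway/openai/v1",  # assumed endpoint; verify in the docs
    api_key="YOUR_GLAMA_API_KEY",
)

response = client.chat.completions.create(
    model="claude-3-7-sonnet-20250219",  # the model linked above
    messages=[{"role": "user", "content": "Hello from the gateway"}],
)
print(response.choices[0].message.content)
```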