IMO, other than the Microsoft IP issue, I think the biggest thing that has shifted since this acquisition was first in the works is that Claude Code has absolutely exploded. Forking an IDE and all the expense that comes with that feels like a waste of effort, considering the number of free/open source CLI agentic tools that are out there.
Let's review the current state of things:
- Terminal CLI agents are several orders of magnitude less $$$ to develop than forking an entire IDE.
- CC is dead simple to onboard (use whatever IDE you're using now, with a simple extension for some UX improvements).
- Anthropic is free to aggressively undercut their own API margins (and middlemen like Cursor) in exchange for more predictable subscription revenue + training data access.
What does Cursor/Windsurf offer over VS Code + CC?
- Tab completion model (Cursor's remaining moat)
- Some UI niceties like "add selection to chat", and etc.
Personally I think this is a harbinger of where things are going. Cursor was fastest to $900M ARR and IMO will be fastest back down again.
Agreed on everything. Just to add: not only is Anthropic offering CC at like a 500% loss, they also restricted Sonnet/Opus 4 access for Windsurf and jacked up the price of their enterprise deal with Cursor. The increase was so big that it forced Cursor to make that disastrous downgrade to their plans.
I think the only way Cursor and other UX wrappers still win is if on-device models, or at least open source models, catch up in the next 2 years. Then I can see a big push for UX if models are truly a commodity. But as long as Claude is much better, then yes, Anthropic holds all the cards. (And they don't have a bigger company to have a civil war with, like OpenAI does.)
The way I am doing the math with my Max subscription and assuming DeepSeek API prices, it is still 5x cheaper. So either DeepSeek is losing money (unlikely) or Anthropic is losing lots of money (more likely). Grok kinda confirms my suspicions: assuming DeepSeek prices, I've probably burned north of $100 of Grok compute, and I didn't pay Grok or Twitter a single cent. $100 is a lot of loss for a single user.
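For anyone who wants to redo this math with their own numbers, here's the shape of the estimate as a minimal sketch; every price and token count below is a placeholder assumption, not an actual rate card:

    # Back-of-envelope: flat subscription vs pay-as-you-go API.
    # All numbers are assumptions - substitute current rate cards and your usage.
    def api_cost(in_mtok, out_mtok, price_in, price_out):
        # Monthly cost in dollars; prices are $/million tokens.
        return in_mtok * price_in + out_mtok * price_out

    in_mtok, out_mtok = 300, 30   # assumed monthly usage, in millions of tokens
    deepseek = api_cost(in_mtok, out_mtok, 0.27, 1.10)   # assumed DeepSeek rates
    opus = api_cost(in_mtok, out_mtok, 15.0, 75.0)       # assumed Opus list rates
    print(f"DeepSeek-priced: ${deepseek:.0f}/mo, Opus-priced: ${opus:.0f}/mo, flat sub: $200/mo")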
Claude API pricing has significant margin baked in. I think it's safe to assume that Anthropic is getting ~80% margin on their API and selling Claude Code for less than that.
To me, Claude usually feels like a bumbling idiot. But in extremely rare cases it feels like a sentient superintelligence. I facetiously assumed that in those cases it ran on the correct RNG seed.
I'm also curious about this. Claude Code feels very expensive to me, but at the same time I don't have much perspective (nothing to compare it to, really, other than Codex or other agent editors I guess. And CC is way better so likely worth the extra money anyway)
Pretty easy to hit $100 an hour using Opus on API credits. The model providers are heavily subsidized, the datacenters appear to be too. If you look at the Coreweave stuff and the private datacenters it starts looking like the telecom bubble. Even Meta is looking to finance datacenter expansion - https://www.reuters.com/business/meta-seeks-29-billion-priva...
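Rough arithmetic behind that figure (a sketch assuming Opus list prices of ~$15/$75 per million input/output tokens, which may change):

    # Tokens needed to burn $100/hour at assumed Opus list prices.
    PRICE_IN, PRICE_OUT = 15.0, 75.0   # $/million tokens (assumed)
    RATIO = 10                         # agent loops are input-heavy: assume ~10:1 in:out
    bundle_cost = RATIO * PRICE_IN + PRICE_OUT   # $ per (10M in + 1M out) bundle
    bundles = 100.0 / bundle_cost
    print(f"~{bundles*RATIO:.1f}M input + {bundles:.1f}M output tokens per hour")
    # ~4.4M input tokens/hour - easy for an agent re-reading big files every turn.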
The reason they are talking about building new nuclear power plants in the US isn't just for a few training runs, it's for inference. At scale, the AI tools are going to be extremely expensive.
Also note China produces twice as much electricity as the United States. Software development and agent demand is going to be competitive across industries. You may think, oh I can just use a few hours of this a day and I got a week of work done (happens to me some days), but you are going to end up needing to match what your competitors are doing - not what you got comfortable with. This is the recurring trap of new technology (no capitalism required.)
There is a danger to independent developers becoming reliant on models. $100-$200 is a customer acquisition cost giveaway. The state of the art models probably will end up costing hourly what a human developer costs. There is also the speed and batching part. How willing is the developer to, for example, get 50% off but maybe wait twice as long for the output. Hopefully the good dev models end up only costing $1000-$2000 a month in a year. At least that will be more accessible.
Somewhere in the future these good models will run on device and just cost the price of your hardware. Will it be the AGI models? We will find out.
I wonder how this comment will age, will look back at it in 5 or 10 years.
Your excellent comments make me grateful that I am retired and just work part time on my own research and learning. I believe you when you say professional developers will need large inference compute budgets.
Probably because I am an old man, but I don’t personally vibe with full time AI assistant use, rather I will use the best models available for brief periods on specific problems.
Ironically, when I do use the best models available to me it is almost always to work on making weaker and smaller models running on Ollama more effective for my interests.
BTW, I have used neural network tech in production since 1985, and I am thrilled by the rate of progress, but worry about such externalities as energy use, environmental factors, and hurting the job market for many young people.
I've been around for a while (not quite retirement age) and this time is the closest to the new feeling I had using the internet and web in the early days. There are simultaneously infinite possibilities but also great uncertainty what pathways will be taken and how things will end up.
There are a lot of parts in the near term to dislike here, especially the consequences for privacy, adtech, energy use. I do have concerns that the greatest pitfalls in the short terms are being ignored while other uncertainties are being exaggerated. (I've been warning on deep learning model use for recommendation engines for years, and only a sliver of people seem to have picked up on that one, for example.)
On the other hand, if good enough models can run locally, humans can end up with a lot more autonomy and choice with their software and operating systems than they have today. The most powerful models might run on supercomputers and just be solving the really big science problems. There is a lot of fantastic software out there that does not improve by throwing infinite resources at it.
Another consideration: while the big tech firms are spending what will likely approach hundreds of billions of dollars in a race to "AGI", what matters to those same companies even more than winning is making sure the race isn't winner-takes-all. In that case, hopefully the outcome looks more like open source.
The SOTA models will always run in data centers, because they have 5x or more VRAM and 10-100x the compute allowance. Plus, they can make good use of scaling w/ batch inference which is a huge power savings, and which a single developer machine doesn’t make full use of.
Yes I do. It's just that for newcomers who are used to Cursor, where without careful prompting you lock yourself out of premium requests, it's not immediately obvious that CC is more dangerous and doesn't work the same way at all.
Of course it requires careful planning but the trap is easy to fall into.
In my experience, it's more about the tool's local indexing, plus aggressive limits on automatic uploads and model usage to keep them (and you) from overpaying.
People are recreating this with local toolchains now.
This is around what Cursor was costing me with Claude 4 Opus before I switched to Claude Code. Sonnet works fine for some things, but for some projects it spews unusable garbage unless the specification is so detailed that it's almost the implementation already.
This is where something like Perplexity's "memory" feature is really great. It treats other threads similarly to web resources.
I would love to understand better just how Perplexity is able to integrate up-to-date sources like other threads (and presumably recent web searches, but I haven't verified this; they could just be from the latest model) into its query responses. It feels seamless.
Have you been human before? Competition for resources and status is an instinctive trait.
It rears its head regardless of what sociopolitical environment you place us in.
You’re either competing to offer better products or services to customers…or you’re competing for your position in the breadline or politburo via black markets.
Even in the Soviet Union there were multiple design bureaus competing for designs of things like aircraft: Tupolev, Ilyushin, Sukhoi, Mikoyan-Gurevich (MiG), Yakovlev, Mil. There were quite a lot. Several (not all; they had their specialisations) provided designs when a requirement was raised. Not too different from the US, yet not capitalist.
Not really, it's possible with any market economy, even a hypothetical socialist one (that is, one where all market actors are worker-owned co-ops).
And, since there is no global super-state, the world economy is a market economy, so even if every state were a state-owned planned economy, North Korea style, this type of competition would still exist between states.
Consider also that VC funds often have pension funds as their limited partners. Workers have a claim to their pension, and thus a claim to the startup returns that the VC invests in.
So yeah it basically comes down to your definition of "worker-owned". What fraction of worker ownership is necessary? Do C-level execs count as workers? Can it be "worker-owned" if the "workers" are people working elsewhere?
Beyond the "worker-owned" terminology, why is this distinction supposed to matter exactly? Supposing there was an SV startup that was relatively generous with equity compensation, so over 50% of equity is owned by non-C-level employees. What would you expect to change, if anything, if that threshold was passed?
> Supposing there was an SV startup that was relatively generous with equity compensation, so over 50% of equity is owned by non-C-level employees. What would you expect to change, if anything, if that threshold was passed?
If the workers are majority owners, then they can, for example, fire a CEO that is leading the company in the wrong direction, or trying to cut their salaries, or anything like that.
>If the workers are majority owners, then they can, for example, fire a CEO that is leading the company in the wrong direction, or trying to cut their salaries, or anything like that.
Why wouldn't the board fire said CEO?
The most common reason to cut salaries is if the company is in dire financial straits regardless. Co-ops are more likely to cut salary and less likely to do layoffs.
Because the board doesn't understand the business at the level that employees do. Or because the board has different goals for the business than employees do. Or because the board is filled with friends of the CEO who let them do whatever.
Also, lots of companies reduce salaries or headcount if they feel they can get away with it. They don't need to be in dire financial straits; it's enough to have a few quarters of no or low growth and to want to show a positive change.
How specifically would you expect a typical SV corp's policies to change if employee equity passes from 49% to 51%?
Remember: if employees own 49% and can persuade just 2% of the other shareholders that a change will be positive for the business, they can make that change. So minority vs. majority ownership is not as significant as it may seem.
Can you give me an idea of how much interaction would be $50-$100 per day? Like are you pretty constantly in a back and forth with CC? And if you wouldn’t mind, any chance you can give me an idea of productivity gains pre/post LLM?
Yes, a lot of usage, I’d guess top 10% among my peers. I do 6-10hrs of constant iterating across mid-size codebases of 750k tokens. CC is set to use Opus by default, which further drives up costs.
Estimating productivity gains is a flame war I don’t want to start, but as a signal: if the CC Max plan goes up 10x in price, I’m still keeping my subscription.
I maintain top-tier subscription to every frontier service (~$1k/mo) and throughout the week spend multiple hours with each of Cursor, Amp, Augment, Windsurf, Codex CLI, Gemini CLI, but keep on defaulting to Claude Code.
I am curious what kind of development you’re doing and where your projects fall on the fast iteration<->correctness curve (no judgment). I’ve used CC Pro for a few weeks now and I will keep it, it’s fantastically useful for some things, but it has wasted more of my time than it saved when I’ve experimented with giving it harder tasks.
It's interesting to work with a number of people using various models and interaction modes in slightly different capacities. I can see where the huge productivity gains are and can feel them, but the same is true for the opposite. I'm pretty sure I lost a full day or more trying to track down a build error because it was relatively trivial for someone to ask CC or something to refactor a ton of files, which it seems to have done a bit too eagerly. On the other hand, that refactor would have been super tedious, so maybe worth it?
Mostly to save money (I am retired), I use Gemini APIs. I used to also use good open weight models on groq.com, but life is simpler just using Gemini.
Ultimately, my not using the best tools for my personal research projects has zero effect on the world but I am still very curious what elite developers with the best tools can accomplish, and what capability I am ‘leaving on the table.’
I’m a founder/CTO of an enterprise SaaS, and I code everything from data modeling, to algos, backend integrations, frontend architecture, UI widgets, etc. All in TypeScript, which is perfectly suited to LLMs because we can fit the types and repo map into context without loading all code.
As to "why": I've been coding for 25 years, and LLMs are the first technology that has a non-linear impact on my output. It's simultaneously moronic and jaw-dropping. I'm good at what I do (e.g., merged fixes into Node), and Claude/o3 regularly find material edge cases in code I was confident in. Then they add a test case (as per our style), write a fix, and update docs/examples within two minutes.
I love coding and the art&craft of software development. I’ve written millions of lines of revenue generating code, and made millions doing it. If someone forced me to stop using LLMs in my production process, I’d quit on the spot.
Why not self host: open source models are a generation behind SOTA. R1 is just not in the same league as the pro commercial models.
> If someone forced me to stop using LLMs in my production process, I’d quit on the spot.
Yup 100% agree. I’d rather try to convince them of the benefits than go back to what feels like an unnecessarily inefficient process of writing all code by hand again.
And I’ve got 25+ years of solid coding experience. Never going back.
> data modeling, to algos, backend integrations, frontend architecture, UI widgets, etc. All in TypeScript, which is perfectly suited to LLMs because we can fit the types and repo map into context without loading all code.
Which frameworks & libraries have you found work well in this (agentic) context? I feel much of the JS library landscape does not do enough to enforce an easily understood project structure that would "constrain" the architecture and force modularity. (I might have this bias from my many years of work with Rails, which is highly opinionated in this regard.)
When you say a generation behind, can you give a sense of what that means in practice for your current use? Slower or lower quality? Would it take more iterations to get what you want?
Context rot. My use case is iterating over a large codebase which quickly grows context. All LLMs degrade with larger context sizes, well below their published limits, but pro models degrade the least. R1 gets confused relatively quickly, despite their published numbers.
I think Fiction LiveBench captures some of those differences via a standardized benchmark that spreads interconnected facts through an increasingly large context to see how models can continue connecting the dots (similar to how in codebases you often have related ideas spread across many files)
> I’ve written millions of lines of revenue generating code
This is a wild claim.
Approx 250 working days in a year. 25 years coding. Just one million lines would be phenom output, at 160 lines per day forever. Now you are claiming multiple millions? Come on.
It's impossible as an IC on a team, or working where a concept of "tickets" exists. It's unavoidable as a solo founder, whether you're building enterprise systems or expanding your vision. Some details -
1. Before wife&kids, every weekend I would learn a library or a concept by recreating it from scratch. Re-implementing jQuery, fetch API via XHR, Promises, barebones React, a basic web router, express + common middlewares, etc. Usually, at least 1,000 lines of code every weekend. That's 1M+ over 25 years.
2. My last product is currently 400k LOCs, 95% built by me over three years. I didn't one-shot it, so assuming 2-3x ongoing refactors, that's more than 1M LOCs written.
3. In my current product repo, GitHub says for the last 6 months I'm +120k,-80k. I code less than I used to, but even at this rate, it's safely 100k-250k per year (times 20 years).
4. Even in open source, there are examples like esbuild, which is a side project from one person (cofounder and architect of Figma). esbuild is currently at ~150k LOCs, and GitHub says his contributions were +600k,-400k.
5. LOCs are not all the same. 10k lines of algorithms can take a month, but 10k lines of React widgets is like a week of work (on a greenfield project where you know exactly what you're building). These days, when a frontend developer says in an interview that their most extensive UI codebase was 100k LOCs, I assume they haven't built a big UI thing.
So yes, if the reference point is "how many sprint tickets is that", it seems impossible. If the reference point is "a creative outlet that aligns with startup-level rewards", I think my statement of "millions of lines" is conservative.
Granted, not all of it was revenue-generating - much was experimental, exploratory, or just for fun. My overarching point was that I build software products for (great) living, as opposed to a marketer who stumbled into Claude Code and now evangelizes it as some huge unlock.
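For what it's worth, the arithmetic behind those points is internally consistent if you take the stated figures at face value (a quick sketch, using only the numbers from the comment above):

    weekends = 1_000 * 52 * 25     # point 1: ~1k LOC per weekend for 25 years
    product  = 400_000 * 2.5       # point 2: 400k LOC with 2-3x refactor churn
    pace     = 120_000 * 2         # point 3: +120k in 6 months, annualized
    print(f"{weekends:,} / {product:,.0f} / {pace:,}/yr")
    # 1,300,000 / 1,000,000 / 240,000/yr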
No, it's not. At all. At the overwhelming majority of companies I've worked for or heard of, even 400-500 lines fully shipped in a week - slightly less than your figure here - would be top-quartile output. But further, it isn't necessarily the point: writing lines of code is a pretty small part of the job at companies with more than about 5-6 engineers on staff. Past that, it's a lot more design, architecture, and LEGO-brick-fitting - or just politicking and policying. Heck, I know folks who wish they could ship 400 lines of code a month but are held back by the bureaucracies of their companies.
Now extrapolate. That’s maybe 50k a year assuming some PTO.
10 years would make 500k and you just cross a million at 20.
So that would have to be 20 years straight of that style of working and you’re still not into plural millions until 40 years.
If someone actually produced multiple millions of lines in 25 years, it would have to be a side effect of some extremely verbose language where trivial changes take up many lines (maybe Java).
i've been using llm-based tools like copilot and claude pro (though not cc with opus), and while they can be helpful – e.g. for doc lookups, repetitive stuff, or quick reminders – i rarely get value beyond that. i've honestly never had a model surface a bug or edge case i wouldn’t have spotted myself.
i've tried agent-style workflows in copilot and windsurf (on claude 3.5 and 4), and honestly, they often just get stuck or build themselves into a corner. they don’t seem to reason across structure or long-term architecture in any meaningful way. it might look helpful at first, but what comes out tends to be fragile and usually something i’d refactor immediately.
sure, the model writes fast – but that speed doesn't translate into actual productivity for me unless it’s something dead simple. and if i’m spending a lot of time generating boilerplate, i usually take that as a design smell, not a task i want to automate harder.
so i’m honestly wondering: is cc max really that much better? are those productivity claims based on something fundamentally different? or is it more about tool enthusiasm + selective wins?
Unless you're getting paid for your commute, you're just giving your employer free productivity. I would recommend doing literally anything else with that time. Read a book, maybe.
If you can't do your job in your 8 hours then you're either not good enough or the requirements are too much and the company should change processes and hire.
Right, I'm not saying anyone should actually be in the office 40 hours a week; that sounds terrible. And even with all the RTO of the last couple of years, that doesn't seem to be expected many places.
Personally I use dev containers on a server, and I have written some template containers for quickly setting up new containers with Claude Code, plus some scripts for easily connecting to the right container, etc. Makes it possible to work on mobile, but there's lots of room for improvement in the workflow still.
The project is just a web backend. I give Claude Code grunt work tasks. Things like "make X operation also return Y data" or "create Z new model + CRUD operations". Asking it to implement well-known patterns like debouncing or caching for an existing operation also works well.
My app builds and runs fine on Termux, so my CLAUDE.md says to always run unit tests after making changes. So I punch in a request, close my phone for a bit, then check back later and review the diff. Usually takes one or two follow-up asks to get right, but since it always builds and passes tests, I never get complete garbage back.
There are some tasks that I never give it. Most of that is just intuition. Anything I need to understand deeply or care about the implementation of I do myself. And the app was originally hand-built by me, which I think is important - I would not trust CC to design the entire thing from scratch. It's much easier to review changes when you understand the overall architecture deeply.
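For reference, the kind of CLAUDE.md instruction described above might look something like this (illustrative wording only, not the commenter's actual file; the test command is an assumption):

    # CLAUDE.md (illustrative excerpt)
    - After every code change, run the full unit test suite (e.g. `make test`; substitute your runner).
    - If the build or any test fails, fix it before reporting the task as done.
    - Never commit or push; leave changes in the working tree for human review.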
you can easily reach $50 per day.
by force-switching the model to opus:
/model opus
it will continue to use opus even though there is a warning about approaching the limit.
i found opus is significantly more capable in coding than sonnet, especially for tasks that are poorly defined; thinking mode can fill in a lot of missing detail and you just need to edit a little before letting it code.
Claude Code with a Claude subscription is the cheap version for current SOTA.
"Agentic" workflows burn through tokens like there's no tomorrow, and the new Opus model is so expensive per-token that the Max plan pays for itself in one or two days of moderate usage. When people report their Claude Code sessions costing $100+ per day, I read that as the API price equivalent - it makes no sense to actually "pay as you go" with Claude right now.
This is arguably the cheapest option on the market right now in terms of results per dollar, but only if you can afford the subscription itself. There's also a time/value component: on Max x5, it's quite easy to hit the usage limits of Opus (fortunately the limit resets every 5 hours or so); Max x20 is only twice the price of Max x5 but gives you 4x more Opus; a better model means less time spent fighting with and cleaning up after the AI. It's expensive to be poor, unfortunately.
>less time spent fighting with and cleaning up after the AI.
I've yet to use anything but copilot in vscode, which is 1/2 the time helpful, and 1/2 wasting my time. For me it's almost break-even, if I don't count the frustration it causes.
I've been reading all these AI-related comment sections and none of it is convincing me there is really anything better out there. AI seems like break-even at best, but usually it's just "fighting with and cleaning up after the AI", and I'm really not interested in doing any of that. I was a lot happier when I wasn't constantly being shown bad code that I need to read and decide about, when I'm perfectly capable of writing the code myself without the hassle of AI getting in my way.
AI burnout is probably already a thing, and I'm close to that point already. I do not have hope that it will get much better than it is, as the core of the tech is essentially just a guessing game.
I tend to agree except for one recent experience: I built a quick prototype of an application whose backend I had written twice before and finally wanted to do right. But the existing infrastructure for it had bit-rotted, and I am definitely not a UI person. Every time I dive into html+js I have to spend hours updating my years-out-of-date knowledge of how to do things.
So I vibe coded it. I was extremely specific about how the back end should operate and pretty vague about the UI, and basically everything worked.
But there were a few things about this one: first, it was just a prototype. I wanted to kick around some ideas quickly, and I didn't care at all about code quality. Second, I already knew exactly how to do the hard parts in the back end, so part of the prompt input was the architecture and mechanism that I wanted.
But it spat out that html app way way faster than I could have.
Claude Code on the Pro plan is ~$20 USD/month and is nearly enough for someone like me who can't use it at work and is just playing around with it after work. I'm loving it.
cursor on a $20/month plan (if you burn through the free credits) or gemini-cli (free) are 2 great ways to try out this kind of stuff as a hobbyist. you can throw in v0 too, $5/month free credits. Supabase's free tier can give you a db as well.
Zed is fantastic. Just dipping my toes in agentic AI, but I was able to fix a failing test I spent maybe 15 minutes trying to untangle in a couple minutes with Zed. (It did proceed to break other tests in that file though, but I quickly reverted that.)
It is also BYOA or you can buy a subscription from Zed themselves and help them out. I currently use it with my free Copilot+ subscription (GitHub hands it out to pretty much any free/open source dev).
You can tell Claude Code to use opus using /model and then it doesn't fall back to Sonnet btw. I am on the $100 plan and I hit rate-limits every now and then, but not enough to warrant using Sonnet instead of Opus.
This is what I don't get about the cost being reported by Claude Code. At work I use it against our AWS Bedrock instance, and most sessions will report $15-20, and I'll have multiple agents running. So I can easily rack up 60 bucks a day in reported cost, yet our AWS Bedrock bill is only a small fraction of that. Why would you overcharge on direct usage of your API?
Seems like the survival strategy for Cursor would be to develop their own frontier coding model. Maybe they can leverage the data from their still somewhat significant lead in the space to make a solid effort.
I don’t think that’s a viable strategy. It is very very hard and not many people can do it. Just look at how much Meta is paying to poach the few people in the world capable of training a next gen frontier model.
The basic concept plus a lot of money spent on compute and training data gets you pretraining. After that to get a really good model there’s a lot more fine-tuning / RL steps that companies are pretty secretive about. That is where the “smart decisions” and knowledge gained by training previous generations of sota models comes in.
We’d probably see more companies training their own models if it was cheaper, for sure. Maybe some of them would do very well. But even having a lot of money to throw at this doesn’t guarantee success, e.g. Meta’s Llama 4 was a big disappointment.
That said, it’s not impossible to catch up to close to state-of-the-art, as Deepseek showed.
I'd also add that no one predicted the emergent properties of LLMs as they followed the scaling-laws hypothesis. GPT showed all kinds of emergent behavior, like reasoning and sentiment analysis, when we went up an order of magnitude in parameter count. We don't actually know what would emerge if we trained a quadrillion-parameter model. SOTA will always be mysterious until we reach those limits, so, no, companies like Cursor will never be on the frontier. It takes too much money and requires seeking out things we haven't ever seen before.
There are plenty of people theoretically capable of doing this, I secretly believe some of the most talented people in this space are randos posting on /r/LocalLlama.
But the truth is to have experience building models at this scale requires working at a high level job at a major FAANG/LLM provider. Building what Meta needs is not something you can do in your basement.
The reality is the set of people who really understand this stuff and have experience working on it at scale is very, very small. And the people in this space are already paid very well.
It's a staggeringly bad deal. It's a hugely expensive task where unless you are the literal best in the world, you would never even see any usage. And even for those who are BOTH best and well known they have to be willing to lose billions on repeat with no end in sight.
It's very, very rare to have winner-takes-all to such an extreme degree as with coding LLMs.
I don't think it's literally "winner takes all" - I regularly cycle between Gemini, DeepSeek and Claude for coding tasks. I'm sure any GPT model would be fine too, and I could even fall back to Qwen in a pinch (exactly what I did when I was in China recently with no ability to access foreign servers).
Claude does have a slight edge in quality (which is why it's my default) but infrastructure/cost/speed are all relevant too. Different providers may focus on one at the expense of the others.
One interesting scenario where we could end up is using large hosted models for planning/logic, and handing off to local models for execution.
I'd recommend reading some of the papers on what it takes to actually train a proper foundation model, such as the Llama 3 Herd of Models paper. It is a deeply sophisticated process.
Coding startups also try to fine-tune OSS models to their own ends. But this is also very difficult, and usually just done as a cost optimization, not as a way to get better functionality.
You need a person that can hit the ground running. Compute for LLM is extremely capital intensive and you’re always racing against time. Missing performance targets can mean life or death of the company.
As an actual user of Windsurf's model, I don't think "tried" is fair. I sometimes use it. It's not as smart as Gemini, but it iterates quicker and is very well aligned with their tool calls.
- Forking VSCode is very easy; you can do it in 1 hour.
- Anthropic doesn't use the inputs for training.
- Cursor doesn't have $900M ARR. That was the raise. Their ARR is ~$500m [1].
- Claude Code already supports the niceties, including "add selection to chat", accessing the IDE's realtime warnings and errors (built-in tool 'ideDiagnostics'), and using the IDE's native diff viewer for reviewing edits.
The cost of a VS Code fork is that Microsoft has restricted the extension marketplace for forks. You have to maintain a separate one; that is the real dealbreaker.
Their base plan is $20/mth. At $900M ARR, that would equal 3.75M people paying a sub to Cursor.
If literally everyone is on their $200/mth plan, then that would be 375K paid users.
There's 50M VS Code + VS users (May 2025). [1] 7% of all VS Code users having switched to Cursor does not match my personal circle of developers. 0.7%... maybe? But that would be if everyone using Cursor were paying $200/month.
Seems impossibly high, especially given the number of other AI subscription options as well.
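Spelling the math out, under the same assumptions as above:

    arr = 900e6                 # the $900M ARR figure used above
    print(arr / (20 * 12))      # 3,750,000 subscribers if all on $20/mo
    print(arr / (200 * 12))     # 375,000 if all on $200/mo
    print(3.75e6 / 50e6)        # ~7.5% of the cited 50M VS Code + VS users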
The $20/month cursor sub is heavily limited though, for basic casual usage that's fine but you VERY soon run into its limits when working at any speed.
I also just prefer CC's UX. I've tried to make myself use Copilot and Roo and I just couldn't. The extra mental overhead and UI context-switching took me out of the flow. And tab completion has never felt valuable to me.
But the chat UX is so simple it doesn't take up any extra brain-cycles. It's easier to alt-tab to and from; it feels like slacking a coworker. I can have one or more terminal windows open with agents I'm managing, and still monitor/intervene in my editor as they work. Fits much nicer with my brain, and accelerates my flow instead of disrupting it
There's something starkly different for me about not having to think about exactly what context to feed to the tool, which text to highlight or tabs to open, which predefined agent to select, which IDE button to press
Just formulate my concepts and intent and then express those in words. If I need to be more precise in my words then I will be, but I stay in a concepts + words headspace. That's very important for conserving my own mental context window
Claude Code is just proving that coding agents can be successful. The interface isn’t magic, it just fits the model and integrates with a system in all the right ways. The Anthropic team for that product is very small comparatively (their most prolific contributor is Claude), and I think it’s more of a technology proof than a core competency - it’s a great API $ business lever, but there’s no reason for them to try and win the “agentic coding UI” market. Unless Generative AI flops everywhere else, these markets will continue to emerge and need focus. The Windsurf kerfuffle is further proof that OpenAI doesn’t see the market as must-win for a frontier model shop.
And so I’d say this isn’t a harbinger of the death of Cursor, instead proof that there’s a future in the market they were just recently winning.
I was being hyperbolic saying their ARR will go to zero. That's obviously not the case, but the point is that CC has revealed their real product was not "agentic coding UI", it was "insanely cheap tokens". I have no doubt they will continue to see success, but their future right now looks closer to being a competitor to free/open tools like Cline/Roo Code, as well as the CLI entrants, not a standalone $500M ARR juggernaut. They have no horse in the race in the token market; they're a middleman.
They either need to create their own model and compete on cost, or hope that token costs come down dramatically so as to be too cheap to meter.
Digging in here more... why would you say it isn't in Anthropic's interest to win the "agentic coding UI" market?
My mental model is that these foundation model companies will need to invest in and win in a significant number of the app layer markets in order to realize enough revenue to drive returns. And if coding / agentic coding is one of the top X use cases for tokens at the app layer, seems logical that they'd want to be a winner in this market.
Is your view that these companies will be content to win at the model layer and be agnostic as to the app layer?
My intuition is that their fundamental business is executing on the models, and any other products are secondary and exist to drive revenue that they can use to compete against Google/OpenAI/Meta, as well as to ensure - and demonstrate - that their models are performant in these new markets. Claude needs to be great at coding, but Anthropic doesn't need to own Coding. Claude Code is growing their core business, just like a Claude Robotics or a Claude Scheduling might, but they can't focus on robotics or scheduling because that takes them away from the core business of models. A strategic relationship with Cursor might have been enough to accomplish this, but it wasn't - maybe Cursor couldn't execute fast enough, or didn't align on priorities, or whatever. I've watched a bunch of interviews with the CC team and I very much get the impression that it was more "holy shit, this works great" than a product strategy.
You may be right about "they need to invest in and win" in order to have __enough__ revenue to outcompete the nation-state-sized competition, but this stuff is moving way too fast for anyone to know.
Cursor sees it coming - it's why they're moving to the web and mobile[0].
The bigger issue is the advantage Anthropic, Google and OpenAI have in developing and deploying their own models. It wasn't that long ago that Cursor was reading 50 lines of code at a time to save on token costs. Anthropic just came out and yolo'd the context window because they could afford to, and it blew everything else away.
Cursor could release a cli tomorrow but it wouldn't help them compete when Anthropic and Google can always be multiples cheaper
> Anthropic just came out and yolo'd the context window because they could afford to
I don’t think this is true at all. The reason CC is so good is that they’re very deliberate about what goes in the context. CC often spends ages reading 5 LOC snippets, but afterwards it only has relevant stuff in context.
Heard a lot of this context BS parroted all over HN; don't buy it. If simply increasing context size could solve the problem, Gemini would be the best model for everything.
I think this is an interesting and cool direction for Cursor to be going in and I don't doubt something like this is the future. But I have my doubts whether it will save them in the short/medium term:
- AI is not good enough yet to abandon the traditional IDE experience if you're doing anything non-trivial. It's hard to find use cases for this right now.
- There's no moat here. There are already a dozen "Claude Code UI" OSS projects with similar basic functionality.
Strictly speaking about large, complex, sprawling codebases, I don't think you can beat the experience that an IDE + coding agent brings with a terminal-based coding agent.
The auto-regressive nature of these things means that errors accumulate, and IDEs are better placed than a terminal agent to give the human that observability. I can course-correct more easily in an IDE with clear diffs and code navigation than by following a terminal timeline.
You can view and navigate the diffs made by the terminal agent in your IDE in realtime, just like Cursor, as well as commit, revert, etc. That’s really all the “integration” you need.
Some excellent points. On “add selection to chat”, I just want to add that the Claude Code VS code extension automatically passes the current selection to the model. :)
I am genuinely curious if any Cursor or Windsurf users who have also tried Claude Code could speak to why they prefer the IDE-fork tools? I’ve only ever used Claude Code myself - what am I missing?
Cursor's tab completion model is legitimately fantastic and for many people is worth the entire $20 subscription. Lint fixes or syntax-level refactors are guessed and executed instantly with TAB with close to 100% accuracy. This is their final moat IMO, if Copilot manages to bring their tab completion up to near parity, very little reason to use Cursor.
Idk. When you're doing something it really gets, it's super nice, but it's also off a lot of the time, and IMO it's super distracting when it constantly pops up. There's no way to explicitly request it instead - other than toggling, which seems to also turn off context/edit tracking, because after toggling it back on it does not suggest anything until you make some edits.
While Zed's model is not as good, the UI is so much better IMO.
Just to offer a different perspective, I use Cursor at work and, coming from emacs (which I still use) with copilot completions only when I request them with a shortcut, Cursor’s behavior drives me crazy.
Which Emacs package do you use for Copilot? I tried copilot.el a long while ago but had problems with it. Is there something new, or does copilot.el fulfill your needs?
I only get suggestions if I use that key (the prefix looks huge, but I have a Keyboardio Model 100 and I have that bound to the Any key, so I intentionally picked a crazy-long prefix hoping to avoid collisions with other shortcuts), and that's the way I like to use these tools (which is why the behaviour in Cursor drives me crazy, though I admit I haven't spent time looking at its configuration; maybe it's something that can be turned off).
I haven't used Cursor or Claude much, how different is it from Copilot? I bounce between desktop ChatGPT (which can update VS Code) and copilot. Is there an impression that those have fallen behind?
IME, one of execution. Copilot is like having your cousin who works at Best Buy try to help you code - it knows what a computer is, and speaks English, but is pretty bad at both.
The story I've heard is that Cursor is making all their money on context management and prompting, to help smooth over the gap between "you know what I meant" and getting the underlying model to "know what you meant"
I haven't had as much experience with Claude or Claude Code to speak to those, but my colleagues speak of them highly
It's quite interesting how little the Cursor power users use tab. Majority of the posts are some insane number of agent edits and close to (or exactly) 0 tabs.
At my company we have an enterprise subscription and we're also all allowed to see the analytics for the entire company. Last I checked, I was literally the number one user of Tab and middle of the pack for agent.
It's interesting when I see videos or Reddit posts about Cursor and people getting rate-limited and being super angry. In my experience tab is the number one feature, and I feel like most people using agent are probably overusing it for tasks that would honestly take less time to do themselves, or using models way smarter than they need to be for the task at hand.
I use cursor strictly for agent edits and do anything else in a proper IDE meaning in a Jetbrains product that I run in a separate window.
Many of my co-workers do the same. VS Code is vastly inferior when it comes to editing and actual IDE features, so it is a non-starter when you do programming yourself.
I once tried AI tab-complete on Zed and it was all right but breaks my flow. Either the AI does the editing or I do it but mixing both annoys me.
I'd like to ask the opposite question: why do people prefer command line tools? I tried both and I prefer working in IDE. The main reason is that I don't trust the LLMs too much and I like to see and potentially quickly edit the changes they make. With an IDE, I can iterate much faster than with the command line tool.
I haven't tried the Claude Code VS Code extension. Has anyone replaced Cursor with this setup?
I replaced it. My opinion: Cursor sucks as an IDE.
Cursor may have average to above-average quality in IDE assistance, but the IDE seems to get in the way. Its entire performance depends on the real-time performance and latency of their servers, and sometimes it is way too slow. The TAB autocomplete that was working for you in the last 30 minutes suddenly stops working at random, or experiences such severe delays that it stops making sense.
Besides that, the IDE seems poorly designed - some navigation options are confusing and it makes way too many intrusive changes (e.g., automatically finishing strings).
I've since gone back to VS Code - with Cline (using OpenRouter and super cheap Qwen Coder models), Windsurf FREE, and Claude Code at $20 per month - and I get great mileage from all of them.
You're looking at (coloured) diffs in your shell is all when it comes to coding. It's pretty easy to setup MCP and have claude be the director. Like I have zen MCP running with an OpenRouter API key, and will ask claude to consult with pro (gemini) or o3, or both to come up with an architecture review / plan.
I honestly don't know how great that is, because it just reiterates what I was planning anyways, and I can't tell if it's just glazing, or it's just drawing the same general conclusions. Seriously though, it does a decent job, and you can discuss / ruminate over approaches.
I assume you can do all the same things in an editor. I'm just comfortable with a shell is all, and as a hardcore Vi user, I don't really want to use Visual Studio.
I also use vim heavily and I've found that I'm really enjoying Cursor + VS Code Vim extension. The cursor tab completion works very nicely in conjunction with vim navigate mode.
heh, including "for diffing" is selling short when our new job as software developers now seems to be reviewing code, of which looking at a diff is only one tiny part. That goes infinitely more for dynamically typed languages, where there is no compiler to catch dumb typos. If I have to actually, no kidding, review code then I want all the introspections, find references, go to declaration, et al for catching the intern trying to cheat me
I can roll back to different checkpoints with Cursor easily. Maybe CC has it but the fact that I haven’t found it after using it daily is an example of Cursor having a better UX for me.
I like using Claude Code through Roo Code (vscode extension). I find it easier to work with text using a mouse, vscode diff viewer etc. I guess if you're very good at vim shortcuts etc you can use that in Claude Code instead of selecting text with a mouse. Claude Code has a vscode extension too so I feel that using Claude Code through vscode just adds a better UI.
Using cline for a bit made me realize cursor was doomed. Everything is just a gpt/anthropic wrapper of fancy prompts.
I can do most of what I want with cline, and I've gone back from large changes to just small changes and been moving much quicker. Large refactors/changes start to deviate from what you actually want to accomplish unless you have written a dissertation, and even then they fail.
I agree with all you've said, but with regard to writing a dissertation for larger changes: have you tried letting it first write a plan for you as markdown (just keep this file uncommitted) and then letting it build a checklist of things to do?
I find just referencing this file over and over works wonders and it respects items that were already checked off really well.
I can get a lot done really fast this way, in small enough chunks that I know every bit of code and how it works (tweaking manually where needed, of course).
But I can blow through some tickets way faster than before this way.
IIRC the problem is that VS Code does not allow extensions to create custom UI in the panel areas except via WebViews(?). That doesn't make for a great experience. Plus, Cursor does a lot with background indexing to make their tab completion model really good - more than would be possible with the extension APIs available.
Not if you want custom UI. There are a lot of things you can do in extension land (continue, cline, roocode, kilocode, etc. are good examples) but there are some things you can't.
One thing I would have thought would be really cool to try is integrating it at the LSP level and using all that good stuff, but apparently the people trying (I think there was a company from .il) either went closed or didn't release anything noteworthy...
When the Copilot extension needs a new VS Code feature it gets added, but it isn't available to third party extensions until months later... Err, years later... well, whenever Microsoft feels like it.
So an extension will never be able to compete with Copilot.
I use Augment extensively and find it superior to Cursor in every way - and it operates as an extension. It has a really handy task-planning interface and a meta prompt refinement feature, and the costs are remarkably low. The quality of the implementation output is higher IMO, and I don't have to do a lot of model selection or get Max-model bill explosions. If there's something Cursor provided that Augment doesn't via an extension, it was not functionally useful enough to notice.
I think Augment has been flying under the radar for many people, and really deserves better marketing.
I've been using Augment for over a year with IntelliJ, and never understood why my colleagues were all raving about Cursor and Windsurf. I gave Cursor a real try, but it wasn't any better, and the value proposition of having to adopt a dedicated IDE wasn't attractive to me.
A plugin to leverage your existing tools makes a lot more sense than an IDE. Or at least until/if AI agents get so smart that you don't need most of the IDE's functionality, which might change what kinds of tooling are needed when you're in the passenger seat rather than the driver's seat.
I've really struggled with using the extensions - their UI/UX is worse, they're much more limited in what they can do and they're much more unstable (in IntelliJ at least).
Then again writing mostly kotlin I cannot get along with the VS Code forks as they're just not that great outside of typescript projects in my experience.
I tend to prompt in cursor/windsurf and refactor in IntelliJ which is okay but a bit of a pain.
One competitor to Claude Code that I don't hear much about is Jetbrains Junie. From my experience, the code it generates is as good as CC, and if you've purchased a Jetbrains license you probably have some amount of free Junie every month.
I'll fill in some context. I think the value of Cursor as an IDE is probably somewhat ephemeral. It's mostly combating Microsoft's ambitions to keep other players busy and box them out of the market. Agents gain a lot of value from model context protocol and there's an amazingly short list of clients that fully support the protocol, but the VSCode Chat window is one of them: https://modelcontextprotocol.io/clients
I actually do prefer the view that having the agent built into an IDE brings me, but I'll be damned if I'm forced to use Copilot/OpenAI. Second to that, the agent does have access to a lot more contextual tools by being built into the editor, like focused linting errors and test failures. Of course, that demands your development environment be set up correctly, and it could be replicated with Claude Code to some extent.
I never got the valuation. I (and many others) have built open source agent plugins that are pretty much just as good, in our free time (check out magenta nvim btw, I think it turned out neat!)
> with a simple extension for some UX improvements
What are the UX improvements?
I was using the Pycharm plugin and didn’t notice any actual integration.
I had problems with PyCharm's terminal - not least of which was the default 5k-line scrollback, which, while easy to change, was the worst part of CC for me at first.
I finally jumped to using iterm and then using pycharm separately to do code review, visual git workflows, some run config etc.
But the actual value of PyCharm (and I've been a real booster of that IDE) has shrunk due to CC, and moving out of the built-in terminal is a threat to my usage of the product.
If the plugin offered some big value I might stick with it but I’m not sure what they could even do.
#1 improvement for VS Code users is giving the agent MCP tools to get diagnostics from the editor LSPs. Saves a tremendous amount of time having the agent run and rerun linting commands.
Does anyone have a comparison between this and OpenAI Codex? I find OpenAI's thing really good actually (vastly better workflow than Windsurf). Maybe I am missing out, however.
Codex CLI is very bad; it often struggles to even find the right file, going on a rampage inside the home directory and commenting on random folders. Using o3/o4-mini in Aider is decent, though.
> What does Cursor/Windsurf offer over VS Code + CC?
Cursor's @Docs is still unparalleled and no MCP server for documentation fetching even comes close. That is the only reason why I still use Cursor, sometimes I have esoteric packages that must be used in my code and other IDEs will simply hallucinate due to not having such a robust docs feature, if any, which is useless to me, and I believe Claude Code also falls into that bucket.
> Cursor's @Docs is still unparalleled and no MCP server for documentation
I strongly disagree. It will put the wrong doc snippets into context 99% of the time. If the docs are slightly long then forget it, it’ll be even worse.
What packages do you use it for? I honestly never had that issue, it's very good in my use cases to find some specific function to call or to figure out some specific syntax.
just curious because I'm inexperienced with all the latest tools here
> - Tab completion model (Cursor's remaining moat)
What is that? I have Gemini Code Assist installed in VSCode and I'm getting tab completion. (yes, LLM based tab completion)
Which, as an aside, I find useful when it works but also often extremely confusing to read. Like, say, in C++ I type
int myVar = 123
The editor might show
int myVar = 123;
And it's nearly impossible to tell that I didn't enter that `;`, so I move on to the next line instead of pressing tab, only to find the `;` wasn't really there. That's also probably an easy example. Literally on 1 of every 6 lines I type, it feels like I can't tell what is actually in the file and what is being suggested. Any tips? Maybe I just need to set some special background color for text being suggested.
and PS: that tiny example is not an example of a great tab completion. A better one is when I start editing 1 of 10 similar lines, I edit the first one, it sees the pattern and auto does the other 9. Can also do the "type a comment and it fills in the code" thing. Just trying to be clear I'm getting LLM tab completion and not using Cursor
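On the "can't tell what's real" problem above: VS Code lets you re-color inline suggestion ghost text so it's visually distinct. Assuming the standard ghost-text theming keys (names may vary by version), something like this in settings.json:

    {
      "workbench.colorCustomizations": {
        "editorGhostText.foreground": "#7a7a7a",
        "editorGhostText.background": "#2f3a2f",
        "editorGhostText.border": "#555555"
      }
    }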
It gets even worse when all three of IntelliSense, AI completion, and the human are all vying for control of the input. This can be very frustrating at times.
I use Windsurf so I remain in the driver's seat. Using AI coding tools too much feels like brain rot where I can't think sharply anymore. Having auto complete guess my next edit as I'm typing is great because I still retain all the control over the code base. There's never any blocks of code that I can't be bothered to look at, because I wrote everything still.
I often use the same setup. Qwen 2.5 coder is very good on its own, but my Emacs setup doesn’t also use web search when that would be appropriate. I have separately been experimenting with the Perplexity Sonar APIs that combine models and search, but I don’t have that integrated with my Emacs and Qwen setup - and that automatic integration would be very difficult to do well! If I could ‘automatically’ use a local Qwen, or other model, and fall back to using a paid service like Perplexity or Gemini grounding APIs just when needed that would be fine indeed.
I am thinking about a new setup as I write this: in Emacs, I explicitly choose a local Ollama model or a paid API like Gemini or OpenAI, so I should just make calling Perplexity Sonar APIs another manual choice. (Currently I only use Perplexity from Python scripts.)
If I owned a company, I would frequently evaluate privacy and security aspects of using commercial APIs. Using Ollama solves that.
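A minimal sketch of that manual dispatch between a local Ollama model and a paid search-grounded API, assuming Ollama's local REST endpoint and Perplexity's OpenAI-compatible chat endpoint (model names and key handling are placeholders):

    import os, requests

    def ask(prompt, backend="local"):
        if backend == "local":
            # Local Ollama chat endpoint; model name is a placeholder.
            r = requests.post("http://localhost:11434/api/chat", json={
                "model": "qwen2.5-coder",
                "messages": [{"role": "user", "content": prompt}],
                "stream": False})
            return r.json()["message"]["content"]
        # Paid, search-grounded fallback: Perplexity Sonar (needs an API key).
        r = requests.post("https://api.perplexity.ai/chat/completions",
            headers={"Authorization": f"Bearer {os.environ['PPLX_API_KEY']}"},
            json={"model": "sonar",
                  "messages": [{"role": "user", "content": prompt}]})
        return r.json()["choices"][0]["message"]["content"]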
Windsurf's big claim to fame was that you could run their model air-gapped, and they said they did not train on GPL code. This was an option available for Enterprise customers until they took it away recently to prevent self-hosting.
I agree it has a good chance of catching up, but the difference in quality is pretty noticeable today. I'd much rather stick with vscode, because I hate all the subtle ways Cursor changes the UI; like taking over the keyboard shortcut for clearing the scrollback in the terminal. But I find it's pretty hard to use Copilot's tab completion after using Cursor for a while.
I think CC is just far more useful; I use it for literally everything and without MCP (except Puppeteer sometimes), as it just writes Python/bash scripts that do the job far better than all those hacked-together MCP garbage bins. It controls my computer & writes code. It made me better as well: now I actually write code, including GUI/web apps, that is always fully scriptable. It helps me, but it definitely helps CC; it can just interrogate/test everything I make without Puppeteer (or other web browser control, which is always brittle as hell).
CC would explode even further if they had an official Team/Enterprise plan (likely in the works - the Claude Code "Waffle" flag) and worked on Windows without WSL (supposedly pretty easy to fix; they just didn't bother). Cursor learned the percentage of Windows users was really high when they started looking, even before they really supported it.
They're likely artificially holding it back, either because it's a loss leader they want to use in a very specific way, or because they're planning the next big boom/launch (maybe with a new model to build hype?).
Cursor's multi-file tab completion and multi-file diff experience are worth $20 easily IMO.
I truly do not understand people's affinity for a CLI interface for coding agents. Scriptability I understand, but surely we could agree that CC with Cursor's UX would be superior to CC's terminal alone, right? That's why CC is pushing IDE integration -- they're just not there yet.
I can't stand the UX, or VS Code's UX in general. I vastly prefer having CC open in a terminal alongside neovim. CC is fully capable of opening diffs in neovim or otherwise completely controlling neovim by talking to its socket.
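For anyone curious what "talking to its socket" looks like: start neovim with a listen address and drive it over RPC, e.g. with the pynvim package (paths and filenames here are made up for illustration):

    # First, in your editing terminal: nvim --listen /tmp/nvim.sock
    import pynvim

    nvim = pynvim.attach("socket", path="/tmp/nvim.sock")
    nvim.command("edit src/main.py")                      # open the file the agent touched
    nvim.command("vert diffsplit /tmp/cc_proposed.py")    # side-by-side diff of its proposal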
Fair enough. I guess a better way to put it is: for people who like Cursor's UX, but prefer Claude Code's performance as an agent, the combination of both would be the true killer app. Having to choose between these feels like a temporary gap in the evolution of these tools, and I'm ready for us to get past it.
I sympathize with your feeling that there is a "gap", but I'm fairly certain that both my ideal workflow and your ideal workflow are unlikely to be anything more than evolutionary dead ends, like early automobiles that inherited the shape of horse-drawn carriages.
I don't know where the evolution of coding agents will take us in the next couple of years, but I would not be shocked if it looks more like a GitHub issue/PR tracker than a code+chat interface with autocomplete, etc. I'm already noticing that I'm starting to rely on tmux + multiple CC instances with independent worktrees instead of babysitting each proposed change.
I strongly agree with you.
I’m more of a CLI guy, and Claude Code just works. Most good projects have a CLI anyway (gcloud, GitHub CLI, Vercel, etc.). I prefer CLI vs MCP’s.
I’m on the $200 plan, and it’s absolutely worth it (never thought I’d say this for a CLI app).
I don’t see how there will be any money to be made in this industry once these models are quantized and all local. It’s going to be one of the most painful bubble deflations we have ever seen and the biggest success of open source in our lifetimes.
The forked IDE thing I don't understand either, but...
During the evaluation at a previous job, we found that Windsurf was waaaay better than anything else. They were expensive (to train on our source code directly), but the solution they offered outperformed the others.
A lot of engineers underestimate the learning curve required to jump from IDE to terminal. Multiple generations of engineers were raised on IDEs. It's really hard to break that mental model.
We are working to resolve this. It's still in preview but expect to see healthy Gemini 2.5 Pro allocations for subscription customers in the near future.
Wait a minute, have you often run out of the gemini cli free daily quota? Their free quota is very generous because they are trying to get market/mind share.
OK, thanks, I understand now what is happening. I use gemini-cli for one specific task at a time and sometimes just one 5 to 10 minute session a day. If I use long work sessions then I will add my API key and pay.
Claude Code is totally different paradigm. You don't edit your files directly so there is no tab autocomplete. It's a chat session.
There are IDE integrations where you can run it in a terminal session while perusing the files through your IDE, but it's not powering any autocomplete there AFAIK.
Yes or running claude code in the cursor/vscode terminal and watching the files change and then reviewing in IDE. I often like to be able to see an entire file when reviewing a diff, rather than just the lines that changed. Plus it's nice to have go-to-definition when reviewing.
Yes, it shows you the file diff. But generally, the workflow is that you git commit a checkpoint, then let it make all the changes it wants freely, then in your IDE, review what has changed since previous commit, iterate the prompts/make your own adjustments to the code, and when you like it, git commit.
Depending on what I'm doing with it I have 3 modes:
Trivial/easy stuff - let it make a PR at the end and review in GitHub. It rarely gets this stuff wrong IME or does anything stupid.
Moderately complex stuff - let it code away, review/test it in my IDE and make any changes myself and tell claude what I've changed (and get it to do a quick review of my code)
Complex stuff - watch it like a hawk as it is thinking and interrupt it constantly asking questions/telling it what to do, then review in my IDE.
Apparently they are, which is crazy to me. Zed agent mode shows modified hunks and you can accept/reject them individually. I can't imagine doing it all through the CLI, it seems extremely primitive.
As far as I can tell, terminal agents are inferior to hosted agents in sandboxed/imaged environments when it comes to concurrent execution, and far inferior to an assisted IDE in terms of UX, so what exactly is the point? The "UI niceties" are the whole point of using Cursor, and somehow everyone else sucks at them.
Done. Now you have a SOTA agentic AI with pretty forgiving usage limits up and running immediately. This is why it's capturing developer mindshare. The simplicity of getting up and going with it is a selling point.
Plus it’s straightforward to make Claude Code run agents in parallel/background just like Codex and Cursor, in local sandboxes: https://github.com/dagger/container-use
You're missing the point tho. The point of the CLI agent is that it's a building block to put this thing everywhere. Look at CC's GitHub plugin; it's great.
CC on GitHub just looks like Codex. I see your point, but it seems like all the big players basically have a CLI agent, and most of them think it's just an implementation detail, so they don't expose it.