One thing I hope to see included is a precursor step when constructing specs where Claude is used to intelligently inquire about gaps to fill that would disambiguate the implementation. If you told an engineer to do something with a set of requirements and outcomes, they'd naturally also have follow-up questions to ensure alignment before executing.
Yes, kind of like OpenAI's deep research tool. I often find that a number of mistakes are made because clarification questions aren't asked or even considered.
> And in the long run, the best way to get what you want is to deserve it.
Love this quote.
And just a comment on agency: it's not necessarily rewarded or even acknowledged in all environments. Some expect that you're literally a worker-robot and just need someone to carry out menial tasks from up top. You don't want to end up at one of these places, regardless of what your life situation is.
May be worth it for founders if you're naturally obsessive about the problem you're solving. No bueno for employees (or anyone with less than say ~5% equity)
While I agree generally with the premise that the silver bullet AI coding has been marketed as has underdelivered (even if it doesn't feel that way), I gotta point out that the experiment and its results don't do a good job of capturing that. One of the biggest parts of using these AI tools is knowing which tasks they're most suitable for (and sometimes that means using them on only certain subtasks of a task). As mentioned, some tasks they absolutely excel at. Flipping a coin to decide whether to use them is crude and unrealistic. Hard to come up with a reliable method though; I also think METR's study has its own glaring issues.
You're exactly right. To be honest, in pretty much every case I've seen, indicating usage of a read-only resource directly in the prompt always outperforms using the MCP for it. Should really only be using MCP if you need MCP-specific functionality imo (elicitation, sampling)
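To make the inlining approach concrete, here's a minimal sketch of what "indicating usage of a read-only resource directly in the prompt" can look like. This is illustrative only: `build_prompt` and the `<resource>` tagging convention are hypothetical, not from any real SDK or the MCP spec; the point is simply that the resource lands in-context up front instead of behind a tool call.

```python
from pathlib import Path

def build_prompt(question: str, resource_path: str) -> str:
    """Embed a read-only resource directly in the prompt text.

    Hypothetical helper: reads the file once and inlines it, so the
    model sees the full resource in-context with no tool round-trips.
    """
    resource = Path(resource_path).read_text()
    return (
        "Use the following reference material to answer.\n"
        f'<resource name="{resource_path}">\n{resource}\n</resource>\n\n'
        f"Question: {question}"
    )
```

The trade-off: this burns context-window tokens on every request, but avoids the latency and failure modes of an extra tool hop, which is why it tends to win for small, static, read-only resources.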
If they do start to become unsustainable, you might see more companies moving to a BYOK or usage-based billing model. If they do that, I don't know whether the use cases for AI would justify the cost for consumers (though perhaps for businesses). There's been a ton of data center build-out, so I do think the cost reductions we've seen so far may continue, but at the expense of more performant models. Hard to tell right now though.