Thanks! We all dogfood Claude every day to do our own work here, and solving our...

jasonjmcghee · 2025-02-24T20:37:47 1740429467

Just want to say nice job and keep it up. Thrilled to start playing with 3.7.

In general, benchmarks seem to very misleading in my experience, and I still prefer sonnet 3.5 for _nearly_ every use case- except massive text tasks, which I use gemini 2.0 pro with the 2M token context window.

jasonjmcghee · 2025-02-24T22:00:01 1740434401

An update: "code" is very good. Just did a ~4 hour task in about an hour. It cost $3 which is more than I usual spend in an hour, but very worth it.

martinald · 2025-02-24T20:41:50 1740429710

I find the webdev arena tends to match my experience with models much more closely than other benchmarks: https://web.lmarena.ai/leaderboard. Excited to see how 3.7 performs!

LouisSayers · 2025-02-24T20:08:13 1740427693

Could you tell us a bit about the coding tools you use and how you go about interacting with Claude?

catherinewu · 2025-02-24T20:28:28 1740428908

We find that Claude is really good at test driven development, so we often ask Claude to write tests first and then ask Claude to iterate against the tests

Kerrick · 2025-02-24T20:48:36 1740430116

Write tests (plural) first, as in write more than one failing test before making it pass?

zarmin · 2025-02-25T01:40:03 1740447603

Time to look up TDD, my friend.

DrammBA · 2025-02-25T15:22:29 1740496949

One of today's lucky 10,000. His mind is about to expand beyond imagination.

DrammBA · 2025-02-26T02:37:58 1740537478

I wish I could delete my original comment now that I found out that Kerric wasn't a lucky 10,000, he's just an asshole...

zarmin · 2025-02-26T05:35:41 1740548141

Well, you lucky-10,000'd people who didn't know about the 10,000 thing. That's not nothing.

Kerrick · 2025-02-25T18:03:30 1740506610

Time to actually read Test-Driven Development By Example, my friend. Or if you can't stomach reading a whole book, read this: https://tidyfirst.substack.com/p/canon-tdd

TL;DR - If you're writing more than one failing test at a time, you are not doing Test-Driven Development.

zarmin · 2025-02-25T23:24:20 1740525860

oh my god, your comment was just a setup for you to be pedantic? all discourse on the internet is worthless. i don't know why i keep engaging.