Sure, but Augment’s main value add is their context engine, and imo they do it really well. If all they had to do was launch an MCP for their context engine product to compete, I think the comparison is still worth exploring.
It's fascinating to see the evolution of HN sentiment towards LLMs in real time. Just a few months ago, projects like these were a dime a dozen and every AI-related post had a skeptical comment at the top. Now I'm almost surprised to see a project like this hit the front page.
I don't have any particular opinion about this project itself, I'm sure there are legitimate use cases for wanting to trick LLMs or obfuscate content etc. But if these sorts of projects are a litmus test for AI skepticism, I'm seeing a clear trend: AI skeptics are losing ground on HN.
I actually made this back in August but never posted it until now.
I agree with your point; many of the comments say that simple regex filtering can solve it, but they seem to ignore that it would break many languages that rely on these characters for things like accent marks.
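To make the point concrete, here's a minimal sketch (assuming the goal is stripping invisible/zero-width characters): a naive "remove everything non-ASCII" regex also destroys accented letters, while filtering on the Unicode format category (`Cf`) removes only the invisible code points.

```python
import re
import unicodedata

text = "café mañana\u200b\u200d"  # accented text plus zero-width chars

# Naive approach: strip everything outside ASCII — also destroys accents
naive = re.sub(r"[^\x00-\x7F]", "", text)

# Targeted approach: drop only format-category (Cf) code points,
# which covers zero-width space, zero-width joiner, etc.
targeted = "".join(ch for ch in text if unicodedata.category(ch) != "Cf")

print(naive)     # accents are gone: "caf maana"
print(targeted)  # accents survive: "café mañana"
```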
> 47 tools = 141k tokens consumed before you write a single word
This is the real problem in my opinion.
There are a ton of great-sounding MCP servers, but in practice they have too many individual tools and way too much documentation for each tool. It inflates processing time and burns tokens.
I find MCP is the opposite of the Unix design philosophy. You want fewer tools with more options surfaced via schema, shorter documentation, and you want to rely on convention as much as possible.
You don’t want separate create-file, write-file, and update-file tools; you want one write-file tool that can do all of those things. Instead of ls and find, you want your list-files tool to support regex and fuzzy matching and return a metadata list.
This is based on building these things for most of this year, so it’s anecdotal and ymmv.
As an example, rust-mcp-filesystem has 24 tools, many with completely overlapping functionality: `head_file`, `tail_file`, `read_file_lines`, `read_text_file` plus multi-file variants; or there's `list_directory`, `list_directory_with_sizes`, `calculate_directory_size`, `search_files`, and `directory_tree`. I think that whole server could be 4–6 MCP tools and it would accelerate things.
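As a sketch of what that consolidation might look like, here's a hypothetical single `write_file` tool definition using MCP's `name`/`description`/`inputSchema` shape, where a `mode` enum replaces separate create/overwrite/append tools (the specific field values are illustrative, not from any real server):

```python
import json

# Hypothetical consolidated MCP tool: one write_file with a mode flag
# instead of separate create_file / write_file / append_file tools.
write_file_tool = {
    "name": "write_file",
    "description": "Create, overwrite, or append to a file.",
    "inputSchema": {
        "type": "object",
        "properties": {
            "path": {"type": "string"},
            "content": {"type": "string"},
            "mode": {
                "type": "string",
                "enum": ["create", "overwrite", "append"],
                "default": "overwrite",
            },
        },
        "required": ["path", "content"],
    },
}

print(json.dumps(write_file_tool, indent=2))
```

One tool definition like this costs a fraction of the schema and documentation tokens of three near-duplicate tools, and the enum surfaces the behavior options in the schema itself, which is the convention-over-tools point above.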
No mention of coding benchmarks. I guess they've given up on competing with Claude and GPT-5 there. (and from my initial testing of grok 4.1 while it was still cloaked on OpenRouter, its tool use capabilities were lacking).
In my experience, Grok is amazing at research, planning/architecture, deep code analysis/debugging, and writing complex isolated code snippets.
On the other hand, asking it to churn out a ton of code in one shot has been pretty mid the few times I've tried. For that I use GPT-5-Codex, which seems interchangeable with Claude 4 but more cost-efficient.
Codex is good when you have a clear spec and an isolated feature.
Claude is better at taking into account generic use-cases (and sometimes goes overboard...)
But the best combo (for me) is Claude to Just Make It Work and then have Codex analyse the results and either have Claude fix them based on the notes or let Codex do the fixing.
Ah okay, that makes sense. I do a lot of planning with Gemini and Grok before the coding model ever gets involved, so that might be why I've never noticed a clear difference in output quality between GPT-5, GPT-5-Codex, and Claude 4.
TBH I really should do a lot more pre-planning for tasks - especially on new projects. But it's just so much more rewarding to shove Claude at a quick idea, watch some shows and come back to see what it figured out =)
Since coding is such a common use case, and since Claude and GPT-5-Codex are fairly high bars to beat, I'm guessing we'll see an updated coding model soon.
Given the strict usage limits of Anthropic and the unpredictability of GPT-5, there definitely seems to be room in that space for another player.
I'm with you, which is why I started the post by stating that most group chats don't need LLM assistants. But I do wonder if your friends have ever posted an AI generated image in the gc, or text you suspect was generated by LLMs? I would be very surprised if not.