There's the technique of model orthogonalization which can often zero out certain tendencies (most often, refusal), as demonstrated by many models on HuggingFace. There may be an existing open weights model on HuggingFace that uses orthogonalization to zero out positivity (or optimism)--or you could roll your own.
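The linear algebra behind orthogonalization is simple to sketch. This is a minimal numpy illustration, not any particular library's implementation: it assumes a "positivity direction" `d` has already been extracted (e.g. from a difference of mean activations), and `ablate_direction` is a hypothetical helper name. Real ablation tooling applies this to the model's actual weight matrices.

```python
import numpy as np

def ablate_direction(W, d):
    """Remove the component of each row of weight matrix W that lies
    along direction d, by projecting onto d's orthogonal complement."""
    d = d / np.linalg.norm(d)          # normalize to a unit vector
    return W - np.outer(W @ d, d)      # subtract each row's projection onto d

# Toy demonstration with random data standing in for real weights/directions.
rng = np.random.default_rng(0)
W = rng.standard_normal((4, 8))        # stand-in weight matrix
d = rng.standard_normal(8)             # stand-in "positivity" direction
W_ablated = ablate_direction(W, d)

# After ablation, W can no longer write anything along d:
print(np.allclose(W_ablated @ (d / np.linalg.norm(d)), 0))
```

Applied to every matrix that writes into the residual stream, this guarantees the model can no longer represent that direction, which is why the technique suppresses the targeted tendency rather than merely discouraging it.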
Honestly, I think the most likely outcome is that incumbent businesses never successfully adopt AI, but are simply outcompeted by their AI-native competitors.
Sears also did everything it could to annihilate itself while dot-com was happening.
Their CEO was a believer in making his departments compete for resources, leading to a brutal, dysfunctional clusterfuck: rent-seeking behavior on the inside as well as the outside.
IMO the lack of real version control and the lack of reliable programmability have been significant impediments to impact and adoption. The control surfaces are more brittle than, say, regex, which isn’t a good place to be.
I would quibble that there is a modicum of design in prompting; RLHF, DPO, and ORPO are explicitly designing the models to be more promptable. But the methods don’t yet adequately scale to the variety of user inputs, especially in a customer-facing context.
My preference would be for the field to put more emphasis on control over LLMs, but it seems like the momentum is again on training LLM-based AGIs. Perhaps the Bitter Lesson has struck again.
People are trying to design how to prompt, but it’s very different in both implementation and result than designing a programming language or a visual language, ofc.
Exactly. In other threads on hacker news people have bemoaned the loss of the old weird web. I don't think anyone believed me that the same spirit exists in some sides of TikTok.
My belief is that while eng manager empire building was the easier path to get promoted before 2022, it's not anymore, for two main reasons:
1. HC (headcount) doesn't accrue like that anymore.
2. Many organizations are looking to delayer; it's harder to promote up to director when your org went from 9 rungs to 5.
I hear a lot of the focus going to Tech Lead Manager roles--fewer reports but more hands-on-keyboard time than EM roles of the past.
As I understand it, the Q-hypothesis is often situated within the hypothesis of Marcan priority (Mark was the source for Luke and Matthew), and Q is a way of explaining agreements between Luke and Matthew that are not also found in Mark. The hypothesis would be that Luke and Matthew each combined text from Mark with Q.
I think (but cannot prove) that along the way, it was decided to explicitly measure ability to 'study to the test'. My theory goes that certain trendsetting companies decided that ability to 'grind at arbitrary technical thing' measures on-job adaptability. And then many other companies followed suit as a cargo cult thing.
If it were otherwise, and those trendsetting companies actually believed LeetCode tested programming ability, then why isn't LeetCode used in ongoing employee evaluation? Surely programming ability a) varies over an employee's tenure at a firm and b) is a strong predictor of employee impact over the near term. So I surmise that such companies don't believe this, and that LeetCode therefore serves some other purpose, in some semi-deliberate way.
I do code interviews because most candidates cannot declare a class or variable in a programming language of their choice.
I give a very basic business problem with no connection to any useful algorithm, and explicitly state that there are no gotchas: we know all the inputs, and here’s what they are.
Almost everyone fails this interview, because somehow there are a lot of smooth tech talkers who couldn’t program to save their lives.
I think I have a much lazier explanation. LeetCode-style questions were a good way to test expertise in the past, but by the time everyone starts to follow suit, the test becomes ineffective. What's the saying? When everyone is talking about a stock, it's time to sell. Same thing.
> If it were otherwise, and those trendsetting companies actually believed LeetCode tested programming ability, then why isn't LeetCode used in ongoing employee evaluation?
Probably recent job performance is a stronger predictor of near future job performance.