For student assignment cheating, removing the em dashes only takes care of the em dashes; the other tells would still be in the output. There are specific words and turns of phrase, specific constructions (e.g., 'it's not just x, but y'), and commonly used word choices. Really it's just a prim and proper corporate press release voice -- not a usual university student's writing voice. I'm actually quite sure you'd be able to easily pick out a first-pass AI-generated student assignment, em dashes removed, from a set of legitimate assignments, especially if you are a native English speaker. You may not be able to explain it systematically, but native-speaker intuition can do it surprisingly well.
What AI detectors have largely done is try to formalize that intuition. They work pretty well on simple adversaries (basically, the laziest students), but a more sophisticated user will do first, second, and third passes to change the voice.
Because of the way regurgitation works. "You're absolutely right" primes the next tokens to treat whatever preceded that as gospel truth, leaving no room for critical approaches.
No. No one is looking for em-dashes, except for some bozos on the internet. The "default voice" of all mainstream LLMs can be easily detected by looking at the statistical distribution of word / token sequences. AI detector tools work and have very low false negatives. They have some small percentage of false positives because a small percentage of humans pick up the same writing habits, but that's not relevant here.
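As a toy illustration of that statistical angle (real detectors score full token distributions under a reference language model; this hard-coded phrase list is just a stand-in for the example):

    # Toy sketch only: real detectors model token-sequence statistics;
    # this hard-coded telltale list is an assumption for illustration.
    TELLTALES = [
        "it's not just",
        "delve into",
        "plays a crucial role",
        "you're absolutely right",
    ]

    def telltale_score(text: str) -> float:
        """Rough 'default LLM voice' score: telltale hits per word."""
        t = text.lower()
        hits = sum(t.count(phrase) for phrase in TELLTALES)
        return hits / max(len(t.split()), 1)

    print(telltale_score("You're absolutely right -- it's not just style, but substance."))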
The "humanizer" filters will typically just use an LLM prompted to rewrite the text in another voice (which can be as simple as "you're a person in <profession X> from <region Y> who prefers to write tersely"), or specifically flag the problematic word sequences and ask an LLM to rephrase.
They most certainly don't improve the "correctness" and don't verify references, though.
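For what it's worth, the rewrite pass can really be that thin. A minimal sketch, assuming the OpenAI chat-completions client; the model name and prompt wording are made up for illustration:

    # Minimal sketch of the persona-rewrite approach described above.
    # Model name and prompt are assumptions, not any specific product.
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    def humanize(text: str, profession: str, region: str) -> str:
        resp = client.chat.completions.create(
            model="gpt-4o-mini",  # hypothetical model choice
            messages=[
                {"role": "system",
                 "content": f"You're a person in {profession} from {region} "
                            "who prefers to write tersely. Rewrite the user's "
                            "text in your own voice. Do not add facts."},
                {"role": "user", "content": text},
            ],
        )
        return resp.choices[0].message.content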
It appears ChromeOS is being killed and much of its feature set is being ported into Android. The result may still be marketed as "ChromeOS", with identical functionality, and consumers will be none the wiser.
My Moto Edge 2024 has "Ready For", which is basically this, still today. I plug in the USB-C cable normally connected to my work MacBook and instantly get a full desktop experience: mouse, keyboard, and sound included.
It's how I play Minecraft with my kids when they get the itch. Sometimes if I know I'm only gonna be zoning out on YouTube at night I'll use it to save a few watts too.
It can do 1440p at 120 Hz, all on a really affordable phone. It's nice.
Phones were way less powerful 15 years ago and native software was much more important. A modern phone CPU running a browser on a larger screen takes care of a lot of what you need these days.
I've only used it when I'm in a pinch, but it's handy. Blowing up mobile apps to a larger screen and multitasking certainly isn't ideal, but I've been able to handle "email job" type activities while out of pocket. That said, I've never heard of anyone else who's actually used it.
Internet censorship is more of a reality and a problem than it felt at the dawn of the age of cheap wireless broadband. I can certainly see the value in local Wikipedia copies if internet blocks, age gates, etc. need to be contended with.
I guess the first question I have is whether these problems solved by LLMs are just low-hanging fruit that human researchers either didn't get around to or never showed much interest in - or whether there's real substance to the idea that LLMs can independently conduct original research and solve hard problems.
That's the first warning from the wiki: "Erdős problems vary widely in difficulty (by several orders of magnitude), with a core of very interesting, but extremely difficult problems at one end of the spectrum, and a 'long tail' of under-explored problems at the other, many of which are 'low hanging fruit' that are very suitable for being attacked by current AI tools." https://github.com/teorth/erdosproblems/wiki/AI-contribution...
There is still value in letting these LLMs loose on the periphery to knock out all the low-hanging fruit humanity hasn’t had time to get around to. Also, I don’t know this, but if a problem is on the Erdős list I presume people have tried to solve it at least a little before it made it there.
Is there, though? If they are "solved" (as in, the tickbox marks them as such through a validation process, e.g. another model confirming, a formal proof passing, etc.) but no human actually learns from them, what's the benefit? Completing a list?
I believe the ones that are NOT studied are unstudied precisely because they are seen as uninteresting. Even if they were solved in an interesting way, if nobody reads the proofs because there are just too many of them and they are, again, not considered valuable, then I don't see what is gained.
Some problems are ‘uninteresting’ in that they show results that aren’t immediately seen as useful. However, solutions may end up having ‘interesting’ connections or ideas or mathematical tools that are used elsewhere.
More broadly, I think there’s a perspective that literally just building out thousands more true statements in Lean is going to keep cementing math’s broadening knowledge framework. This is not building a giant castle à la Wiles, it’s laying bricks in the outhouse, but someday those bricks might be useful.
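For a sense of scale, a single such brick can be a one-line Lean theorem. A made-up, trivially small example (not related to any Erdős problem):

    -- A trivially small "brick": a one-line true statement checked by Lean 4.
    -- Made-up example for illustration only.
    theorem add_one_pos (n : Nat) : 0 < n + 1 :=
      Nat.succ_pos n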
Phind was the first AI search I used as well. But they seemed to be quickly outfoxed by Perplexity. I started using Perplexity after it was recommended to me as having fewer hallucinations - now it can integrate its tools with SOTA models like Opus.
Would love to know the thought process and rationale of whoever underwrote that policy. My experience suggests you should never trust unsupervised LLMs for anything life or mission critical.
Having to prime it with more context and more guardrails seems to imply they're getting worse. That's context and guardrails it can no longer infer/intuit on its own.
No, they are not getting worse. Again, look at METR task times.
The peak capability is very obviously, and objectively, increasing.
The scaffolding you need to elicit top performance changes each generation. I feel it takes less scaffolding now to get good results. (Lots of the “scaffolding” these days is less “contrived AI prompt engineering” and more “well understood software engineering best practices”.)
Why the downvotes? This comment makes sense. If you need to write more guardrails, that does increase the work, and at some point the amount of guardrails needed to make these things work in every case becomes impractical. I personally don't want my codebase filled with babysitting instructions for code agents.
* grep to remove em dashes and emojis

* re-run through another LLM with a prompt to remove excessive sycophancy and invalid URL citations (rough sketch of both passes below)
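A minimal sketch of that two-pass cleanup, in Python; the emoji ranges and prompt wording are assumptions, and call_llm stands in for whatever chat API is in use:

    import re

    # Pass 1: the "grep" step, as regex substitutions.
    def strip_tells(text: str) -> str:
        text = re.sub(r"\s*\u2014\s*", " - ", text)  # em dashes -> plain dash
        text = re.sub(r"[\U0001F300-\U0001FAFF\u2600-\u27BF]", "", text)  # common emoji blocks
        return text

    # Pass 2: a narrow cleanup prompt for a second model.
    CLEANUP_PROMPT = (
        "Rewrite this text. Remove flattery and filler. "
        "Delete any URL or citation you cannot verify. Change nothing else."
    )

    def second_pass(text: str, call_llm) -> str:
        # call_llm(system=..., user=...) is a stand-in for any chat API
        return call_llm(system=CLEANUP_PROMPT, user=text)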