I'm not sure whether the videos are representative of real-life performance or just a marketing stunt, but it sure looks impressive. Reminds me of the robot arm in Iron Man 1.
It's an impressive demo, but perhaps you're misremembering Jarvis from Iron Man, which is not only far faster but effectively a full AGI system even at that point in the story.
Sorry if this feels pedantic; perhaps it is. But it's the kind of analogy that invites pedantry from fans of that movie.
The robot arms in the movie are implied to have their own AIs driving them; Tony speaks to the malfunctioning one directly several times throughout the movie.
Jarvis is AGI, yes, but is not what's being referred to here.
Not specifically trained on it, but the vision models have most likely seen it. Vision models like Gemini Flash/Pro are already good at vision tasks on phones[1], like clicking on UI elements and scrolling to find things. The planning of which steps to perform is also quite good with the Pro model (slightly worse than GPT-4o, in my opinion).
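For what it's worth, the "clicking on UI elements" part of these agents usually boils down to parsing a coordinate out of the model's text reply and mapping it onto the screen. Here's a minimal sketch of that glue step; the `click(x, y)` reply format and the `parse_click` helper are assumptions for illustration, not any specific vendor's API:

```python
import re

def parse_click(model_reply: str, screen_w: int, screen_h: int):
    """Parse a hypothetical vision-model reply like 'click(0.50, 0.25)'
    (normalized 0-1 coordinates) into absolute screen pixels."""
    m = re.search(r"click\(\s*([0-9.]+)\s*,\s*([0-9.]+)\s*\)", model_reply)
    if not m:
        return None  # model didn't produce a click action
    x, y = float(m.group(1)), float(m.group(2))
    return round(x * screen_w), round(y * screen_h)

# e.g. on a 1080x2400 phone screen:
print(parse_click("click(0.5, 0.25)", 1080, 2400))  # -> (540, 600)
```

The coordinates would then be fed to something like `adb shell input tap`, with a fresh screenshot sent back to the model for the next step of the loop.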