A whole load of data structures, like Bloom filters, are 'efficiency tricks'.
For example, "check if we might have Mr Smith's data cached by looking in this Bloom filter". As long as the answer is usually right, that's good enough.
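To make that concrete, here's a minimal Bloom filter sketch in Python (purely illustrative, not anyone's production code). The key property is that a check can return a false positive, but never a false negative, so "usually right" is exactly what you get.

```python
import hashlib

class BloomFilter:
    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = [False] * num_bits

    def _positions(self, key):
        # Derive several bit positions from one digest; real implementations
        # would use faster, independent hash functions.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).hexdigest()
            yield int(digest, 16) % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = True

    def might_contain(self, key):
        # False -> definitely not cached; True -> probably cached.
        return all(self.bits[pos] for pos in self._positions(key))

cache_filter = BloomFilter()
cache_filter.add("Mr Smith")
if cache_filter.might_contain("Mr Smith"):
    print("Probably cached - worth doing the expensive lookup")
```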
I do wonder if in the future we'll use a mini online-trained AI to achieve the same thing. "Ask the AI if we have Mr Smith's data cached".
Rather like you might ask an office clerk "Is Mr Smith that troublesome customer you were dealing with last week? Do you still have his file on your desk?"
The way that would work is that the LLM would translate that sentence into a tool call, which would query a data store that does the heavy lifting. Also, there is an ML task called "Learning to hash", which is about learning hash functions tailored to the data and workload rather than using generic ones: https://learning2hash.github.io/
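One concrete shape the "mini AI" idea can take is what the literature sometimes calls a learned Bloom filter: a small model scores the key first, and a conventional backup structure catches the members the model misses, so there are still no false negatives. The sketch below is purely illustrative; the scoring function is a stand-in for a real online-trained model and the backup is just a set.

```python
def model_score(key):
    # Stand-in for a small online-trained model; here just a toy heuristic
    # that "remembers" a prefix it has seen a lot.
    return 0.9 if key.startswith("Mr Smith") else 0.1

backup_filter = {"Mrs Jones"}  # members the model tends to score too low

def might_be_cached(key, threshold=0.5):
    if model_score(key) >= threshold:
        return True                 # model says "probably cached"
    return key in backup_filter     # backup catches the model's misses

print(might_be_cached("Mr Smith"))   # True via the model
print(might_be_cached("Mrs Jones"))  # True via the backup
print(might_be_cached("Ms Doe"))     # False -> definitely take the slow path
```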
I would imagine a heuristic that keeps enough of the filter in L2 cache to avoid pipeline stalls might be useful. Sort of a double Bloom filter, but weighted toward common lookups.
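A rough sketch of what that "double bloom" might look like: a small front bit array sized so it could plausibly stay resident in L2 cache, holding only frequently looked-up keys, with the full-size filter behind it. The sizes, the single hash function, and the promotion threshold below are all invented for illustration.

```python
import zlib

SMALL_BITS = 8 * 32 * 1024        # 32 KiB front array, cache-friendly
LARGE_BITS = 8 * 8 * 1024 * 1024  # 8 MiB main array, lives in RAM

small = bytearray(SMALL_BITS // 8)
large = bytearray(LARGE_BITS // 8)
hits = {}

def _bit(key, num_bits):
    # Single hash for brevity; a real Bloom filter would use several.
    return zlib.crc32(key.encode()) % num_bits

def _set(arr, key, num_bits):
    b = _bit(key, num_bits)
    arr[b // 8] |= 1 << (b % 8)

def _test(arr, key, num_bits):
    b = _bit(key, num_bits)
    return bool(arr[b // 8] & (1 << (b % 8)))

def add(key):
    _set(large, key, LARGE_BITS)

def might_contain(key):
    if _test(small, key, SMALL_BITS):
        return True                        # answered from the cache-resident array
    hits[key] = hits.get(key, 0) + 1
    present = _test(large, key, LARGE_BITS)
    if present and hits[key] >= 3:         # promote hot keys to the small array
        _set(small, key, SMALL_BITS)
    return present
```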