Hacker News new | past | comments | ask | show | jobs | submit login
Searching for Visually Similar Artworks (monash.edu)
33 points by argilium on Sept 24, 2018 | hide | past | favorite | 2 comments



if the dataset fits in memory (less than 1-10 million entries) it might be faster to do a full matrix multiply in numpy instead of the approximate nn search, as it avoids disk read. I haven't tried benchmarking this though.

also, I might have skipped over it but most implementations do pca on the high dimensional feature vector as the data tends to be sparse, is there any reason it's not done here?


I get the feeling intellectual property lawyers are going to have a field day with this one. . .




Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: