Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is the "same-segment-repeat" check across multiple episodes? Or just a single episode has the same segment repeated a few twice, so it works for this specific network?


The current implementation searches only within a single episode. Searching across multiple episodes could be done but would introduce false positives if a show re-uses intro music or other assets, and personally I’d prefer to keep that stuff, so I’d need to figure out heuristics to differentiate it from ads.


Some interview podcasts start with a 3-4 second clip from the interview. Does this cause false positives for you?


The OP said elsewhere they are using this[1] library, which allows you to specify minimum seconds to match, so you'd presumably set it to match 20 seconds or whatever minimum length podcast commercials usually are.

Most other audio fingerprinting libraries I've seen allow you to specify min/max time, as well.

HTH.

1. https://github.com/AddictedCS/soundfingerprinting


Exactly- 20 seconds is the number I arrived at after a bit of trial and error.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: