Last week, our team released a way to quickly turn podcasts into social audiograms. There's no sign-up necessary and it's entirely free to use! We're trying to get feedback and would love your input!
Audiograms have been a popular way to create a podcast trailer to share on social media.
If you have a podcast or a video event series, we would love your feedback!
Feel free to comment here and I'll follow along, or email me directly at lenny@milkvideo.com
Cool app. The transcript was better than I expected.
My experience was quite negative though.
I tried to transcribe the Knife Game Song, but when the song speeds up, the tool skipped some words and I couldn't insert them cleanly. Then I couldn't fix my mistakes, because the editor can't handle more than five words at a time and automatically selects more than I intend.
I fought with the UI for more than 10 minutes, and when I tried to download the video to get the (admittedly imperfect) result of my hard work, it requested my email (contrary to what you said ^^), so I couldn't download it.
The timing of the word highlighting is almost right, but the viewing experience improves noticeably when the words light up at exactly the right moment.
It still needs some UI bug hunting to be usable, but it's on the right track.
Looks pretty great! Oddly enough, I came to HN to take a break from coding something eerily similar (different use case and target audience though). Now when I launch in a month everyone will think I stole your idea. Thanks for that, I guess, and good luck!
I was just looking for something like this. We've had several podcasts recently, but sharing the entire episode to social media doesn't get the same engagement. Can't wait to try this. Thanks!
Compared to other audiogram tools I've used, this is really easy. Processing time is a bit long, but I did test it with a 2-hour podcast.
I have a couple of feature suggestions that I think would be really helpful to podcast producers. Our show has mp3 chapters, and we also include matching time codes in our show notes. If you could read one (or both) of those, you could suggest segments for users to clip out.
It would also be great if you could read an RSS feed and use that as an input, in addition to uploading single files.
To confirm I understand them, I've restated them below, and I'd love a way to get in touch with you.
1. You want to be able to create clips based on parsing show notes. Since you already do the work to identify key moments, we could use that to speed up your clip making process.
2. You want to import content based on the RSS feed. This will make it easier to get content in the tool, and avoid the "where is the file" rigmarole.
Separately, the processing time is slow. We have a way to improve this, and it's in the queue!
Correct! But also, on 1, there's a way to embed chapters into an mp3 file that's part of the ID3 standard. We use Forecast (https://overcast.fm/forecast) to do it. I've found that most podcast applications parse these embedded chapters or the timecodes in show notes (or support both), so that would be another way to identify the moments we, as producers, have called out in the file.
For 2, the RSS feed would include show notes and greatly speed up the process for bulk production.
If you'd like to get in touch with me, I'm on Twitter as @yakk0dotorg.
Thanks again!
It's always magical to see a use case be elevated to a built-in feature. I was doing exactly this with Milk.video for a few podcasts [1]. I'm excited to try this out!
Nice work! The UI is really simple - and love not having to log in to use it. Have you thought about leveraging the ListenNotes API (https://www.listennotes.com/api/) to automatically pull in the podcast episodes via search vs having to upload them?
Impressive. Didn't expect it to be automatically transcribed and captioned. There are other tools like this around that make waveform audiograms. Could the text optionally be turned off?
Anyway, very nice for a free clip maker with logo support and no watermark.
There are three main parts to note: designing the videos, the backend for handling media, and the download renderer.
The video design flow is a React app backed by a Ruby on Rails API. The React app handles the views, and if you look at the UI, you'll see how the app steps are persisted in the URL. The React app is built on Redux Toolkit, which is phenomenal. The Rails application is a normal API with Sidekiq (Redis-backed) workers, which handle asynchronous tasks.
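To make that concrete, here's a minimal sketch (not our actual code; the slice name, steps, and fields are hypothetical) of a Redux Toolkit slice with the current step mirrored into the URL:

    import { configureStore, createSlice, PayloadAction } from "@reduxjs/toolkit";

    // Hypothetical slice tracking which step of the editor the user is on.
    const editorSlice = createSlice({
      name: "editor",
      initialState: { step: "upload" as "upload" | "clip" | "design" | "export" },
      reducers: {
        stepChanged(state, action: PayloadAction<"upload" | "clip" | "design" | "export">) {
          state.step = action.payload;
        },
      },
    });

    export const { stepChanged } = editorSlice.actions;
    export const store = configureStore({ reducer: { editor: editorSlice.reducer } });

    // Mirror the current step into the URL so app state survives reloads and sharing.
    store.subscribe(() => {
      const url = new URL(window.location.href);
      url.searchParams.set("step", store.getState().editor.step);
      window.history.replaceState(null, "", url);
    });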
Our UI renders a number of interesting elements, each generated or prepared by its own service-oriented pipeline. The most important is our transcript API, which is from https://www.AssemblyAI.com - they're the best transcription tool we've found: cheapest, highest quality, and with the best developer experience. We also have a series of Lambda functions that handle uploaded audio/video file prep, so we can encode files in a unified format and parse out the audio data needed for visualizations like the waveform and the animated audio frequency display.
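For flavor, AssemblyAI's v2 REST flow boils down to "submit an audio URL, then poll until the transcript is ready." A rough sketch (error handling trimmed; not our production code):

    const API = "https://api.assemblyai.com/v2";
    const headers = {
      authorization: process.env.ASSEMBLYAI_API_KEY!,
      "content-type": "application/json",
    };

    async function transcribe(audioUrl: string) {
      // Kick off the transcription job.
      const submit = await fetch(`${API}/transcript`, {
        method: "POST",
        headers,
        body: JSON.stringify({ audio_url: audioUrl }),
      });
      const { id } = await submit.json();

      // Poll until done; word-level timestamps come back in `words`.
      while (true) {
        const res = await fetch(`${API}/transcript/${id}`, { headers });
        const transcript = await res.json();
        if (transcript.status === "completed") return transcript;
        if (transcript.status === "error") throw new Error(transcript.error);
        await new Promise((resolve) => setTimeout(resolve, 3000));
      }
    }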
A few interesting tidbits: we use Lambda Layers extensively. We have functions written in Ruby and JavaScript where we move the vendored Gems or node_modules into a shared Lambda Layer, and we also use EFS to run Python-based functions whose dependencies are too big for the Lambda package itself.
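We won't share our actual infra code here, but as a hypothetical AWS CDK sketch (the stack, asset paths, and function names are made up), sharing node_modules via a layer looks something like:

    import * as cdk from "aws-cdk-lib";
    import * as lambda from "aws-cdk-lib/aws-lambda";

    export class WorkersStack extends cdk.Stack {
      constructor(scope: cdk.App, id: string) {
        super(scope, id);

        // Node layers expect their contents under nodejs/node_modules/ in the asset.
        const deps = new lambda.LayerVersion(this, "SharedDeps", {
          code: lambda.Code.fromAsset("layers/node-deps"),
          compatibleRuntimes: [lambda.Runtime.NODEJS_18_X],
        });

        // Each function ships only its own handler; dependencies ride along in the layer.
        new lambda.Function(this, "AudioPrep", {
          runtime: lambda.Runtime.NODEJS_18_X,
          handler: "index.handler",
          code: lambda.Code.fromAsset("functions/audio-prep"),
          layers: [deps],
        });
      }
    }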
Our video rendering is also pretty neat, in that we use the browser as a rendering surface and batch-process screenshots of each frame of the final output video, using an AWS-based container orchestration process.
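In spirit, the render loop looks like the sketch below (simplified; the page URL, the window.seekTo hook, and the paths are invented for illustration): drive a headless browser to each frame's timestamp, screenshot it, then stitch the frames together with ffmpeg.

    import { spawnSync } from "node:child_process";
    import puppeteer from "puppeteer";

    const FPS = 30;
    const DURATION_SEC = 10; // hypothetical clip length

    async function render() {
      const browser = await puppeteer.launch();
      const page = await browser.newPage();
      await page.setViewport({ width: 1080, height: 1080 });
      await page.goto("https://example.com/design/abc123"); // hypothetical design page

      for (let frame = 0; frame < FPS * DURATION_SEC; frame++) {
        // Advance the design to this frame's timestamp, then capture it.
        await page.evaluate((t) => (window as any).seekTo(t), frame / FPS);
        await page.screenshot({ path: `frames/frame_${String(frame).padStart(5, "0")}.png` });
      }
      await browser.close();

      // Stitch the numbered frames into a video.
      spawnSync("ffmpeg", [
        "-framerate", String(FPS),
        "-i", "frames/frame_%05d.png",
        "-pix_fmt", "yuv420p",
        "out.mp4",
      ]);
    }

    render();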
In summary, this tool builds on the work we've been doing for our larger product. Since we can leverage that, we're spinning up a number of single-purpose utility projects based on what customers ask for.
If this is interesting at all, we're hiring advanced JavaScript engineers who are comfortable learning new things.
You can reach me at lenny@milkvideo.com or if you just want to chat to learn more/ask questions, please feel free to book time here: https://calendly.com/rememberlenny/15-min
I was primarily curious about the video generation system. There are a number of ways to generate video programmatically, ranging from pure ffmpeg with image and text layers, to a headless browser that spits out canvas-rendered frames which an ffmpeg (or equivalent) process then aggregates into a video. There are also Python libraries like moviepy which offer their own APIs for creating layers. Each of these has its own performance characteristics, and I was curious whether there's a de facto best approach for this sort of thing, where someone has evaluated all of these options and settled on one after weighing the tradeoffs.
This tool is just a small piece of our overall product.
In the actual application, you can create content (like this audiogram) once, and then export it at various dimensions without having to redo the design.
Thank you!