It's not that simple, but it would be straightforward to duplicate the outputs of this with a simple LLM + ffmpeg workflow. They did mention a custom model on the landing page, and if they've trained one then you would be spending much more money on each output than they are, because without a fine-tuned model there would be a lot of inference done for QA and refinement of each prompt, clip, and frame.
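For what it's worth, here is a rough sketch of what the ffmpeg half of such a workflow could look like. It assumes the LLM side has already produced one still image and one narration file per scene; the function names and file names are made up for illustration, and this is not a claim about how the product actually works. It only builds the command lines, so you would hand them to something like `subprocess.run` yourself.

```python
from typing import List, Sequence


def clip_command(image: str, narration: str, out: str) -> List[str]:
    """Build ffmpeg arguments that show one still image for the length of its narration."""
    return [
        "ffmpeg", "-y",
        "-loop", "1", "-i", image,    # repeat the still image as a video stream
        "-i", narration,              # narration audio for this scene
        "-c:v", "libx264", "-tune", "stillimage", "-pix_fmt", "yuv420p",
        "-c:a", "aac",
        "-shortest",                  # stop at the shortest input, i.e. when the audio ends
        out,
    ]


def concat_list(clips: Sequence[str]) -> str:
    """Build the contents of the list file read by ffmpeg's concat demuxer."""
    return "".join(f"file '{clip}'\n" for clip in clips)


def concat_command(list_file: str, out: str) -> List[str]:
    """Build ffmpeg arguments that join the clips without re-encoding."""
    return ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
            "-i", list_file, "-c", "copy", out]
```

The expensive part is everything upstream of this: generating the script and images, then checking and regenerating them until each scene is usable.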
No, not really. I think they are massively overvalued, but in the tech world... what else is new? I view those tools as mostly a convenience. They are integrating things into nice, easy-to-use packages. That's the value.
With this... eh. Most people don't need to make more than one or two explainer videos, so are they going to take on a new monthly fee for that? And then there are power users who do it all the time, but almost surely have their own workflow put together that is customized to exactly what they want.
At any point, one of the big players could introduce this as a feature for their main product.