I was primarily curious about the video generation system. There are a number of ways to generate video programmatically, ranging from pure ffmpeg with image and text layers to a headless browser that renders frames on a canvas, with ffmpeg or an equivalent process stitching them into a video. There are also Python libraries like moviepy, which offer their own API for composing layers. Each approach has its own performance characteristics, and I was curious whether there is a de facto best approach for this sort of thing: one that someone settled on after evaluating all of these options and weighing the tradeoffs.
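To make the "frames in, ffmpeg aggregates" option concrete, here is a minimal sketch of what that pipeline might look like when the frames come from Python rather than a headless browser. It is only an illustration under stated assumptions: the helper names (`ffmpeg_rawvideo_cmd`, `solid_frame`, `render`) are made up for this example, the frames are flat colors standing in for real rendered content, and it assumes an `ffmpeg` binary on the PATH built with the `libx264` encoder.

```python
import subprocess


def ffmpeg_rawvideo_cmd(width, height, fps, out_path):
    """Build an ffmpeg command that reads raw RGB frames from stdin.

    Hypothetical helper for illustration; the flags are standard ffmpeg options.
    """
    return [
        "ffmpeg", "-y",
        "-f", "rawvideo",          # input is headerless raw frames
        "-pix_fmt", "rgb24",       # 3 bytes per pixel: R, G, B
        "-s", f"{width}x{height}", # frame size must be stated for raw input
        "-r", str(fps),            # input frame rate
        "-i", "-",                 # read from stdin
        "-c:v", "libx264",         # assumption: this encoder is available
        "-pix_fmt", "yuv420p",     # widely playable output pixel format
        out_path,
    ]


def solid_frame(width, height, rgb):
    """Return one flat-color rgb24 frame as bytes (stand-in for a real renderer)."""
    return bytes(rgb) * (width * height)


def render(out_path, width=64, height=48, fps=30, n_frames=60):
    """Pipe a red-to-blue fade into ffmpeg and return its exit code."""
    cmd = ffmpeg_rawvideo_cmd(width, height, fps, out_path)
    proc = subprocess.Popen(cmd, stdin=subprocess.PIPE)
    for i in range(n_frames):
        level = int(255 * i / max(n_frames - 1, 1))
        proc.stdin.write(solid_frame(width, height, (255 - level, 0, level)))
    proc.stdin.close()  # signals end of input so ffmpeg can finalize the file
    return proc.wait()
```

Calling `render("out.mp4")` should produce a two-second clip if ffmpeg is installed. A headless-browser variant would presumably keep the same shape and swap `solid_frame` for a canvas screenshot, which is where most of the performance differences between these approaches would likely show up.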