OK, I misunderstood there. Running the code, which generates 15 images, on a standard GPU Colab environment takes about 2 minutes. It may be possible to submit a single batch of text summaries to the DALL-E mini model, which would improve performance a good deal.