Really!
I always imagine a system:
Composed of 5 specialized computers.
One for each category: audio, text, video, image.
One analyzer to coordinate everything.
This would be my API that I could access with mobile devices.
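To make the layout concrete, here is a minimal sketch of the analyzer as a router in front of the four specialists. Everything in it (the `Analyzer` class, the `Request` type, the stub handlers) is a hypothetical name made up for illustration; a real specialist would run a model rather than return a string.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# The four categories named in the post.
CATEGORIES = ("audio", "text", "video", "image")


@dataclass
class Request:
    category: str  # expected to be one of CATEGORIES
    payload: str   # stand-in for real audio/text/video/image data


class Analyzer:
    """Hypothetical coordinator: forwards each request to the matching specialist."""

    def __init__(self) -> None:
        self.specialists: Dict[str, Callable[[str], str]] = {}

    def register(self, category: str, handler: Callable[[str], str]) -> None:
        if category not in CATEGORIES:
            raise ValueError(f"unknown category: {category}")
        self.specialists[category] = handler

    def handle(self, request: Request) -> str:
        handler = self.specialists.get(request.category)
        if handler is None:
            raise LookupError(f"no specialist registered for {request.category}")
        return handler(request.payload)


def make_stub(name: str) -> Callable[[str], str]:
    # Placeholder specialist (assumption): just tags the payload with its name.
    return lambda payload: f"[{name}] processed {payload}"


analyzer = Analyzer()
for category in CATEGORIES:
    analyzer.register(category, make_stub(category))
```

A phone client would then only ever talk to `analyzer.handle(...)`, which is the "API" part of the idea.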
Here's a scenario:
I could talk to my phone about ideas, and in the background it would create app prototypes, design posters, make music based on something I whistle, and teach me as I ask questions about a topic.
With the architectures currently in use, it doesn't make sense to have 5 specialized, dedicated computers. "AI" text processing, "AI" video processing, and the others use very similar architectures, so there's no benefit from specialization: the "video-specialized" hardware would be just as good at processing text, and vice versa.
A processor has different cores, and a computer may have several hard drives and 4 sticks of RAM.
Each component can run in parallel. For example, if a long video processing task is underway and your text generation component is idle, it could assist.
Should the audio component fail, only that specific part would be affected.
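The work-sharing and failure-isolation points can be sketched as a small scheduler. This is only an illustration under one loud assumption: that the components are interchangeable enough for an idle one to take over another's task. The `Component` and `Pool` names and the `busy`/`healthy` flags are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass
class Component:
    name: str
    busy: bool = False
    healthy: bool = True


class Pool:
    """Hypothetical scheduler over the specialized components."""

    def __init__(self, names: Iterable[str]) -> None:
        self.components: Dict[str, Component] = {n: Component(n) for n in names}

    def pick(self, category: str) -> Optional[Component]:
        # Prefer the component dedicated to this category.
        preferred = self.components.get(category)
        if preferred and preferred.healthy and not preferred.busy:
            return preferred
        # Otherwise borrow any idle, healthy component (assumes interchangeability).
        for component in self.components.values():
            if component.healthy and not component.busy:
                return component
        return None

    def submit(self, category: str, task: str) -> str:
        worker = self.pick(category)
        if worker is None:
            return f"{task}: queued (no idle component)"
        worker.busy = True
        return f"{task}: running on {worker.name}"

    def mark_failed(self, name: str) -> None:
        # A failure only flags one component; the rest keep accepting work.
        self.components[name].healthy = False
```

With this shape, a second video job submitted while the video component is busy lands on whichever other component is idle, and calling `mark_failed("audio")` takes only the audio component out of rotation.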
We could delegate the mundane stuff to it.