"we process the tweets directly in your viewport as you would have seen them."
The primary issue with this approach is, even with the speed of gpt-4o mini, often times you're going to be displaying the "harmful" content for enough time for the brain to process it.
This is especially true when you're dealing with images and short 1 sentence content like twitter.
I think you'll want a safety mode, where nothing is displayed/or you have a css-blur on it, until it has been vetted.
Forgot to mention we actually process ahead of the viewport as well so other than the first load it's highly unlikely you will see an unfiltered tweet (from personal experience of using it). But thanks for the idea at the end I think I'll add that!
The primary issue with this approach is, even with the speed of gpt-4o mini, often times you're going to be displaying the "harmful" content for enough time for the brain to process it. This is especially true when you're dealing with images and short 1 sentence content like twitter. I think you'll want a safety mode, where nothing is displayed/or you have a css-blur on it, until it has been vetted.