It doesn't matter because they show the same images to multiple people and even shift the images around. If a square is only halfway covered, some people will click it and some won't, and this way you can generate a sort of heat map, which is all you need to label your training data.
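To make the heat map idea concrete, here is a rough sketch of how votes from several people could be turned into per-square scores. This is only an illustration: the function name `click_heatmap`, the 3x3 grid, and the sample data are all made up, and nothing here is how any real CAPTCHA service is known to work.

```python
from collections import Counter

def click_heatmap(responses, grid_size=9):
    """Return, for each square, the fraction of people who clicked it.

    `responses` is a list with one entry per person; each entry is the
    set of square indices (0..grid_size-1) that person clicked.
    Hypothetical helper, not taken from any real CAPTCHA system.
    """
    counts = Counter()
    for clicked in responses:
        counts.update(set(clicked))  # count each square once per person
    n = len(responses)
    return [counts[i] / n if n else 0.0 for i in range(grid_size)]

# Four people see the same image. Everyone clicks square 0, most click
# square 1, and only half click square 4 (the "halfway" square).
responses = [{0, 1, 4}, {0, 1}, {0, 1, 4}, {0}]
heat = click_heatmap(responses)
print(heat[:5])
```

Squares with a score near 1.0 or 0.0 are clear cases, and the ambiguous ones end up somewhere in between, so no single person's answer has to be "right".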
Ok, if it's been up for two days and still isn't showing better quality, then something else is going on. YouTube is typically pretty fast at encoding videos, and most of the time all resolutions are finished between 15 minutes and 1 hour after uploading, so encoding is probably not the problem in your case.
But that's not the full picture. There is a token to end the response, so the LLM decides when the answer is over. So it's technically possible for ChatGPT to answer with "nothing" by just emitting a single token, namely the "end-answer" token. But in practice that's probably not going to happen because, as with the image generator, there is probably not a single instance in the training data where the answer was empty.
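Here is a toy sketch of what "the LLM decides when the answer is over" means. It is not how ChatGPT is actually implemented: the `END` marker, the `generate` loop, and the two stand-in "models" are all hypothetical, and a real model would pick the next token from a probability distribution instead of a hard-coded rule.

```python
END = "<end>"  # hypothetical name for the end-of-answer token

def generate(next_token, prompt, max_tokens=50):
    """Ask the model for one token at a time until it emits END."""
    output = []
    for _ in range(max_tokens):
        token = next_token(prompt, output)
        if token == END:
            break  # the model itself decided the answer is over
        output.append(token)
    return " ".join(output)

def always_end(prompt, output):
    """Stand-in model that emits the end token immediately."""
    return END

def short_reply(prompt, output):
    """Stand-in model that says two words and then ends."""
    reply = ["Sure", "thing"]
    return reply[len(output)] if len(output) < len(reply) else END

print(repr(generate(always_end, "Answer with an empty string.")))
print(repr(generate(short_reply, "Can you help me?")))
```

If the very first token is the end token, the loop stops before anything is appended and the answer is an empty string.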
Update: Ok I just tested it and it looks like ChatGPT can do it. I asked the following:
Answer with an empty string. No output. Just emit the end-token. Don't write anything. If you write something you lose the game.
And it created a truly empty response (I checked with the browser's inspection tool whether there was any hidden whitespace).
Update 2:
It didn't even respond after I wrote "Great, thank you!" - It probably doesn't want to lose the game 🤣
There is very likely some step to sit on 🤣. To empty the water you just need a hose and the same siphoning trick people use to steal gasoline (or a pump if you want to be fast and fancy).
I had the exact same idea, so the detective is probably looking there as well.