Because the AI has to do the work of translating the image to text before deciding what to do with it