Because the AI has to do the work of translating the image to text before deciding what to do with it

  • S13Ni@lemmy.studio
    link
    fedilink
    arrow-up
    12
    ·
    20 hours ago

    So start normalizing using ffmpeg to type in whatever you want to say, and render it as a video with just static text on white background to make it even more expensive?