• Natanael@infosec.pub
    link
    fedilink
    arrow-up
    1
    ·
    8 个月前

    Not necessarily, but errors would be less obvious or weirder since it would spend more time in training

      • Natanael@infosec.pub
        link
        fedilink
        arrow-up
        3
        ·
        8 个月前

        Weirder in that it gets better at “photorealism” (textures, etc) but subjects might be nonsensical. Only teaching it how to avoid automated detection will not teach it to understand what scenes mean.

        • LarmyOfLone@lemm.ee
          link
          fedilink
          arrow-up
          1
          ·
          8 个月前

          I believe most image generating models are too small (like only 4GB RAM). Deepseek R1 is 1.5TB ram (or half or quarter that at reduced precision) to get some semblance of “general knowledge”. So to get the “semantics” of an image right, not just the “syntax” you’d need bigger models and probably more data describing images. Of course, do we really want that?