A screenshot of this question was making the rounds last week, but this article covers testing it against all the well-known models out there.

Also includes outtakes on the ‘reasoning’ models.

  • SuspciousCarrot78@lemmy.world
    14 hours ago

    I see what the issue is. Basic reasoning and logic seem artificial to you. Telling.

    Of course it’s bad faith. You claimed you were open to reasoned debate and then you tried to prompt inject to see if I was a bot.

    But not being able to distinguish an LLM from a human in a reasoning debate? That rather undermines the entire "LLMs are just spicy autocomplete" point.

    • zalgotext@sh.itjust.works
      23 hours ago

      You’re not gonna convince me, and I’m not gonna convince you. I’m done with this conversation before you devolve further into personal attacks.