Farmdude@lemmy.world to

Ask Lemmy@lemmy.world · 5 months ago

Can we trust LLM CALCULATIONS?.

32

Can we trust LLM CALCULATIONS?.

Farmdude@lemmy.world to

Ask Lemmy@lemmy.world · 5 months ago

Ok, you have a moderately complex math problem you needed to solve. You gave the problem to 6 LLMS all paid versions. All 6 get the same numbers. Would you trust the answer?

Chat

Pika@sh.itjust.works
link
fedilink
English
arrow-up
2·
5 months ago
Just yesterday I was fiddling around with a logic test in python. I wanted to see how well deepseek could analyze the intro line to a for loop, it properly identified what it did in the description, but when it moved onto giving examples it contradicted itself and took 3 or 4 replies before it realized that it contradicted itself.

Ask Lemmy@lemmy.world

asklemmy@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

A Fediverse community for open-ended, thought provoking questions

Rules: (interactive)

1) Be nice and; have fun

Doxxing, trolling, sealioning, racism, and toxicity are not welcomed in AskLemmy. Remember what your mother said: if you can’t say something nice, don’t say anything at all. In addition, the site-wide Lemmy.world terms of service also apply here. Please familiarize yourself with them

2) All posts must end with a '?'

This is sort of like Jeopardy. Please phrase all post titles in the form of a proper question ending with ?

3) No spam

Please do not flood the community with nonsense. Actual suspected spammers will be banned on site. No astroturfing.

4) NSFW is okay, within reason

Just remember to tag posts with either a content warning or a [NSFW] tag. Overtly sexual posts are not allowed, please direct them to either [email protected] or [email protected]. NSFW comments should be restricted to posts tagged [NSFW].

5) This is not a support community.

It is not a place for ‘how do I?’, type questions. If you have any questions regarding the site itself or would like to report a community, please direct them to Lemmy.world Support or email [email protected]. For other questions check our partnered communities list, or use the search function.

6) No US Politics.

Please don’t post about current US Politics. If you need to do this, try [email protected] or [email protected]

Reminder: The terms of service apply here too.

Partnered Communities:

No Stupid Questions

You Should Know

Logo design credit goes to: tubbadu

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1.76K users / day
5.78K users / week
10.2K users / month
20.4K users / 6 months
1 local subscriber
38.3K subscribers
8.54K Posts
464K Comments
Modlog