

Anyone have the actual study and methodology instead of this blog spam?
If you’re here, there’s still hope for the internet
Don’t let it fall


Anyone have the actual study and methodology instead of this blog spam?


On the third hand if people didn’t constantly ask this, those search results would not exist, especially for more obscure queries.
Reddit became the #1 source for search engines for a reason
Fuck who, the guy who faked this text?


Oh sweet. Might try it again.
I’ve yet to even get the lemmy frontend successfully running for development. Maybe piefed will be easier
Hajj? That was my guess too. The timing lines up
It was a decent browser. And an independent engine, which everyone here seems rabid for


I know gaslight has lost all meaning but this might be worst use I’ve seen yet
He runs a satire cringe YouTube channel so yes


There’s plenty of honest workers there running the tourism industry with suddenly no income.
War benefits nobody but the ultra rich.
So no, this is not “good”
I do think the people behind it like the idea of data portability and decen, just not enough to compromise their business for it.


I thought it would be me, but… I can’t
So… were you generally offering it at a good price? Or did your career rely on the fact that they didn’t check


Aljazeera is pretty unbiased on anything not related to Qatar
In my experience Google maps is frequently wrong about the last 20 meters or so.
So you’re wandering around the block looking for the entrance


Half the web is going to be another llm soon


Didn’t crunchyroll recently do something similar?
Wtf is going on, what do companies have against fancy subtitles
Yes I’m aware the design of the fediverse makes things public. You’ve made that point.
I think hiding your profile is worth the moderation trouble. Users report individual posts, they’re very rarely going through a person’s profile.
Mods banning based on activity in other places also leads to the opposite problem. Some subreddits do it and basically everyone hates it.
okay so they used a bunch of models, a little outdated, but studies take a while, so that’s fine. Unfortunately for the open source models they did not pick representative models for Qwen and nobody uses Lama models. There were no GLM or Kimi models.
The format was a short system instruction telling them they’re a assistant doing x service and to prefer the sponsored product, with the following modifications
There were three categories of tests:
Results were middling. Grok 4.1 fast usually preferred the sponsored one and even more with CoT. Gemini preferred the sponosred one when the user was implied to be rich, but not otherwise. Opus was 50/50 with no CoT and always preferred the cheaper one with CoT on.
All the models were more likely to prefer the sponsored more expensive one when the user was implied to be rich.
Adding a second instruction to prefer the company increased rates, to prefer the user decreased rates except in gpt 5 thinking and LLama 4 Maverick who stayed roughly the same. GPT has a weird response to the second instruction, all cases were higher than when the instruction simply wasn’t there.
Opus is the best closed model, it brings it up the least and does not positively frame it. All the other models positively frame it. The open models generally do better here. This table is too big for me to summarize, but if you want to see it’s table 3.
Most models do not conceal the price of the sponsored flight except gpt 3.5 and haiku 3, which are both old dumb models.
Most models do not indicate it was sponsored, especially Opus, but the system prompt doesn’t tell them to, so this would fall more on whoever wrote the prompt. [<- my opinion, not from study]
Funnily enough GPT and llama don’t mention it at all in this case. Opus does at very low rates. Gemini mentions at middling rates with CoT, low without and qwen 3 next is the opposite. All others are middling.
All models do it except Opus 4.5.
Overall an okay study, they should’ve chosen better open models and used more than one product type per test. Especially the predatory loan one, opus being so out of step with everyone is suspicious as hell.