@BetaDoggo_

BetaDoggo_@lemmy.world · 1 day ago

We’re Costco guys

BetaDoggo_@lemmy.world · 1 day ago

I’d guess the 3 key staff members leaving all at once without notice had something to do with it.

BetaDoggo_@lemmy.world · 18 days ago

This is actually pretty smart because it switches the context of the action. Most intermediate users avoid clicking random executables by instinct, but this is different enough that it doesn’t immediately trigger that association and response.

BetaDoggo_@lemmy.world · edit-2 19 days ago

All signs point to this being a finetune of GPT-4o with additional chain-of-thought steps before the final answer. It has exactly the same pitfalls as the existing model (9.11>9.8 tokenization error, failing simple riddles, being unable to assert that the user is wrong, etc.). It’s still a transformer and it’s still next-token prediction. They hide the thought steps to mask this fact and to prevent others from benefiting from all of the finetuning data they paid for.

BetaDoggo_@lemmy.world · 21 days ago

The role of biodegradable materials in the next generation of Saw traps

BetaDoggo_@lemmy.world · 26 days ago

It’s cool but it’s more or less just a party trick.

BetaDoggo_@lemmy.world · edit-2 1 month ago

How many times is this same article going to be written? Model collapse from synthetic data is not a concern at any scale when human data is in the mix. We have entire series of models now trained with mostly synthetic data: https://huggingface.co/docs/transformers/main/model_doc/phi3. When using entirely unassisted outputs, error accumulates with each generation, but this isn’t a concern in any real scenario.

BetaDoggo_@lemmy.world · 2 months ago

Based on the pricing, they’re probably betting most users won’t use it. The cheapest API pricing for Flux dev is 40 images per dollar, or about 10 images a day when spending $8 a month. With pro they would get half that. This is before considering the cost of the language model.
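For anyone who wants to check the arithmetic, here is a rough sketch using only the figures quoted above. The 30-day month is an assumption, and "half that" for pro is taken at face value rather than from any price list.

```python
# Figures quoted in the comment above.
IMAGES_PER_DOLLAR_DEV = 40   # cheapest Flux dev API pricing mentioned
MONTHLY_SPEND_USD = 8        # monthly spend used in the comment
DAYS_PER_MONTH = 30          # assumption: a 30-day month


def images_per_day(images_per_dollar: float, monthly_spend: float,
                   days: int = DAYS_PER_MONTH) -> float:
    """Average number of images per day that a monthly budget buys."""
    return images_per_dollar * monthly_spend / days


dev_per_day = images_per_day(IMAGES_PER_DOLLAR_DEV, MONTHLY_SPEND_USD)
pro_per_day = dev_per_day / 2  # "half that", per the comment

print(f"dev: {dev_per_day:.1f} images/day")  # dev: 10.7 images/day
print(f"pro: {pro_per_day:.1f} images/day")  # pro: 5.3 images/day
```

So $8 buys 320 dev images a month, which is where the "about 10 a day" comes from.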

BetaDoggo_@lemmy.world · 2 months ago

There are about a dozen methods they could use: https://arxiv.org/pdf/2312.07913v2

BetaDoggo_@lemmy.world · 2 months ago

New record for most buzzwords in a headline.

BetaDoggo_@lemmy.world · 2 months ago

Most of this seems true (or was at the time) but this is outdated now. Mr. Beast is no longer managed by Night Media.

BetaDoggo_@lemmy.world · 2 months ago

I feel like they should at least provide them with a laptop if they’re going to do unpaid promotion.

BetaDoggo_@lemmy.world · 2 months ago

The animation is flashy but the plot and storytelling can’t even compare to the game.

BetaDoggo_@lemmy.world · 3 months ago

She immigrated when she was 15, 30 years before she made the Queen of Canada claim. You can’t deport someone after 30 years of citizenship for mental illness.

BetaDoggo_@lemmy.world · 3 months ago

What’s the deal with Alpine not using GNU? Is it a technical or ideological thing? Or is it another “because we can” type distro?

BetaDoggo_@lemmy.world · 4 months ago

On Discord, the black hole for useful information.

BetaDoggo_@lemmy.world · 6 months ago

Cohere’s command-r models are trained for exactly this type of task. The real struggle is finding a way to feed relevant sources into the model. There are plenty of projects that have attempted it, but few can do more than pull the first few search results.

BetaDoggo_@lemmy.world · 7 months ago

There should be no difference because the video track hasn’t been touched. Some software will display the length of the longest track rather than the length of the main video track. It’s likely that the audio track was originally longer than the video track and, because of the offset, it’s now shorter.

You can use tools like ffmpeg and mediainfo to count the actual frames in each to verify.
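As a rough sketch of that check, the snippet below wraps ffprobe (which ships with ffmpeg) to count the decoded frames in the first video stream of each file. The helper names and the file paths passed on the command line are hypothetical, and it assumes ffprobe is on your PATH.

```python
import subprocess
import sys


def frame_count_cmd(path: str) -> list[str]:
    """Build an ffprobe command that decodes the first video stream and prints its frame count."""
    return [
        "ffprobe", "-v", "error",
        "-select_streams", "v:0",           # first video stream only
        "-count_frames",                     # decode the stream to count frames
        "-show_entries", "stream=nb_read_frames",
        "-of", "csv=p=0",                    # print just the bare value
        path,
    ]


def count_video_frames(path: str) -> int:
    """Run ffprobe and return the number of frames it read."""
    result = subprocess.run(frame_count_cmd(path), capture_output=True,
                            text=True, check=True)
    first_line = result.stdout.strip().splitlines()[0]
    return int(first_line.strip(","))


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python count_frames.py <original file> <edited file>
    before = count_video_frames(sys.argv[1])
    after = count_video_frames(sys.argv[2])
    print(f"original: {before} frames, edited: {after} frames")
```

If both counts match, the video track is unchanged and the different reported length comes from the audio track.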

BetaDoggo_@lemmy.world · 7 months ago

Test comment

BetaDoggo_@lemmy.world · 7 months ago

Koboldcpp should allow you to run much larger models with a little bit of RAM offloading. There’s a fork that supports ROCm for AMD cards: https://github.com/YellowRoseCx/koboldcpp-rocm

Make sure to use quantized models for the best performance, Q4_K_M being the standard.