Frankly, I don’t think you seriously tested anything that you’ve mentioned here.
Nobody’s using Qwen because it doesn’t do tool calls. Nobody really uses ollama for useful workloads because they don’t own the hardware to make it good enough.
That’s not to say that I don’t want self-hosted models to be good. I absolutely do. But let’s be realistic here.
Frankly, I don’t think you seriously tested anything that you’ve mentioned here.
Nobody’s using Qwen because it doesn’t do tool calls. Nobody really uses ollama for useful workloads because they don’t own the hardware to make it good enough.
That’s not to say that I don’t want self-hosted models to be good. I absolutely do. But let’s be realistic here.