Yes, but the basic problem doesn’t change; you’re spending billions to make millions. And Deepseek’s approach only works because they’re able to essentially distill the output of less efficient models like Llama and GPT. So they haven’t actually solved the underlying technical issues, they’ve just found a way to break into the industry as a smaller player.
At the end of the day, the problem is not that you can’t ever make something useful with transformer models; it’s that you cannot make that useful thing in a way that is cost effective. That’s especially a problem if you expect big companies like Microsoft or OpenAI to continue to offer these services at an affordable price. Yes, Copilot can help you code, but that’s worth Jack shit if the only way for Microsoft to recoup their investment is by charging $200 a month for it.
It does have a large initial cost. It also has a large ongoing cost. GPU time is really, really pricey.
Even putting aside training and infrastructure, OpenAI still loses money on even their most expensive paid subscribers. While guys like Deepseek have shown ways of reducing those costs, they’re still not enough to make these models profitable to run at the kind of workloads they’re intended to handle, and attempts to reduce their fallibility make them even more expensive, because they basically just involve running the model multiple times over.
Yes, but the basic problem doesn’t change; you’re spending billions to make millions. And Deepseek’s approach only works because they’re able to essentially distill the output of less efficient models like Llama and GPT. So they haven’t actually solved the underlying technical issues, they’ve just found a way to break into the industry as a smaller player.
At the end of the day, the problem is not that you can’t ever make something useful with transformer models; it’s that you cannot make that useful thing in a way that is cost effective. That’s especially a problem if you expect big companies like Microsoft or OpenAI to continue to offer these services at an affordable price. Yes, Copilot can help you code, but that’s worth Jack shit if the only way for Microsoft to recoup their investment is by charging $200 a month for it.
ai has large initial cost, but older models will continue to exist and the open source models will continue to take potential profit from the corps
It does have a large initial cost. It also has a large ongoing cost. GPU time is really, really pricey.
Even putting aside training and infrastructure, OpenAI still loses money on even their most expensive paid subscribers. While guys like Deepseek have shown ways of reducing those costs, they’re still not enough to make these models profitable to run at the kind of workloads they’re intended to handle, and attempts to reduce their fallibility make them even more expensive, because they basically just involve running the model multiple times over.