Skip to main content

Researchers recreated DeepSeek’s core technology for just $30

A group of researchers at the University of California, Berkeley, say they’ve recreated the core technology found in China’s revolutionary DeepSeek AI for just $30. This extremely cheap DeepSeek recreation is yet another indicator that while models from larger companies have been impressive, there may be much more affordable ways to build them. Led by Ph.D. candidate Jiayi Pan, the team replicated DeepSeek R1-Zero’s reinforcement learning capabilities using a small language model with just 3 billion parameters. Despite its relatively modest size, the AI demonstrated self-verification and search abilities, key features that allow it to refine its own responses iteratively. To test their DeepSeek recreation, the Berkeley team used the Countdown game, a numerical puzzle based on the British game show where players must use arithmetic to reach a target number. Initially, the model produced random guesses, but through reinforcement learning, it developed techniques for self-correction and iterative problem-solving. Eventually, it learned to revise its answers until it arrived at the correct solution. They also experimented with multiplication, where the AI broke down equations using the distributive property, much like humans might mentally solve large multiplication problems. This demonstrated the model’s ability to adapt its strategy based on the problem. What’s particularly impressive is that the entire recreation cost them just $30, Pan claims in a post on Nitter. This is a mind-boggling fraction of what leading AI firms spend on large-scale training. The researchers tested multiple model sizes, starting with a 500-million-parameter model that could only guess and stop, regardless of accuracy. When scaled to 1.5 billion parameters, the DeepSeek recreation began incorporating revision techniques. Models between 3 and 7 billion parameters showed significant improvement, solving problems in fewer steps with better accuracy, Pan and the other researchers report. For some context, OpenAI charges $15 per million tokens via its API at the time of writing, while DeepSeek offers a much lower cost of $0.55 per million tokens. The Berkeley team’s findings suggest that highly capable AI models can be developed for a fraction of the cost currently invested by leading AI companies. Despite how cheap it is, there are many reasons you should probably avoid DeepSeek.
One reason is that some experts are skeptical about DeepSeek’s claimed affordability. AI researcher Nathan Lambert has raised concerns about whether DeepSeek’s reported $5 million training cost for its 671-billion-parameter model accurately reflects the full picture. The AI also sends a lot of data back to China, which is certainly cause for concern and is already leading to DeepSeek bans throughout the U.S. In fact, Lambert estimates that DeepSeek AI’s annual operational expenses could be anywhere between $500 million and over $1 billion, considering everything from infrastructure, energy consumption, and research personnel costs. OpenAI also claims there is evidence DeepSeek was trained using ChatGPT, which could help account for some of the reduced costs. Even so, the Berkeley team’s work proves that cutting-edge reinforcement learning can be achieved without the enormous budgets that industry giants like OpenAI, Google, and Microsoft currently allocate. With some AI labs spending up to $10 billion annually on training models, this research highlights what could become a potentially disruptive shift in the field.

Comments

Popular posts from this blog

iPhone 16e: Next Apple iPhone SE reveals itself in new leaked footage

The next iPhone SE has been pictured again in a new set of leaks. Currently thought of as the iPhone SE 4 or iPhone 16e, it still remains unclear whether the mid-range iPhone will line up with a Dynamic Island or Apple's infamous notch across its 6.1-inch display. Alex Alderson, Published 01/27/2025 Apple iPhone Leaks / Rumors Almost a fortnight has passed since Sonny Dickson shared the first photos of the long-awaited successor to the iPhone SE 3rd Gen (curr. $160.97 - renewed on Amazon). Now, leaker Majin Bu has published another round of photos that show the so-called iPhone SE 4 or iPhone 16e, which show the device in greater detail, which we have embedded below. The same leaker has also uploaded a short video of the same device, albeit still in dummy form. Nonetheless, the overall design shown should represent what companies are working against when designing third-party cases. For instance, the iPhone SE 4 has a single oversized rear-facing camera, which is reputed to be a ...

The decision-maker's playbook: integrating Generative AI for optimal results

Step into the future with GenAI: revolutionize business, make lightning-fast decisions, and dominate the competition. http://dlvr.it/TJM9f4

Manus To Launch Its Mobile App For Its Users

Manus AI, a new Chinese artificial intelligence (AI) agent, has shared its beta testing updates and has stated that the Manus AI app will be launched for its users on their mobile devices. The firm has also shared other updates with its community. Manus has shared five updates for its AI. The firm said that it will have better multimodal capabilities than before. It will have a longer context. All tasks will utilize Claude 3.7 to power the AI model. These changes will result in a more stable sandbox. It will also have a premium subscription plan beta test while maintaining limited free access. Manus AI has recently shared a post on X (formerly known as Twitter) stating that, “While we’re working hard around the clock to scale our infrastructure and accommodate everyone, we’ve had to temporarily limit access to Manus during this development phase. We are also working on optimizing our current usage rates to provide better value for our users.” At last, the firm also thanked i...