{"id":22052,"date":"2025-09-18T00:07:47","date_gmt":"2025-09-18T00:07:47","guid":{"rendered":"https:\/\/naijaglobalnews.org\/?p=22052"},"modified":"2025-09-18T00:07:47","modified_gmt":"2025-09-18T00:07:47","slug":"secrets-of-chinese-ai-model-deepseek-revealed-in-landmark-paper","status":"publish","type":"post","link":"https:\/\/naijaglobalnews.org\/?p=22052","title":{"rendered":"Secrets of Chinese AI Model DeepSeek Revealed in Landmark Paper"},"content":{"rendered":"<p>\n<\/p>\n<p class=\"article_pub_date-zPFpJ\">September 17, 2025<\/p>\n<p class=\"article_read_time-ZYXEi\">4 min read<\/p>\n<p>Secrets of DeepSeek AI Model Revealed in Landmark Paper<\/p>\n<p>The first peer-reviewed study of the DeepSeek AI model shows how a Chinese start-up firm made the market-shaking LLM for $300,000<\/p>\n<p class=\"article_authors-ZdsD4\">By Elizabeth Gibney &amp; Nature magazine <\/p>\n<p>DeepSeek says its R1 model did not learn by copying examples generated by other LLMs.<\/p>\n<p>Iain Masterton\/Alamy Live News<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">The success of DeepSeek\u2019s powerful artificial intelligence (AI) model R1 \u2014 that made the US stock market plummet when it was released in January \u2014 did not hinge on being trained on the output of its rivals, researchers at the Chinese firm have said. The statement came in documents released alongside a peer-reviewed version of the R1 model, published today in Nature.<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">R1 is designed to excel at \u2018reasoning\u2019 tasks such as mathematics and coding, and is a cheaper rival to tools developed by US technology firms. As an \u2018open weight\u2019 model, it is available for anyone to download and is the most popular such model on the AI community platform Hugging Face to date, having been downloaded 10.9 million times.<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">The paper updates a preprint released in January, which describes how DeepSeek augmented a standard large language model (LLM) to tackle reasoning tasks. Its supplementary material reveals for the first time how much R1 cost to train: the equivalent of just US$294,000. This comes on top of the $6 million or so that the company, based in Hangzhou, spent to make the base LLM that R1 is built on, but the total amount is still substantially less than the tens of millions of dollars that rival models are thought to have cost. DeepSeek says R1 was trained mainly on Nvidia\u2019s H800 chips, which in 2023 became forbidden from being sold to China under US export controls.<\/p>\n<h2>On supporting science journalism<\/h2>\n<p>If you&#8217;re enjoying this article, consider supporting our award-winning journalism by subscribing. By purchasing a subscription you are helping to ensure the future of impactful stories about the discoveries and ideas shaping our world today.<\/p>\n<h2 id=\"rigorous-review\" class=\"\" data-block=\"sciam\/heading\">Rigorous review<\/h2>\n<p class=\"\" data-block=\"sciam\/paragraph\">R1 is thought to be the first major LLM to undergo the peer-review process. \u201cThis is a very welcome precedent,\u201d says Lewis Tunstall, a machine-learning engineer at Hugging Face who reviewed the Nature paper. \u201cIf we don&#8217;t have this norm of sharing a large part of this process publicly, it becomes very hard to evaluate whether these systems pose risks or not.\u201d<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">In response to peer-review comments, the DeepSeek team reduced anthropomorphizing in its descriptions and added clarifications of technical details, including the kinds of data the model was trained on, and its safety. \u201cGoing through a rigorous peer-review process certainly helps verify the validity and usefulness of the model,\u201d says Huan Sun, an AI researcher at Ohio State University in Columbus. \u201cOther firms should do the same.\u201d<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">DeepSeek\u2019s major innovation was to use an automated kind of the trial-and-error approach known as pure reinforcement learning to create R1. The process rewarded the model for reaching correct answers, rather than teaching it to follow human-selected reasoning examples. The company says that this is how its model learnt its own reasoning-like strategies, such as how to verify its workings without following human-prescribed tactics. To boost efficiency, the model also scored its own attempts using estimates, rather than employing a separate algorithm to do so, a technique known as group relative policy optimization.<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">The model has been \u201cquite influential\u201d among AI researchers, says Sun. \u201cAlmost all work in 2025 so far that conducts reinforcement learning in LLMs might have been inspired by R1 one way or another.\u201d<\/p>\n<h2 id=\"training-technique\" class=\"\" data-block=\"sciam\/heading\">Training technique<\/h2>\n<p class=\"\" data-block=\"sciam\/paragraph\">Media reports in January suggested that researchers at OpenAI, the company, based in San Francisco, California, that created ChatGPT and the \u2018o\u2019 series of reasoning models, thought DeepSeek had used outputs from OpenAI models to train R1, a method that could have accelerated a model\u2019s abilities while using fewer resources.<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">DeepSeek has not published its training data as part of the paper. But, in exchanges with referees, the firm\u2019s researchers stated that R1 did not learn by copying reasoning examples that were generated by OpenAI models. However, they acknowledged that, like most other LLMs, R1\u2019s base model was trained on the web, so it will have ingested any AI-generated content already on the Internet.<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">This rebuttal is \u201cas convincing as what we could see in any publication\u201d, says Sun. Tunstall adds that although he can\u2019t be 100% sure R1 wasn\u2019t trained on OpenAI examples, replication attempts by other labs suggest that DeepSeek\u2019s recipe for reasoning is probably good enough to not need to do this. \u201cI think the evidence now is fairly clear that you can get very high performance just using pure reinforcement learning,\u201d he says.<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">For researchers, R1 is still very competitive, Sun says. In a challenge to complete scientific tasks such as analyzing and visualizing data, known as ScienceAgentBench, Sun and colleagues found that although R1 was not first for accuracy, it was one of the best models in terms of balancing ability with cost.<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">Other researchers are now trying to apply the methods used to create R1 to improve the reasoning-like abilities of existing LLMs, as well as extending them to domains beyond mathematics and coding, says Tunstall. In that way, he adds, R1 has \u201ckick-started a revolution.\u201d<\/p>\n<p class=\"\" data-block=\"sciam\/paragraph\">This article is reproduced with permission and was first published on September 17, 2025.<\/p>\n<h2 class=\"subscriptionPleaHeading-DMY4w\">It\u2019s Time to Stand Up for Science<\/h2>\n<p class=\"subscriptionPleaText--StZo\">If you enjoyed this article, I\u2019d like to ask for your support. <span class=\"subscriptionPleaItalicFont-i0VVV\">Scientific American<\/span> has served as an advocate for science and industry for 180 years, and right now may be the most critical moment in that two-century history.<\/p>\n<p class=\"subscriptionPleaText--StZo\">I\u2019ve been a <span class=\"subscriptionPleaItalicFont-i0VVV\">Scientific American<\/span> subscriber since I was 12 years old, and it helped shape the way I look at the world. <span class=\"subscriptionPleaItalicFont-i0VVV\">SciAm <\/span>always educates and delights me, and inspires a sense of awe for our vast, beautiful universe. I hope it does that for you, too.<\/p>\n<p class=\"subscriptionPleaText--StZo\">If you subscribe to <span class=\"subscriptionPleaItalicFont-i0VVV\">Scientific American<\/span>, you help ensure that our coverage is centered on meaningful research and discovery; that we have the resources to report on the decisions that threaten labs across the U.S.; and that we support both budding and working scientists at a time when the value of science itself too often goes unrecognized.<\/p>\n<p class=\"subscriptionPleaText--StZo\">In return, you get essential news, captivating podcasts, brilliant infographics, can&#8217;t-miss newsletters, must-watch videos, challenging games, and the science world&#8217;s best writing and reporting. You can even gift someone a subscription.<\/p>\n<p class=\"subscriptionPleaText--StZo\">There has never been a more important time for us to stand up and show why science matters. I hope you\u2019ll support us in that mission.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>September 17, 2025 4 min read Secrets of DeepSeek AI Model Revealed in Landmark Paper The first peer-reviewed study of the DeepSeek AI model shows how a Chinese start-up firm made the market-shaking LLM for $300,000 By Elizabeth Gibney &amp; Nature magazine DeepSeek says its R1 model did not learn by copying examples generated by<\/p>\n","protected":false},"author":1,"featured_media":22053,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[50],"tags":[4261,13525,4090,4029,10246,7138,5686],"class_list":{"0":"post-22052","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-environment","8":"tag-chinese","9":"tag-deepseek","10":"tag-landmark","11":"tag-model","12":"tag-paper","13":"tag-revealed","14":"tag-secrets"},"_links":{"self":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/posts\/22052","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=22052"}],"version-history":[{"count":0,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/posts\/22052\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/media\/22053"}],"wp:attachment":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=22052"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=22052"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=22052"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}