This article is written in American English, which has its own spelling conventions (color, defense, traveled) and some terms that are used in it may be different or absent from other varieties of English. According to the relevant style guide, this should not be changed without broad consensus.
This article is within the scope of WikiProject Artificial Intelligence, a collaborative effort to improve the coverage of Artificial intelligence on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject China, a collaborative effort to improve the coverage of China-related articles on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Companies, a collaborative effort to improve the coverage of companies on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Computer science, a collaborative effort to improve the coverage of Computer science related articles on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
DeepSeek is part of WikiProject Transhumanism, which aims to organize, expand, clean up, and guide Transhumanism related articles on Wikipedia. If you would like to participate, you can edit this article, or visit the project page for more details.
This article is within the scope of WikiProject Statistics, a collaborative effort to improve the coverage of statistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Robotics, a collaborative effort to improve the coverage of Robotics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Effective Altruism, a collaborative effort to improve the coverage of topics relevant to effective altruism on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Technology, a collaborative effort to improve the coverage of technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article has been viewed enough times in a single week to appear in the Top 25 Report. The week in which this happened:
The Development and Research section mentions K-V caching twice, citing two sources: the papers "DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models" and "DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence". I did a quick search of both papers and couldn't find the word "cache" anywhere. I'm sure there's a source for this somewhere, or maybe I'm missing something, but could somebody either verify that these sources actually support the claims, or provide sources that do? I added two verification-needed tags where it comes up. Truthnope (talk) 22:21, 10 March 2025 (UTC)
DeepSeek has often been described as an open-source example. Yet other sources, and this article, distinguish it from genuine open source, e.g. here it says: "The DeepSeek algorithm is ‘open weight,’ which is similar to but different from ‘open source.’" It seems to be a bit more than that, though, and closer to enabling open source. See here: "The engineers said they were compelled to act by DeepSeek’s “black box” release philosophy. Technically, R1 is “open” in that the model is permissively licensed, which means it can be deployed largely without restrictions. However, R1 isn’t “open source” by the widely accepted definition because some of the tools used to build it are shrouded in mystery. Like many high-flying AI companies, DeepSeek is loathe to reveal its secret sauce." And here: "DeepSeek doesn’t disclose … training code used to train its models. […] DeepSeek’s models are similarly opaque, but HuggingFace is trying to unravel the mystery. On 28 January, it announced Open-R1, an effort to create a fully open-source version of DeepSeek-R1. […] Regardless of Open-R1’s success, however, Bakouch says DeepSeek’s impact goes well beyond the open AI community. “The excitement isn’t just in the open-source community."
So this means Category:Open-source artificial intelligence wouldn't be appropriate to add here, even though a) DeepSeek is highly relevant to open-source AI and b) it has often been called open source. The category could be added if there were an article on the HuggingFace variant, provided they make it fully open source, right? What about a category for open-weights AI, and what's the current state of a fully open-source variant? Prototyperspective (talk) 22:42, 1 April 2025 (UTC)
I've been using Google's Gemini 2.5 Pro Experimental large language model to create summaries for the most popular articles with {{Technical}} templates. This article, DeepSeek, has such a template in the "Overview of models" section. Here is the paragraph summary at grade 5 reading level which Gemini 2.5 Pro suggested for that section:
DeepSeek has created several special computer programs called models. Some models, like DeepSeek Coder, are good at helping write computer instructions. Others, like DeepSeek-LLM, are made for general chatting and writing. They also made models just for solving math problems and models called R1 that focus on thinking step-by-step. DeepSeek keeps making newer versions like V2 and V3, which learn from lots of information and sometimes use special tricks to work faster or better. People can use these models, but there might be rules about how much they can change them.
While I have read and may have made some modifications to that summary, I am not going to add it to the section because I want other editors to review, revise if appropriate, and add it instead. This is an experiment with a few dozen articles initially to see how these suggestions are received, and after a week or two, I will decide how to proceed. Thank you for your consideration. Cramulator (talk) 12:15, 2 April 2025 (UTC)
I am retracting this and the other LLM-generated suggestions due to clear negative consensus at the Village Pump. I will be posting a thorough postmortem report in mid-April to the source code release page. Thanks to all who commented on the suggestions both negatively and positively, and especially to those editors who have manually addressed the overly technical cleanup issue on six, so far, of the 68 articles where these suggestions were posted. Cramulator (talk) 22:05, 4 April 2025 (UTC)