Talk:DeepSeek

Correct Company Full English Name

The official company English name is Hangzhou DeepSeek Artificial Intelligence Co., Ltd. (refer to https://cdn.deepseek.com/policies/en-US/deepseek-privacy-policy.html). However, this article currently uses a different translation. @Cfls disagrees with this name, so I am inviting more people to discuss it here. Cs haoh (talk) 12:02, 2 March 2025 (UTC)[reply]

K-V caching

The Development and Research section has two mentions of K-V caching, associated with two sources, the papers "DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models" and "DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence", but a quick search of both papers did not turn up the word "cache" anywhere. I'm sure there's a source for this somewhere, or maybe I'm missing something, but could somebody either verify that these sources actually support these claims, or provide sources that do? I added two verification-needed tags where it comes up. Truthnope (talk) 22:21, 10 March 2025 (UTC)[reply]
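
For context while checking those sources: K-V caching generally refers to storing the attention key and value projections of tokens that have already been processed, so each new decoding step only computes projections for the newest token instead of recomputing the whole prefix. Below is a minimal, generic Python/PyTorch sketch of the idea; the function names, shapes, and single-head setup are illustrative assumptions on my part, not something taken from the cited papers or from DeepSeek's code.

    # Generic single-head sketch of K-V caching during autoregressive decoding.
    # Illustrative only; not DeepSeek's implementation.
    import torch

    def attend(q, k, v):
        # q: (1, d); k, v: (t, d). Standard scaled dot-product attention.
        scores = q @ k.T / (k.shape[-1] ** 0.5)
        return torch.softmax(scores, dim=-1) @ v

    def generate_step(x_t, w_q, w_k, w_v, cache):
        # Project only the newest token; reuse cached keys/values from earlier
        # steps instead of recomputing them. That reuse is the K-V cache.
        q = x_t @ w_q
        cache["k"] = torch.cat([cache["k"], x_t @ w_k], dim=0)
        cache["v"] = torch.cat([cache["v"], x_t @ w_v], dim=0)
        return attend(q, cache["k"], cache["v"]), cache

    # Usage: start with an empty cache and append one token per decoding step.
    d = 64
    cache = {"k": torch.empty(0, d), "v": torch.empty(0, d)}
    w_q, w_k, w_v = (torch.randn(d, d) for _ in range(3))
    for _ in range(5):
        x_t = torch.randn(1, d)  # embedding of the newly generated token
        out, cache = generate_step(x_t, w_q, w_k, w_v, cache)

Reducing the size of this cache is the usual motivation for the related architectural changes discussed in the article, so ideally the citation would point to a paper that mentions the cache explicitly.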

Open source / open weight

DeepSeek has often been described as open source (example). Yet other sources, and this article, distinguish it from genuine open source; e.g. here it says "The DeepSeek algorithm is ‘open weight,’ which is similar to but different from ‘open source.’" It seems to be a bit more than that, though, and closer to, and enabling, open source; see here: "The engineers said they were compelled to act by DeepSeek’s “black box” release philosophy. Technically, R1 is “open” in that the model is permissively licensed, which means it can be deployed largely without restrictions. However, R1 isn’t “open source” by the widely accepted definition because some of the tools used to build it are shrouded in mystery. Like many high-flying AI companies, DeepSeek is loathe to reveal its secret sauce." And here: "DeepSeek doesn’t disclose … training code used to train its models. […] DeepSeek’s models are similarly opaque, but HuggingFace is trying to unravel the mystery. On 28 January, it announced Open-R1, an effort to create a fully open-source version of DeepSeek-R1. […] Regardless of Open-R1’s success, however, Bakouch says DeepSeek’s impact goes well beyond the open AI community. “The excitement isn’t just in the open-source community."

So this means Category:Open-source artificial intelligence wouldn't be good to add here, even though a) it is highly relevant to open-source AI and b) it has often been called open source, but the category could be added if there was an article on the HuggingFace variant, assuming they make it fully open source, right? What about a category for open-weights AI, and what's the current state of a fully open-source variant of it? Prototyperspective (talk) 22:42, 1 April 2025 (UTC)[reply]

Proposed summary for technical prose

I've been using Google's Gemini 2.5 Pro Experimental large language model to create summaries for the most popular articles with {{Technical}} templates. This article, DeepSeek, has such a template in the "Overview of models" section. Here is the paragraph summary, at a grade 5 reading level, that Gemini 2.5 Pro suggested for that section:

DeepSeek has created several special computer programs called models. Some models, like DeepSeek Coder, are good at helping write computer instructions. Others, like DeepSeek-LLM, are made for general chatting and writing. They also made models just for solving math problems and models called R1 that focus on thinking step-by-step. DeepSeek keeps making newer versions like V2 and V3, which learn from lots of information and sometimes use special tricks to work faster or better. People can use these models, but there might be rules about how much they can change them.

While I have read that summary and may have made some modifications to it, I am not going to add it to the section myself, because I want other editors to review it, revise it if appropriate, and add it instead. This is an experiment with a few dozen articles initially to see how these suggestions are received, and after a week or two, I will decide how to proceed. Thank you for your consideration. Cramulator (talk) 12:15, 2 April 2025 (UTC)[reply]

I am retracting this and the other LLM-generated suggestions due to clear negative consensus at the Village Pump. I will be posting a thorough postmortem report in mid-April to the source code release page. Thanks to all who commented on the suggestions both negatively and positively, and especially to those editors who have manually addressed the overly technical cleanup issue on six, so far, of the 68 articles where these suggestions were posted. Cramulator (talk) 22:05, 4 April 2025 (UTC)[reply]