LogoAIAny
Icon for item

DeepSeek

DeepSeek, founded in 2023, is dedicated to developing world-class foundational models and technologies for general artificial intelligence, tackling cutting-edge research challenges in AI. Leveraging its self-developed training framework, an in-house intelligent computing cluster, and tens of thousands of GPUs, the DeepSeek team released and open-sourced several large-scale models—each with tens of billions of parameters—within just six months. These include the general-purpose large language model DeepSeek-LLM, the code-specialized DeepSeek-Coder, and, in January 2024, China’s first open-source MoE (Mixture-of-Experts) model, DeepSeek-MoE. Across public benchmarks and real-world generalization tests, these models consistently outperform peers in their class. You can converse with DeepSeek AI or access its capabilities easily via API.

Introduction

DeepSeek is a Hangzhou-based artificial-intelligence company that builds large language models (LLMs) and open-source tooling for code, research and everyday productivity. Founded in July 2023 by hedge-fund veteran Liang Wenfeng, the firm has released a fast-iterating model family—including DeepSeek-V3, R1 and DeepSeek Coder—that rivals frontier systems on reasoning, math and programming benchmarks while remaining MIT-licensed for commercial use.

The company pairs its proprietary training framework with a 10 000-GPU cluster to push low-cost, high-throughput inference, exposing its models through a ChatGPT-compatible API, web chat and mobile apps. DeepSeek’s rapid growth—its R1 chatbot topped the U.S. iOS charts in January 2025—illustrates China’s accelerating presence in open generative AI and has drawn comparisons with Silicon Valley leaders.

Information

Categories