The GitHub presence of DeepSeek, an organization under the username deepseek-ai, features a wide range of public repositories primarily developed in Python, C++, and Cuda. Notable projects include DeepSeek-V3, DeepSeek-Coder, and FlashMLA, which focus on AI and machine learning applications, highlighting the organization's commitment to innovative solutions in the field.
No description provided for this repository.
No description provided for this repository.
Integrate the DeepSeek API into popular software
DeepSeek Coder: Let the Code Write Itself
Contexts Optical Compression
Janus-Series: Unified Multimodal Understanding and Generation Models
FlashMLA: Efficient Multi-head Latent Attention Kernels
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
DeepEP: an efficient expert-parallel communication library
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepSeek LLM: Let there be answers
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
A lightweight data processing framework built on DuckDB and 3FS.
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
DeepSeek-VL: Towards Real-World Vision-Language Understanding
No description provided for this repository.
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
Visual Causal Flow
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
No description provided for this repository.
No description provided for this repository.
A kernel library written in tilelang
Expert Parallelism Load Balancer
No description provided for this repository.
Analyze computation-communication overlap in V3/R1.
A curated list of open-source projects related to DeepSeek Coder
Expert Specialized Fine-Tuning
No description provided for this repository.
An early research stage expert-parallel load balancer for MoE models based on linear programming.
Deepseek-ai builds various projects on GitHub that focus on artificial intelligence and machine learning. Their repositories include notable tools like DeepSeek-V3, DeepSeek-Coder, and FlashMLA, which are designed to enhance AI capabilities.
Deepseek-ai primarily utilizes Python, C++, and Cuda for their public repositories. These languages support their development of advanced AI tools and libraries, facilitating innovative solutions in machine learning and computational efficiency.
Yes, all of deepseek-ai's repositories on GitHub are public. This transparency allows the community to access and contribute to their projects, fostering collaboration and innovation in the AI domain.
Monitor DeepSeek with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account