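DeepSeek is an active organization whose public presence on GitHub showcases a diverse range of projects. It works mainly in Python, C++, CUDA, and Makefile, and its notable projects include DeepSeek-V3, DeepSeek-Coder, and DeepSeek-OCR. The organization focuses on developing and optimizing deep learning tools.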
No description provided for this repository.
No description provided for this repository.
Integrate the DeepSeek API into popular software
DeepSeek Coder: Let the Code Write Itself
Contexts Optical Compression
Janus-Series: Unified Multimodal Understanding and Generation Models
FlashMLA: Efficient Multi-head Latent Attention Kernels
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
DeepEP: an efficient expert-parallel communication library
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepSeek LLM: Let there be answers
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
A lightweight data processing framework built on DuckDB and 3FS.
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
DeepSeek-VL: Towards Real-World Vision-Language Understanding
No description provided for this repository.
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
Visual Causal Flow
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
No description provided for this repository.
No description provided for this repository.
A kernel library written in tilelang
Expert Parallelism Load Balancer
No description provided for this repository.
Analyze computation-communication overlap in V3/R1.
A curated list of open-source projects related to DeepSeek Coder
Expert Specialized Fine-Tuning
No description provided for this repository.
An early research stage expert-parallel load balancer for MoE models based on linear programming.
deepseek-ai builds a range of deep learning projects on GitHub. Its main projects include DeepSeek-V3 and DeepSeek-Coder, which focus on AI development and automated programming.
deepseek-ai mainly uses Python, C++, CUDA, and Makefile. These languages support its work on efficient deep learning tools and libraries.
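Yes, all of deepseek-ai's repositories are public. Other developers can access and contribute to its projects, which supports the sharing and development of the technology.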