RepoGuard
Updated 8 h ago
Distributed (Deep) Machine Learning Community

Organization

Public GitHub footprint of Distributed (Deep) Machine Learning Community

@dmlc

A Community of Awesome Machine Learning Projects

51

Public repositories

69,603

Total stars

1,738

Followers

The dmlc organization on GitHub hosts a wide range of machine learning projects, most of them written in C++ or Python, with a few Jupyter Notebook repositories. Notable repositories include xgboost, a scalable library for gradient boosting, and dgl, which simplifies deep learning on graphs.

Top languages

C++ 21, Python 11, Jupyter Notebook 3, Cuda 2, JavaScript 1, Julia 1, TypeScript 1, HTML 1

Public repositories

xgboost

28,468

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++
Updated Jun 12, 2026

dgl

14,276

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python
Updated Jun 11, 2026

gluon-cv

5,922

Gluon CV Toolkit

Python
Updated Jun 2, 2026

gluon-nlp

2,546

NLP made easy

Python
Updated Jun 11, 2026

decord

2,492

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++
Updated Jun 12, 2026

nnvm

1,649

No description provided for this repository.

C++
Updated May 24, 2026

ps-lite

1,562

A lightweight parameter server interface

C++
Updated Jun 12, 2026

dlpack

1,222

common in-memory tensor structure

C++
Updated Jun 10, 2026

mshadow

1,117

Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning

C++
Updated May 26, 2026

minpy

1,096

NumPy interface with mixed backend execution

Python
Updated Apr 24, 2026

cxxnet

1,027

move forward to https://github.com/dmlc/mxnet

C++
Updated Apr 18, 2026

dmlc-core

877

A common bricks library for building scalable and portable distributed machine learning.

C++
Updated May 21, 2026

treelite

822

Universal model exchange and serialization format for decision tree forests

C++
Updated Jun 9, 2026

minerva

712

Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bindings and C++ bindings are both available. The resulting code can be run on CPU or GPU. Multi-GPU support is very easy.

C++
Updated Jun 8, 2026

parameter_server

649

moved to https://github.com/dmlc/ps-lite

C++
Updated Jun 12, 2026

mxnet-notebooks

608

Notebooks for MXNet

Jupyter Notebook
Updated Feb 13, 2026

rabit

512

Reliable Allreduce and Broadcast Interface for distributed machine learning

C++
Updated Jun 6, 2026

mxnet.js

433

MXNetJS: Javascript Package for Deep Learning in Browser (without server)

JavaScript
Updated Sep 8, 2025

tensorboard

370

Standalone TensorBoard for visualizing in deep learning

Python
Updated Apr 24, 2026

MXNet.jl

370

MXNet Julia Package - flexible and efficient deep learning in Julia

Unknown Language
Updated Jan 18, 2026

wormhole

336

Deprecated

C++
Updated Jun 6, 2026

mxnet-memonger

306

Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets

Python
Updated Apr 24, 2026

XGBoost.jl

304

XGBoost Julia Package

Julia
Updated May 15, 2026

difacto

298

Distributed Factorization Machines

C++
Updated Jun 6, 2026

mxnet-model-gallery

266

Pre-trained Models of DMLC Project

Unknown Language
Updated Feb 5, 2024

GNNLens2

261

Visualization tool for Graph Neural Networks

TypeScript
Updated Apr 10, 2026

HalideIR

206

Symbolic Expression and Statement Module for new DSLs

C++
Updated Mar 29, 2026

mxnet-gtc-tutorial

132

MXNet Tutorial for NVidia GTC 2016.

Jupyter Notebook
Updated Jun 11, 2026

experimental-lda

127

No description provided for this repository.

C++
Updated Jan 4, 2024

keras

124

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on MXNet, Theano or TensorFlow.

Python
Updated Apr 15, 2026

MXNet.cpp

115

C++ interface for mxnet

C++
Updated Dec 26, 2024

experimental-mf

88

cache-friendly multithread matrix factorization

C++
Updated Jun 6, 2026

web-data

84

The repo to host all the web data including images for documents in dmlc projects.

Jupyter Notebook
Updated Oct 13, 2024

nnvm-fusion

72

Kernel Fusion and Runtime Compilation Based on NNVM

C++
Updated Apr 24, 2026

tl2cgen

48

TL2cgen (TreeLite 2 C GENerator) is a model compiler for decision tree models

C++
Updated Jun 11, 2026

dmlc.github.io

28

No description provided for this repository.

HTML
Updated Apr 4, 2025

cub

24

No description provided for this repository.

Cuda
Updated May 11, 2026

xgboost-devops

9

Host custom actions; keep track of manual approval requests for CI jobs.

Shell
Updated Jun 12, 2026

caffe

9

Caffe: a fast open framework for deep learning.

C++
Updated Oct 15, 2022

mxnet-deepmark

7

Benchmark speed and other issues internally, before push to deep-mark

Python
Updated Dec 22, 2021

mxnet-examples

5

MXNet Example

Unknown Language
Updated Apr 24, 2026

numpy-ml

5

Machine learning, in numpy

Unknown Language
Updated Mar 2, 2024

DeepSpeed

4

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python
Updated Feb 3, 2025

drat

4

Drat Repository for DMLC R packages

Unknown Language
Updated Aug 25, 2023

xgboost-bench

4

No description provided for this repository.

Python
Updated Aug 4, 2023

ccache

2

ccache – a fast compiler cache

Unknown Language
Updated Sep 4, 2023

dmlc.r-universe.dev

1

No description provided for this repository.

Unknown Language
Updated Feb 10, 2026

nccl

1

Optimized primitives for collective multi-GPU communication

Cuda
Updated Feb 21, 2025

gluon-nlp-notebooks

1

No description provided for this repository.

Unknown Language
Updated May 9, 2024

docs-redirect-for-mxnet

1

redirect mxnet.readthedocs.io to mxnet.io

Python
Updated Dec 9, 2021

nn-examples

1

No description provided for this repository.

Unknown Language
Updated Dec 9, 2021

Frequently asked questions

What does dmlc build on GitHub?

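dmlc builds machine learning tools and libraries on GitHub, including xgboost for gradient boosting and dgl for deep learning on graphs. Other repositories cover supporting infrastructure, such as ps-lite (a lightweight parameter server interface), dlpack (a common in-memory tensor structure), and treelite (a model exchange and serialization format for decision tree forests).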

Which programming languages does dmlc use?

C++ (21 repositories) and Python (11) are the most common languages across dmlc's repositories, followed by Jupyter Notebook (3) and Cuda (2), with one repository each in JavaScript, Julia, TypeScript, and HTML.

Are dmlc's repositories public?

Yes, every repository listed on this page is public on GitHub. Developers and researchers can freely access, use, and contribute to these machine learning projects.
