RepoGuard
Updated 8 h ago
DataTalksClub

Organization

Public GitHub footprint of DataTalksClub

@DataTalksClub
View profile on GitHub

The place to talk about data

World wide

34

Public repositories

80,681

Total stars

8,027

Followers

DataTalksClub has a substantial public GitHub presence, focusing on data-centric education and projects. Their repositories include a variety of programming languages such as Jupyter Notebook, Python, and JavaScript. Notable projects like data-engineering-zoomcamp and mlops-zoomcamp provide free, structured courses for data enthusiasts worldwide.

Top languages

Jupyter Notebook 12Python 5HTML 2JavaScript 1SCSS 1CSS 1TypeScript 1

Public repositories

data-engineering-zoomcamp

42,380

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook
Updated Jun 13, 2026

mlops-zoomcamp

14,785

Free MLOps course from DataTalks.Club

Jupyter Notebook
Updated Jun 13, 2026

machine-learning-zoomcamp

13,242

Learn ML engineering for free in 4 months! Register here 👇🏼

Jupyter Notebook
Updated Jun 12, 2026

llm-zoomcamp

6,282

LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answers questions about your knowledge base.

Jupyter Notebook
Updated Jun 13, 2026

ai-dev-tools-zoomcamp

1,124

AI Dev Tools Zoomcamp is a free course that helps you use AI tools to write better code, faster. We're starting the first cohort of this course on November 18, 2025! Sign up here to join us 👇🏼

JavaScript
Updated Jun 12, 2026

stock-markets-analytics-zoomcamp

879

Course Materials for Analytics in Stock Markets Zoomcamp

Jupyter Notebook
Updated Jun 12, 2026

project-of-the-week

417

Learn by doing: DIY project groups at DataTalks.Club

Unknown Language
Updated Jun 8, 2026

awesome-data-podcasts

388

A list of awesome data podcasts

Unknown Language
Updated May 6, 2026

datatalksclub.github.io

313

The web page for DataTalks.Club, a global online community of data enthusiasts

Python
Updated Jun 11, 2026

nyc-tlc-data

208

Backup for NYC TLC data for the DE Zoomcamp course

Unknown Language
Updated Jun 10, 2026

open-source-llm-zoomcamp

192

A free mini-course about Open-Source LLMs

Unknown Language
Updated Jun 9, 2026

data-paths

149

Learning paths for data roles

Unknown Language
Updated Apr 21, 2026

course-management-platform

77

Django-based course management platform for Zoomcamps

Python
Updated Jun 12, 2026

data-analytics-interviews

68

Data analytics interview questions and answers

Unknown Language
Updated Mar 19, 2026

zoomcamp-analytics

34

Public data and analytics for our open course

Jupyter Notebook
Updated Mar 7, 2026

kaggle-qa-challenge-starter

29

The getting started notebook for the DTC Zoomcamp Q&A challenge

Jupyter Notebook
Updated Nov 11, 2024

docs

18

No description provided for this repository.

SCSS
Updated Jun 10, 2026

reading-club-nlp

18

Notes from our NLP reading club!

Unknown Language
Updated Dec 28, 2025

kitchenware-competition-starter

15

A starter notebook for the Kitchenware classification competition on Kaggle

Jupyter Notebook
Updated Nov 11, 2024

whylogs-workshop

14

The code from the whylogs workshop in DataTalks.Club on 29 March 2022

Jupyter Notebook
Updated Jun 8, 2026

llm-zoomcamp-saturncloud

12

Saturn Cloud starter code for LLM Zoomcamp

Jupyter Notebook
Updated Apr 23, 2026

reading-club-books

12

No description provided for this repository.

Unknown Language
Updated May 27, 2024

podcast-summary-generation

9

This project aims to generate structured PDF reports from podcast interviews, highlighting key takeaways, quotes, and insights. The goal is to create shareable and accessible summaries for a broader audience.

Python
Updated Apr 28, 2026

website-django

5

The DTC website in Django

Jupyter Notebook
Updated Mar 8, 2025

faq

4

FAQ for Zoomcamp courses

Python
Updated Jun 10, 2026

carousel-automation

3

A project to automate carousel creation

CSS
Updated Mar 31, 2026

datamailer

2

Django mailing service for audiences, campaigns, transactional email, and engagement tracking

Python
Updated May 28, 2026

.github

1

No description provided for this repository.

Unknown Language
Updated Jun 10, 2026

courses

1

No description provided for this repository.

HTML
Updated Jan 9, 2026

mediakit

0

DataTalks.Club sponsorship media kit (Jekyll + rustkyll, GitHub Pages)

HTML
Updated Jun 10, 2026

zoomcamp-template

0

Shared template and reference for DataTalks.Club zoomcamps: structure spec, README templates, and collected helper scripts

Jupyter Notebook
Updated Jun 10, 2026

exasol-workshop

0

Workshop starter with auto-refreshing AWS credentials for GitHub Codespaces

Unknown Language
Updated Mar 10, 2026

exasol-workshop-starter

0

Workshop starter with auto-refreshing AWS credentials for GitHub Codespaces

Unknown Language
Updated Mar 10, 2026

surveys

0

Hosting our survey results

TypeScript
Updated Jan 29, 2026

Frequently asked questions

What does DataTalksClub build on GitHub?

DataTalksClub develops educational resources and courses focused on data engineering, machine learning, and MLOps. Their repositories include interactive materials for users to learn and apply data skills effectively.

Which programming languages does DataTalksClub use?

DataTalksClub primarily utilizes Jupyter Notebook, Python, JavaScript, HTML, SCSS, and CSS in their repositories. This diverse mix supports various data-related projects and educational content.

Are DataTalksClub's repositories public?

Yes, all of DataTalksClub's repositories on GitHub are public. This transparency allows anyone interested in data to access valuable resources and participate in learning opportunities.

Is this exposure intended?

Monitor DataTalksClub with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account