The place to talk about data
Public repositories: 34
Total stars: 80,681
Followers: 8,027
DataTalksClub has a substantial public GitHub presence focused on data-centric education and projects. Its repositories consist mainly of Jupyter Notebook, Python, and JavaScript. Notable projects such as data-engineering-zoomcamp and mlops-zoomcamp provide free, structured courses for data enthusiasts worldwide.
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026.
Free MLOps course from DataTalks.Club
Learn ML engineering for free in 4 months!
LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answers questions about your knowledge base.
AI Dev Tools Zoomcamp is a free course that helps you use AI tools to write better code, faster. The first cohort starts on November 18, 2025.
Course Materials for Analytics in Stock Markets Zoomcamp
Learn by doing: DIY project groups at DataTalks.Club
A list of awesome data podcasts
The web page for DataTalks.Club, a global online community of data enthusiasts
Backup for NYC TLC data for the DE Zoomcamp course
A free mini-course about Open-Source LLMs
Learning paths for data roles
Django-based course management platform for Zoomcamps
Data analytics interview questions and answers
Public data and analytics for our open course
The getting started notebook for the DTC Zoomcamp Q&A challenge
Notes from our NLP reading club!
A starter notebook for the Kitchenware classification competition on Kaggle
The code from the whylogs workshop in DataTalks.Club on 29 March 2022
Saturn Cloud starter code for LLM Zoomcamp
This project aims to generate structured PDF reports from podcast interviews, highlighting key takeaways, quotes, and insights. The goal is to create shareable and accessible summaries for a broader audience.
The DTC website in Django
FAQ for Zoomcamp courses
A project to automate carousel creation
Django mailing service for audiences, campaigns, transactional email, and engagement tracking
DataTalks.Club sponsorship media kit (Jekyll + rustkyll, GitHub Pages)
Shared template and reference for DataTalks.Club zoomcamps: structure spec, README templates, and collected helper scripts
Workshop starter with auto-refreshing AWS credentials for GitHub Codespaces
Hosting our survey results
DataTalksClub develops educational resources and courses focused on data engineering, machine learning, and MLOps. Its repositories include hands-on materials for learning and applying data skills.
DataTalksClub's repositories primarily use Jupyter Notebook, Python, JavaScript, HTML, SCSS, and CSS, covering both course material and web projects.
All of the DataTalksClub repositories listed here are public, so anyone interested in data can access the resources and take part in the courses.