Explosion

Organization

Public GitHub footprint of Explosion

@explosion

Software company specializing in developer tools and tailored solutions for AI and Natural Language Processing

Berlin, Germany

79

Public repositories

57,166

Total stars

1,488

Followers

Explosion is a GitHub organization that builds developer tools and solutions for AI and Natural Language Processing. Its public repositories include spaCy, thinc, and spacy-course, and are written mostly in Python, with Cython and Jupyter Notebook the next most common languages.

Top languages

Python 53, Cython 5, Jupyter Notebook 4, C++ 3, C 3, JavaScript 2, TypeScript 2, CSS 1

Public repositories

spaCy

33,658

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python
Updated Jun 13, 2026

thinc

2,890

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Python
Updated Jun 10, 2026

spacy-course

2,422

👩‍🏫 Advanced NLP with spaCy: A free online course

Python
Updated Jun 10, 2026

spacy-models

1,881

💫 Models for the spaCy Natural Language Processing (NLP) library

Python
Updated Jun 13, 2026

sense2vec

1,672

🦆 Contextually-keyed word vectors

Python
Updated Jun 11, 2026

projects

1,431

🪐 End-to-end NLP workflows from prototype to production

Python
Updated Jun 2, 2026

spacy-transformers

1,406

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

Python
Updated Jun 11, 2026

spacy-llm

1,392

🦙 Integrating LLMs into structured NLP pipelines

Python
Updated Jun 5, 2026

spacy-layout

903

📚 Process PDFs, Word documents and more with spaCy

Python
Updated Jun 6, 2026

curated-transformers

896

🤖 A PyTorch library of curated Transformer models and their composable components

Python
Updated Jun 5, 2026

spacy-streamlit

857

👑 spaCy building blocks and visualizers for Streamlit apps

Python
Updated May 28, 2026

spacy-stanza

748

💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy

Python
Updated May 11, 2026

prodigy-recipes

507

🍳 Recipes for Prodigy, our fully scriptable annotation tool

Jupyter Notebook
Updated May 12, 2026

srsly

481

🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)

Python
Updated Mar 23, 2026

wasabi

469

🍣 A lightweight console printing and formatting toolkit

Python
Updated May 8, 2026

cymem

461

💥 Cython memory pool for RAII-style memory management

Cython
Updated May 17, 2026

displacy

345

💥 displaCy.js: An open-source NLP visualiser for the modern web

JavaScript
Updated Apr 9, 2026

floret

341

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

C++
Updated May 12, 2026

prodigy-openai-recipes

322

✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3

Python
Updated Mar 5, 2026

lightnet

320

🌓 Bringing pjreddie's DarkNet out of the shadows #yolo

C
Updated Aug 27, 2025

spacy-notebooks

288

💫 Jupyter notebooks for spaCy examples and tutorials

Jupyter Notebook
Updated May 19, 2026

spacy-services

239

💫 REST microservices for various spaCy-related tasks

Python
Updated Apr 17, 2026

cython-blis

237

💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!

C
Updated May 14, 2026

displacy-ent

200

💥 displaCy-ent.js: An open-source named entity visualiser for the modern web

CSS
Updated Jan 25, 2026

tokenizations

195

Robust and fast tokenizations alignment library for Rust and Python (https://tamuhey.github.io/tokenizations/)

Rust
Updated May 18, 2026

confection

193

🍬 Confection: the sweetest config system for Python

Python
Updated Apr 27, 2026

jupyterlab-prodigy

189

🧬 A JupyterLab extension for annotating data with Prodigy

TypeScript
Updated May 9, 2026

catalogue

183

Super lightweight function registries for your library

Python
Updated Jun 13, 2026

spacymoji

182

💙 Emoji handling and meta data for spaCy with custom extension attributes

Python
Updated Mar 8, 2026

wheelwright

175

🎡 Automated build repo for Python wheels and source packages

Python
Updated Jan 9, 2026

spacy-dev-resources

123

💫 Scripts, tools and resources for developing spaCy

Python
Updated Jun 8, 2026

spacy-lookups-data

115

📂 Additional lookup tables and data resources for spaCy

Python
Updated Mar 29, 2026

radicli

110

🕊️ Radically lightweight command-line interfaces

Python
Updated May 30, 2026

spacy-experimental

105

🧪 Cutting-edge experimental spaCy components and features

Python
Updated Dec 15, 2025

thinc-apple-ops

103

🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library

Cython
Updated Mar 28, 2026

weasel

94

🦦 weasel: A small and easy workflow system

Python
Updated Apr 27, 2026

talks

94

💥 Browser-based slides or PDFs of our talks and presentations

JavaScript
Updated Aug 20, 2024

healthsea

91

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.

Python
Updated Apr 12, 2026

preshed

88

💥 Cython hash tables that assume keys are pre-hashed

Cython
Updated Apr 7, 2026

spacy-pkuseg

70

The pkuseg toolkit for multi-domain Chinese word segmentation

Python
Updated Apr 13, 2026

spacy-huggingface-pipelines

65

💥 Use Hugging Face text and token classification pipelines directly in spaCy

Python
Updated May 26, 2026

spacy-ray

56

☄️ Parallel and distributed training with spaCy and Ray

Python
Updated Jul 24, 2025

murmurhash

47

💥 Cython bindings for MurmurHash2

C++
Updated Apr 19, 2026

ml-datasets

47

🌊 Machine learning dataset loaders for testing and example scripts

Python
Updated Mar 26, 2026

spacy-huggingface-hub

45

🤗 Push your spaCy pipelines to the Hugging Face Hub

Python
Updated Jun 3, 2026

assets

45

💥 Explosion Assets

Unknown Language
Updated Oct 30, 2025

wikid

39

Generate a SQLite database from Wikipedia & Wikidata dumps.

Python
Updated May 13, 2026

prodigy-pdf

37

A Prodigy plugin for PDF annotation

Python
Updated Feb 9, 2026

spacy-alignments

35

💫 A spaCy package for Yohei Tamura's Rust tokenizations library

Python
Updated Mar 27, 2026

spacy-curated-transformers

32

spaCy entry points for Curated Transformers

Python
Updated Mar 27, 2026

spacy-vscode

32

spaCy extension for Visual Studio Code

Python
Updated Feb 25, 2026

vscode-prodigy

30

🧬 A VS Code extension for annotating data with Prodigy

TypeScript
Updated Jun 11, 2024

prodigy-hf

21

Train Hugging Face models on top of Prodigy annotations

Python
Updated Nov 19, 2024

spacy-benchmarks

20

💫 Runtime performance comparison of spaCy against other NLP libraries

Python
Updated Jan 27, 2023

os-signpost

18

Wrapper for the macOS signpost API

Cython
Updated Jun 5, 2026

spacy-vectors-builder

18

🌸 Train floret vectors

Python
Updated Sep 16, 2024

conll-2012

13

A slightly cleaned up version of the scripts & data for the CoNLL 2012 Coreference task.

Python
Updated May 30, 2026

prodigy-evaluate

13

🔎 A Prodigy plugin for evaluating spaCy pipelines

Python
Updated Nov 23, 2024

curated-tokenizers

12

Lightweight piece tokenization library

Cython
Updated Oct 28, 2024

spacy-loggers

12

📟 Logging utilities for spaCy

Python
Updated May 6, 2024

prodigy-segment

10

Select pixels in Prodigy via Facebook's Segment-Anything model.

Python
Updated Nov 19, 2025

thinc_gpu_ops

9

🔮 GPU kernels for Thinc

C++
Updated Jan 28, 2023

prodigy-ann

5

A Prodigy plugin for ANN techniques

Python
Updated Nov 23, 2025

prodigy-whisper

5

Audio transcription with OpenAI's Whisper model in the loop.

Python
Updated Dec 4, 2024

spacy-legacy

4

🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility

Python
Updated Jan 4, 2024

princetondh

4

Code for our presentation at Princeton DH, April 2023.

Jupyter Notebook
Updated Dec 19, 2023

prodigy-lunr

3

A Prodigy plugin for document search via LUNR

Python
Updated Apr 4, 2025

ec2buildwheel

3

No description provided for this repository.

Python
Updated Apr 4, 2025

fastapi-explosion-extras

2

No description provided for this repository.

Python
Updated May 4, 2026

.github

2

:octocat: GitHub settings

Unknown Language
Updated Oct 26, 2025

span-labeling-datasets

2

Loaders for various span labeling datasets

Python
Updated Dec 31, 2024

spacy-biaffine-parser

1

No description provided for this repository.

Python
Updated May 8, 2024

aiGrunn-2023

1

Materials for the aiGrunn 2023 talk on spaCy Transformer pipelines

Python
Updated Nov 10, 2023

spacy-io-binder

1

📒 Repository used to build Binder images for the interactive spaCy code examples

Jupyter Notebook
Updated Jan 23, 2023

blis

1

BLAS-like Library Instantiation Software Framework

C
Updated Sep 16, 2022

ellf-terraform-cluster

0

Terraform definitions for self-hosted Ellf clusters

HCL
Updated Jun 9, 2026

gha-cibuildwheel

0

No description provided for this repository.

Unknown Language
Updated Mar 24, 2026

curated-transformers-addons

0

Add-ons for Curated Transformers

Python
Updated Oct 4, 2023

nginx_acm_ssl_proxy

0

Nginx container that allows for environmental variable use to set nginx configuration.

Shell
Updated Aug 19, 2022

Frequently asked questions

What does Explosion build on GitHub?

Explosion builds tools and libraries for AI and Natural Language Processing. Notable projects include spaCy for NLP and thinc for deep learning.

Which programming languages does Explosion use?

Most of Explosion's repositories are written in Python, followed by Cython, Jupyter Notebook, C++, C, JavaScript, TypeScript, and CSS.

Are Explosion's repositories public?

Yes, the repositories listed here are public on GitHub, so anyone can browse the code and contribute to the projects.
