Software company specializing in developer tools and tailored solutions for AI and Natural Language Processing
79
Repositórios públicos
57.166
Total de estrelas
1.488
Seguidores
A organização Explosion possui uma presença significativa no GitHub, com uma ampla gama de repositórios focados em ferramentas para desenvolvedores e soluções personalizadas em Inteligência Artificial e Processamento de Linguagem Natural. As principais linguagens utilizadas incluem Python e Cython, com repositórios notáveis como spaCy e thinc, que são amplamente utilizados na comunidade de desenvolvedores.
💫 Industrial-strength Natural Language Processing (NLP) in Python
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
👩🏫 Advanced NLP with spaCy: A free online course
💫 Models for the spaCy Natural Language Processing (NLP) library
🦆 Contextually-keyed word vectors
🪐 End-to-end NLP workflows from prototype to production
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
🦙 Integrating LLMs into structured NLP pipelines
📚 Process PDFs, Word documents and more with spaCy
🤖 A PyTorch library of curated Transformer models and their composable components
👑 spaCy building blocks and visualizers for Streamlit apps
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
🍣 A lightweight console printing and formatting toolkit
💥 Cython memory pool for RAII-style memory management
:boom: displaCy.js: An open-source NLP visualiser for the modern web
🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
🌓 Bringing pjreddie's DarkNet out of the shadows #yolo
💫 Jupyter notebooks for spaCy examples and tutorials
💫 REST microservices for various spaCy-related tasks
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
:boom: displaCy-ent.js: An open-source named entity visualiser for the modern web
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
:candy: Confection: the sweetest config system for Python
🧬 A JupyterLab extension for annotating data with Prodigy
Super lightweight function registries for your library
💙 Emoji handling and meta data for spaCy with custom extension attributes
🎡 Automated build repo for Python wheels and source packages
💫 Scripts, tools and resources for developing spaCy
📂 Additional lookup tables and data resources for spaCy
🕊️ Radically lightweight command-line interfaces
🧪 Cutting-edge experimental spaCy components and features
🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library
🦦 weasel: A small and easy workflow system
💥 Browser-based slides or PDFs of our talks and presentations
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
💥 Cython hash tables that assume keys are pre-hashed
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
💥 Use Hugging Face text and token classification pipelines directly in spaCy
☄️ Parallel and distributed training with spaCy and Ray
💥 Cython bindings for MurmurHash2
🌊 Machine learning dataset loaders for testing and example scripts
🤗 Push your spaCy pipelines to the Hugging Face Hub
💥 Explosion Assets
Generate a SQLite database from Wikipedia & Wikidata dumps.
A Prodigy plugin for PDF annotation
💫 A spaCy package for Yohei Tamura's Rust tokenizations library
spaCy entry points for Curated Transformers
spaCy extension for Visual Studio Code
🧬 A VS Code extension for annotating data with Prodigy
Train huggingface models on top of Prodigy annotations
💫 Runtime performance comparison of spaCy against other NLP libraries
Wrapper for the macOS signpost API
🌸 Train floret vectors
A slightly cleaned up version of the scripts & data for the CoNLL 2012 Coreference task.
🔎 A Prodigy plugin for evaluating spaCy pipelines
Lightweight piece tokenization library
📟 Logging utilities for spaCy
Select pixels in Prodigy via Facebook's Segment-Anything model.
🔮 GPU kernels for Thinc
A Prodigy pluging for ANN techniques
Audio transcription with OpenAI's whisper model in the loop.
🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
Code for our presentation in Princeton DH 2023 April.
A Prodigy plugin for document search via LUNR
Nenhuma descrição fornecida para este repositório.
Nenhuma descrição fornecida para este repositório.
:octocat: GitHub settings
Loaders for various span labeling datasets
Nenhuma descrição fornecida para este repositório.
Materials for the aiGrunn 2023 talk on spaCy Transformer pipelines
📒 Repository used to build Binder images for the interactive spaCy code examples
BLAS-like Library Instantiation Software Framework
Terraform definitions for self-hosted Ellf clusters
Nenhuma descrição fornecida para este repositório.
Add-ons for Curated Transformers
Nginx container that allows for environmental variable use to set nginx configuration.
A Explosion constrói uma variedade de ferramentas e bibliotecas no GitHub, com foco em Inteligência Artificial e Processamento de Linguagem Natural. Projetos notáveis incluem spaCy, thinc e spacy-course, que são utilizados por desenvolvedores em todo o mundo.
A Explosion utiliza várias linguagens de programação em seus repositórios, com ênfase em Python e Cython. Outras linguagens como Jupyter Notebook, C++, C e JavaScript também estão presentes em suas contribuições no GitHub.
Sim, todos os repositórios da Explosion são públicos no GitHub. Isso permite que desenvolvedores e pesquisadores acessem e contribuam para os projetos, promovendo uma comunidade ativa em torno das ferramentas de Processamento de Linguagem Natural.
Monitore Explosion com o RepoGuard e receba alertas no momento em que um novo repositório público aparecer.
Monitore esta conta