Software company specializing in developer tools and tailored solutions for AI and Natural Language Processing
79
Öffentliche Repositories
57.166
Sterne gesamt
1.488
Follower
Die Organisation Explosion hat eine bedeutende Präsenz auf GitHub, mit einer Vielzahl von öffentlichen Repositories, die sich auf KI und Natural Language Processing konzentrieren. Zu den wichtigsten Projekten gehören spaCy, thinc und spacy-course, die in Python und anderen Programmiersprachen entwickelt wurden. Diese Repositories bieten Entwicklern wertvolle Werkzeuge und Ressourcen.
💫 Industrial-strength Natural Language Processing (NLP) in Python
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
👩🏫 Advanced NLP with spaCy: A free online course
💫 Models for the spaCy Natural Language Processing (NLP) library
🦆 Contextually-keyed word vectors
🪐 End-to-end NLP workflows from prototype to production
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
🦙 Integrating LLMs into structured NLP pipelines
📚 Process PDFs, Word documents and more with spaCy
🤖 A PyTorch library of curated Transformer models and their composable components
👑 spaCy building blocks and visualizers for Streamlit apps
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
🍣 A lightweight console printing and formatting toolkit
💥 Cython memory pool for RAII-style memory management
:boom: displaCy.js: An open-source NLP visualiser for the modern web
🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
🌓 Bringing pjreddie's DarkNet out of the shadows #yolo
💫 Jupyter notebooks for spaCy examples and tutorials
💫 REST microservices for various spaCy-related tasks
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
:boom: displaCy-ent.js: An open-source named entity visualiser for the modern web
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
:candy: Confection: the sweetest config system for Python
🧬 A JupyterLab extension for annotating data with Prodigy
Super lightweight function registries for your library
💙 Emoji handling and meta data for spaCy with custom extension attributes
🎡 Automated build repo for Python wheels and source packages
💫 Scripts, tools and resources for developing spaCy
📂 Additional lookup tables and data resources for spaCy
🕊️ Radically lightweight command-line interfaces
🧪 Cutting-edge experimental spaCy components and features
🍏 Make Thinc faster on macOS by calling into Apple's native Accelerate library
🦦 weasel: A small and easy workflow system
💥 Browser-based slides or PDFs of our talks and presentations
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
💥 Cython hash tables that assume keys are pre-hashed
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
💥 Use Hugging Face text and token classification pipelines directly in spaCy
☄️ Parallel and distributed training with spaCy and Ray
💥 Cython bindings for MurmurHash2
🌊 Machine learning dataset loaders for testing and example scripts
🤗 Push your spaCy pipelines to the Hugging Face Hub
💥 Explosion Assets
Generate a SQLite database from Wikipedia & Wikidata dumps.
A Prodigy plugin for PDF annotation
💫 A spaCy package for Yohei Tamura's Rust tokenizations library
spaCy entry points for Curated Transformers
spaCy extension for Visual Studio Code
🧬 A VS Code extension for annotating data with Prodigy
Train huggingface models on top of Prodigy annotations
💫 Runtime performance comparison of spaCy against other NLP libraries
Wrapper for the macOS signpost API
🌸 Train floret vectors
A slightly cleaned up version of the scripts & data for the CoNLL 2012 Coreference task.
🔎 A Prodigy plugin for evaluating spaCy pipelines
Lightweight piece tokenization library
📟 Logging utilities for spaCy
Select pixels in Prodigy via Facebook's Segment-Anything model.
🔮 GPU kernels for Thinc
A Prodigy pluging for ANN techniques
Audio transcription with OpenAI's whisper model in the loop.
🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
Code for our presentation in Princeton DH 2023 April.
A Prodigy plugin for document search via LUNR
Keine Beschreibung für dieses Repository vorhanden.
Keine Beschreibung für dieses Repository vorhanden.
:octocat: GitHub settings
Loaders for various span labeling datasets
Keine Beschreibung für dieses Repository vorhanden.
Materials for the aiGrunn 2023 talk on spaCy Transformer pipelines
📒 Repository used to build Binder images for the interactive spaCy code examples
BLAS-like Library Instantiation Software Framework
Terraform definitions for self-hosted Ellf clusters
Keine Beschreibung für dieses Repository vorhanden.
Add-ons for Curated Transformers
Nginx container that allows for environmental variable use to set nginx configuration.
Explosion entwickelt auf GitHub eine Reihe von Tools und Bibliotheken, die sich auf Natural Language Processing und KI konzentrieren. Wichtige Projekte sind spaCy, thinc und spacy-course, die Entwicklern helfen, fortschrittliche NLP-Anwendungen zu erstellen.
Explosion verwendet hauptsächlich Python für seine Projekte, unterstützt jedoch auch Cython, Jupyter Notebook, C++, C und JavaScript. Diese Sprachen ermöglichen eine effektive Entwicklung von Softwarelösungen für KI und NLP.
Ja, alle Repositories von explosion sind öffentlich zugänglich. Dies ermöglicht es Entwicklern und Forschern, die bereitgestellten Werkzeuge und Ressourcen zu nutzen und zur Weiterentwicklung der Projekte beizutragen.
Überwache Explosion mit RepoGuard und werde benachrichtigt, sobald ein neues öffentliches Repository auftaucht.
Diesen Account überwachen