RepoGuard
Updated 8 h ago
Stanford NLP

Organization

Public GitHub footprint of Stanford NLP

@stanfordnlp
View profile on GitHub
Stanford, CA

54

Public repositories

68,725

Total stars

2,614

Followers

The Stanford NLP organization has a significant presence on GitHub, showcasing a wide range of repositories primarily in Python, Java, and C. Notable projects include DSPy for language model programming, CoreNLP for various NLP tasks, and the Stanza library for language processing, among others.

Top languages

Python 29HTML 6Java 4Jupyter Notebook 2C 1TeX 1Lua 1OpenEdge ABL 1

Public repositories

dspy

35,002

DSPy: The framework for programming—not prompting—language models

Python
Updated Jun 13, 2026

CoreNLP

10,083

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java
Updated Jun 12, 2026

stanza

7,807

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python
Updated Jun 11, 2026

GloVe

7,220

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

C
Updated Jun 11, 2026

cs224n-winter17-notes

1,601

Course notes for CS224N Winter17

TeX
Updated Jun 7, 2026

pyreft

1,570

Stanford NLP Python library for Representation Finetuning (ReFT)

Python
Updated Jun 10, 2026

treelstm

895

Tree-structured Long Short-Term Memory networks (http://arxiv.org/abs/1503.00075)

Lua
Updated May 27, 2026

pyvene

883

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python
Updated Jun 3, 2026

string2string

563

String-to-String Algorithms for Natural Language Processing

Jupyter Notebook
Updated May 17, 2026

python-stanford-corenlp

517

Python interface to CoreNLP using a bidirectional server-client interface.

Python
Updated May 29, 2026

mac-network

513

Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)

Python
Updated May 23, 2026

phrasal

214

A large-scale statistical machine translation system written in Java.

Java
Updated Mar 30, 2026

spinn

209

SPINN (Stack-augmented Parser-Interpreter Neural Network): fast, batchable, context-aware TreeRNNs

Python
Updated Jun 5, 2026

axbench

198

Stanford NLP Python library for benchmarking the utility of LLM interpretability methods

Python
Updated Jun 12, 2026

coqa-baselines

176

The baselines used in the CoQA paper

Python
Updated Dec 28, 2025

cocoa

162

Framework for learning dialogue agents in a two-player game setting.

Python
Updated Jun 8, 2026

stanza-old

137

Stanford NLP group's shared Python tools.

Python
Updated Apr 19, 2026

chirpycardinal

135

Stanford's Alexa Prize socialbot

Python
Updated Mar 2, 2026

stanfordnlp

123

[Deprecated] This library has been renamed to "Stanza". Latest development at: https://github.com/stanfordnlp/stanza

Python
Updated May 26, 2026

wge

118

Workflow-Guided Exploration: sample-efficient RL agent for web tasks

Python
Updated Mar 4, 2026

pdf-struct

94

Logical structure analysis for visually structured documents

Python
Updated Mar 17, 2026

cs224n-web

61

http://cs224n.stanford.edu

HTML
Updated May 11, 2026

thoughtbubbles

43

No description provided for this repository.

Python
Updated May 14, 2026

stanza-train

41

Model training tutorials for the Stanza Python NLP Library

Python
Updated Nov 26, 2025

ColBERT-QA

39

Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)

Unknown Language
Updated May 24, 2026

phrasenode

38

Mapping natural language commands to web elements

Python
Updated Dec 19, 2025

stanza-resources

36

No description provided for this repository.

Unknown Language
Updated Jun 12, 2026

contract-nli-bert

36

A baseline system for ContractNLI (https://stanfordnlp.github.io/contract-nli/)

Python
Updated May 10, 2026

sempre-plot

28

Semantic Parser with Execution

Java
Updated Feb 25, 2026

color-describer

26

Code for Learning to Generate Compositional Color Descriptions

OpenEdge ABL
Updated Aug 12, 2024

miniwob-plusplus-demos

21

Demos for the MiniWoB++ benchmark

Unknown Language
Updated Oct 14, 2025

python-corenlp-protobuf

20

Python bindings for Stanford CoreNLP's protobufs.

Python
Updated Aug 12, 2024

multi-distribution-retrieval

16

Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval

Python
Updated Feb 18, 2026

huggingface-models

15

Scripts for pushing models to huggingface repos

Python
Updated Mar 13, 2026

contract-nli

10

ContractNLI: A Dataset for Document-level Natural Language Inference for Contracts

HTML
Updated Apr 18, 2026

en-worldwide-newswire

10

An English NER dataset built from foreign newswire

Python
Updated Feb 8, 2025

universe

9

Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.

Python
Updated Mar 17, 2026

nlp-meetup-demo

9

No description provided for this repository.

Java
Updated Feb 11, 2026

sentiment-treebank

8

Updated version of SST

Python
Updated Aug 12, 2024

handparsed-treebank

7

Extra hand parsed data for training models

Perl
Updated Jun 8, 2026

cs224n_gpt

7

No description provided for this repository.

Python
Updated Apr 30, 2026

plot-data

6

datasets for plotting

Jupyter Notebook
Updated Aug 12, 2024

preft

4

No description provided for this repository.

Python
Updated Jun 6, 2026

coqa

3

CoQA -- A Conversational Question Answering Challenge

Shell
Updated Jan 29, 2026

plot-interface

3

Web interface for the plotting project

JavaScript
Updated Aug 12, 2024

pdf-struct-dataset

2

Dataset for pdf-struct (https://github.com/stanfordnlp/pdf-struct)

HTML
Updated Mar 4, 2026

chirpy-parlai-blenderbot-fork

2

A fork of ParlAI supporting Chirpy Cardinal's custom neural generator

Python
Updated Aug 25, 2024

pdf-struct-models

2

A repository for hosting models for https://github.com/stanfordnlp/pdf-struct

HTML
Updated Oct 30, 2023

stanford-nlp-history

1

A history of NLP at Stanford, initially written for the Stanford NLP 25 year reunion in 2025

HTML
Updated Mar 30, 2026

nn-depparser

1

A re-implementation of nndep using PyTorch.

Python
Updated Oct 17, 2024

chirpycardinal23

1

Stanford's Alexa Prize socialbot [internal]

Unknown Language
Updated Jul 16, 2024

sindhi-tokenization

0

Sindhi tokenization data from ISRA

Unknown Language
Updated Sep 8, 2023

sail-blog-new-post

0

The repository for making new post submissions to the SAIL Blog

HTML
Updated Apr 6, 2021

corenlp-docs-dev

0

No description provided for this repository.

Unknown Language
Updated Jul 16, 2020

Frequently asked questions

What does stanfordnlp build on GitHub?

Stanford NLP builds a variety of tools and libraries focused on natural language processing. Key repositories include DSPy, CoreNLP, and Stanza, which cater to different aspects of NLP tasks.

Which programming languages does stanfordnlp use?

The primary programming languages used by stanfordnlp include Python, Java, C, Jupyter Notebook, HTML, and TeX, reflecting a diverse approach to natural language processing and software development.

Are stanfordnlp's repositories public?

Yes, all of stanfordnlp's repositories are public on GitHub. This transparency allows users and developers to access, use, and contribute to their various NLP tools and libraries.

Is this exposure intended?

Monitor Stanford NLP with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account