Stanford NLP는 GitHub에서 다양한 공개 저장소를 운영하고 있으며, 주로 Python, Java, C, TeX와 같은 프로그래밍 언어를 사용합니다. 주요 저장소로는 DSPy, CoreNLP, stanza 등이 있으며, 이는 자연어 처리(NLP)와 관련된 도구들을 제공합니다.
DSPy: The framework for programming—not prompting—language models
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
Course notes for CS224N Winter17
Stanford NLP Python library for Representation Finetuning (ReFT)
Tree-structured Long Short-Term Memory networks (http://arxiv.org/abs/1503.00075)
Stanford NLP Python library for understanding and improving PyTorch models via interventions
String-to-String Algorithms for Natural Language Processing
Python interface to CoreNLP using a bidirectional server-client interface.
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
A large-scale statistical machine translation system written in Java.
SPINN (Stack-augmented Parser-Interpreter Neural Network): fast, batchable, context-aware TreeRNNs
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
The baselines used in the CoQA paper
Framework for learning dialogue agents in a two-player game setting.
Stanford NLP group's shared Python tools.
Stanford's Alexa Prize socialbot
[Deprecated] This library has been renamed to "Stanza". Latest development at: https://github.com/stanfordnlp/stanza
Workflow-Guided Exploration: sample-efficient RL agent for web tasks
Logical structure analysis for visually structured documents
http://cs224n.stanford.edu
이 저장소에 대한 설명이 제공되지 않았습니다.
Model training tutorials for the Stanza Python NLP Library
Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)
Mapping natural language commands to web elements
이 저장소에 대한 설명이 제공되지 않았습니다.
A baseline system for ContractNLI (https://stanfordnlp.github.io/contract-nli/)
Semantic Parser with Execution
Code for Learning to Generate Compositional Color Descriptions
Demos for the MiniWoB++ benchmark
Python bindings for Stanford CoreNLP's protobufs.
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
Scripts for pushing models to huggingface repos
ContractNLI: A Dataset for Document-level Natural Language Inference for Contracts
An English NER dataset built from foreign newswire
Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.
이 저장소에 대한 설명이 제공되지 않았습니다.
Updated version of SST
Extra hand parsed data for training models
이 저장소에 대한 설명이 제공되지 않았습니다.
datasets for plotting
이 저장소에 대한 설명이 제공되지 않았습니다.
CoQA -- A Conversational Question Answering Challenge
Web interface for the plotting project
Dataset for pdf-struct (https://github.com/stanfordnlp/pdf-struct)
A fork of ParlAI supporting Chirpy Cardinal's custom neural generator
A repository for hosting models for https://github.com/stanfordnlp/pdf-struct
A history of NLP at Stanford, initially written for the Stanford NLP 25 year reunion in 2025
A re-implementation of nndep using PyTorch.
Stanford's Alexa Prize socialbot [internal]
Sindhi tokenization data from ISRA
The repository for making new post submissions to the SAIL Blog
이 저장소에 대한 설명이 제공되지 않았습니다.
stanfordnlp는 자연어 처리(NLP) 관련 다양한 도구와 라이브러리를 개발합니다. 주요 프로젝트로는 CoreNLP와 stanza가 있으며, 이는 여러 언어의 토큰화 및 구문 분석 기능을 제공합니다.
stanfordnlp는 주로 Python, Java, C, TeX 및 Jupyter Notebook과 같은 프로그래밍 언어를 사용합니다. 이러한 언어들은 자연어 처리 도구 개발에 널리 활용됩니다.
예, stanfordnlp는 GitHub에서 모든 저장소를 공개하고 있습니다. 이를 통해 연구자와 개발자들은 다양한 NLP 도구와 라이브러리를 자유롭게 이용할 수 있습니다.