10 h ago에 업데이트됨

Organization

Ai2의 공개 GitHub 발자국

@allenai

GitHub에서 프로필 보기

Seattle, WA

584

공개 저장소

77,201

총 별점

4,769

팔로워

allenai는 Seattle, WA에 위치한 조직으로, Python, Jupyter Notebook, Scala, Rust, C#, Lua와 같은 다양한 프로그래밍 언어를 사용하여 폭넓은 공개 GitHub 저장소를 운영하고 있습니다. 주요 프로젝트로는 olmocr, allennlp, OLMo 등이 있으며, NLP와 AI 연구에 중점을 두고 있습니다.

주요 언어

Python 83Jupyter Notebook 3Scala 2Rust 2C# 1Lua 1HTML 1Java 1

공개 저장소

olmocr

★17,387

Toolkit for linearizing PDFs for LLM datasets/training

Python

업데이트됨 2026년 6월 13일

allennlp

★11,892

An open-source NLP research library, built on PyTorch.

Python

업데이트됨 2026년 6월 13일

OLMo

★6,554

Modeling, training, eval, and inference code for OLMo

Python

업데이트됨 2026년 6월 12일

open-instruct

★3,752

AllenAI's post-training codebase

Python

업데이트됨 2026년 6월 12일

RL4LMs

★2,388

A modular RL library to fine-tune language models to human preferences

Python

업데이트됨 2026년 6월 6일

longformer

★2,196

Longformer: The Long-Document Transformer

Python

업데이트됨 2026년 6월 5일

scispacy

★1,964

A full spaCy pipeline and models for scientific/biomedical documents.

Python

업데이트됨 2026년 6월 12일

ai2thor

★1,739

An open-source platform for Visual AI.

업데이트됨 2026년 6월 11일

scibert

★1,703

A BERT model for scientific text.

Python

업데이트됨 2026년 6월 10일

dolma

★1,508

Data and tools for generating and inspecting OLMo pre-training data.

Python

업데이트됨 2026년 6월 8일

objaverse-xl

★1,297

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Python

업데이트됨 2026년 6월 13일

OLMo-core

★1,289

PyTorch building blocks for the OLMo ecosystem

Python

업데이트됨 2026년 6월 13일

s2orc

★1,064

S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/

Python

업데이트됨 2026년 6월 10일

natural-instructions

★1,047

Expanding natural instructions

Python

업데이트됨 2026년 6월 10일

OLMoE

★1,026

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook

업데이트됨 2026년 6월 9일

molmo

★914

Code for the Molmo Vision-Language Model

Python

업데이트됨 2026년 6월 11일

XNOR-Net

★870

ImageNet classification using binary Convolutional Neural Networks

Lua

업데이트됨 2026년 6월 9일

papermage

★797

library supporting NLP and CV research on scientific papers

Python

업데이트됨 2026년 6월 8일

visprog

★773

Official code for VisProg (CVPR 2023 Best Paper!)

Python

업데이트됨 2026년 6월 8일

scitldr

★759

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 9일

pdffigures2

★748

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Scala

업데이트됨 2026년 6월 7일

reward-bench

★721

RewardBench: the first evaluation tool for reward models.

Python

업데이트됨 2026년 6월 12일

molmo2

★643

Code for the Molmo2 Vision-Language Model

Python

업데이트됨 2026년 6월 12일

molmoact2

★605

Official Repository for MolmoAct2

Python

업데이트됨 2026년 6월 13일

specter

★583

SPECTER: Document-level Representation Learning using Citation-informed Transformers

Python

업데이트됨 2026년 6월 13일

WildDet3D

★576

Allen Institute for AI: WildDet3D: Scaling Promptable 3D Detection in the Wild

Python

업데이트됨 2026년 6월 12일

molmoweb

★567

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 11일

allennlp-models

★563

Officially supported AllenNLP models

Python

업데이트됨 2026년 6월 9일

Holodeck

★553

CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.

Python

업데이트됨 2026년 6월 6일

dont-stop-pretraining

★543

Code associated with the Don't Stop Pretraining ACL 2020 paper

Python

업데이트됨 2026년 6월 5일

OLMoASR

★491

An open-source implementation of Whisper

Python

업데이트됨 2026년 6월 3일

s2orc-doc2json

★469

Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)

Python

업데이트됨 2026년 6월 6일

procthor

★441

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Python

업데이트됨 2026년 6월 12일

deep_qa

★403

A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)

Python

업데이트됨 2026년 6월 6일

allenact

★382

An open source framework for research in Embodied-AI from AI2.

Python

업데이트됨 2026년 6월 9일

olmes

★379

Reproducible, flexible LLM evaluations

Python

업데이트됨 2026년 6월 10일

molmoact

★369

Official Repository for MolmoAct

Python

업데이트됨 2026년 6월 12일

vla-evaluation-harness

★368

One framework to evaluate any VLA model on any robot simulation benchmark.

Python

업데이트됨 2026년 6월 12일

ScienceWorld

★363

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Scala

업데이트됨 2026년 6월 10일

molmospaces

★358

An end-to-end open ecosystem for robot learning

Python

업데이트됨 2026년 6월 12일

satlas-super-resolution

★341

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 10일

ai2-scholarqa-lib

★281

Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

Python

업데이트됨 2026년 6월 7일

satlas

★280

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 11일

s2-folks

★275

Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.

알 수 없는 언어

업데이트됨 2026년 6월 10일

scifact

★263

Data and models for the SciFact verification task.

Python

업데이트됨 2026년 6월 10일

WildBench

★254

Benchmarking LLMs with Challenging Tasks from Real Users

Python

업데이트됨 2026년 6월 8일

olmoearth_pretrain

★246

Earth system foundation model data, training, and eval

Python

업데이트됨 2026년 6월 12일

asta-paper-finder

★244

frozen-in-time version of our Paper Finder agent for reproducing evaluation results

Python

업데이트됨 2026년 6월 12일

real-toxicity-prompts

★233

이 저장소에 대한 설명이 제공되지 않았습니다.

Jupyter Notebook

업데이트됨 2026년 6월 11일

discoveryworld

★215

A virtual environment for developing and evaluating automated scientific discovery agents.

Python

업데이트됨 2026년 6월 10일

hidden-networks

★198

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 8일

autodiscovery-neurips

★182

Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"

Python

업데이트됨 2026년 6월 4일

medicat

★176

Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references

Python

업데이트됨 2026년 6월 12일

pixmo-docs

★163

ACL 2025: Synthetic data generation pipelines for text-rich images.

Python

업데이트됨 2026년 6월 5일

discoverybench

★147

Discovering Data-driven Hypotheses in the Wild

Python

업데이트됨 2026년 6월 12일

SERA

★146

Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.

Python

업데이트됨 2026년 6월 13일

satlaspretrain_models

★144

이 저장소에 대한 설명이 제공되지 않았습니다.

Jupyter Notebook

업데이트됨 2026년 6월 9일

IFBench

★142

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 11일

agent-baselines

★142

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 8일

SPECTER2

★136

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 5일

bolmo-core

★134

Code for Bolmo: Byteifying the Next Generation of Language Models

Python

업데이트됨 2026년 6월 10일

wildguard

★125

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python

업데이트됨 2026년 6월 12일

aokvqa

★116

Official repository for the A-OKVQA dataset

Python

업데이트됨 2026년 6월 5일

asta-bench

★109

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 13일

S2AND

★109

Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

Python

업데이트됨 2026년 6월 4일

infinigram-api

★101

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 12일

DecomP

★99

Repository for Decomposed Prompting

Python

업데이트됨 2026년 6월 9일

robothor-challenge

★99

RoboTHOR Challenge

Python

업데이트됨 2026년 6월 4일

MolmoBot

★90

Code and website for "MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation".

Python

업데이트됨 2026년 6월 10일

rslearn

★89

A tool for developing remote sensing datasets and models.

Python

업데이트됨 2026년 6월 11일

duplodocus

★85

Tooling for exact and MinHash deduplication of large-scale text datasets

Rust

업데이트됨 2026년 6월 5일

olmoearth_projects

★74

OlmoEarth projects

Python

업데이트됨 2026년 6월 12일

codenav

★69

CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries.

Python

업데이트됨 2026년 6월 6일

atlantes

★66

Efficient and low latency real-time global-scale GPS trajectory modeling

Python

업데이트됨 2026년 6월 10일

phone2proc

★63

📱👉🏠 Perform conditional procedural generation to generate houses like your own!

Python

업데이트됨 2026년 6월 10일

paper-embedding-public-apis

★60

Collection of public APIs for embedding scientific papers

알 수 없는 언어

업데이트됨 2026년 6월 7일

ruletaker

★55

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 7일

EMO

★42

이 저장소에 대한 설명이 제공되지 않았습니다.

HTML

업데이트됨 2026년 6월 10일

fermi

★37

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 3일

artifact-linker

★36

ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery

Python

업데이트됨 2026년 6월 10일

c4-documentation

★33

이 저장소에 대한 설명이 제공되지 않았습니다.

알 수 없는 언어

업데이트됨 2026년 6월 6일

signal-and-noise

★30

Measuring the Signal to Noise Ratio in Language Model Evaluation

Python

업데이트됨 2026년 6월 12일

recoma

★30

Reasoning by Communicating with Agents

Python

업데이트됨 2026년 6월 5일

persona-bias

★29

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 9일

natural-instructions-v1

★28

Benchmarking Generalization to New Tasks from Natural Language Instructions

Python

업데이트됨 2026년 6월 11일

grobid

★23

A machine learning software for extracting information from scholarly documents

Java

업데이트됨 2026년 6월 12일

rslearn_projects

★22

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 9일

olmo-eval

★18

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 13일

twentyquestions

★17

A web application for playing 20 Questions to crowdsource common sense. 🤖

Python

업데이트됨 2026년 6월 7일

asta-plugins

★16

이 저장소에 대한 설명이 제공되지 않았습니다.

Python

업데이트됨 2026년 6월 12일

MolmoPoint-GUISyn

★15

Synthetic GUI Pointing Data Generation

Python

업데이트됨 2026년 6월 6일

s6ui

★12

A fast AWS S3 browser, with inspiration from s5cmd

Rust

업데이트됨 2026년 6월 5일

layout-parser

★5

A Python Library for Document Layout Understanding

Python

업데이트됨 2026년 6월 4일

molmospaces-resources

★4

Resource manager for MolmoSpaces

Python

업데이트됨 2026년 6월 11일

skiff2-actions

★3

GitHub actions for skiff2 repositories.

TypeScript

업데이트됨 2026년 6월 8일

OlmoEarth-Feedback

★2

Repo for collection of feedback on OlmoEarth

알 수 없는 언어

업데이트됨 2026년 6월 5일

mujoco

★2

이 저장소에 대한 설명이 제공되지 않았습니다.

C++

업데이트됨 2026년 6월 4일

personalized-scholarqa-eval

★2

Evaluation code for the paper "Language Models Don't Know What You Want: Evaluating Personalization in Deep Research Needs Real Users"

Python

업데이트됨 2026년 6월 3일

molmospaces_policy_zoo

★0

Policy zoo for data generation + evaluation in MolmoSpaces

Python

업데이트됨 2026년 6월 12일

fairseq

★0

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python

업데이트됨 2026년 6월 3일

자주 묻는 질문

allenai는 GitHub에서 무엇을 개발하나요?

allenai는 NLP 및 AI 연구를 위한 여러 오픈 소스 프로젝트를 개발합니다. 주요 저장소로는 allennlp와 olmocr이 있으며, 다양한 데이터 처리 및 모델 훈련 도구를 제공합니다.

allenai는 어떤 프로그래밍 언어를 사용하나요?

allenai는 주로 Python을 사용하며, Jupyter Notebook, Scala, Rust, C#, Lua와 같은 언어도 활용합니다. 이러한 언어들은 다양한 프로젝트에서 사용됩니다.

allenai의 저장소는 공개인가요?

네, allenai의 저장소는 모두 공개되어 있습니다. 사용자는 GitHub에서 이들의 프로젝트를 탐색하고 기여할 수 있습니다.

이 노출이 의도된 것인가요?

Ai2을 RepoGuard로 모니터링하고 새로운 공개 저장소가 나타나는 순간 알림을 받으세요.

이 계정 모니터링하기