10 h agoに更新されました

Organization

Ai2の公開GitHubフットプリント

@allenai

GitHubでプロフィールを見る

Seattle, WA

584

公開リポジトリ

77,201

合計スター

4,769

フォロワー

allenaiは、Seattle, WAに拠点を置く組織で、GitHub上に幅広いリポジトリを展開しています。主なプログラミング言語にはPython、Jupyter Notebook、Scala、Rust、C#、Luaが含まれます。特に、olmocrやallennlpなどのプロジェクトは、NLPやLLMデータセットに関する重要なツールとして広く利用されています。

主要な言語

Python 83Jupyter Notebook 3Scala 2Rust 2C# 1Lua 1HTML 1Java 1

公開リポジトリ

olmocr

★17,387

Toolkit for linearizing PDFs for LLM datasets/training

Python

更新済み 2026年6月13日

allennlp

★11,892

An open-source NLP research library, built on PyTorch.

Python

更新済み 2026年6月13日

OLMo

★6,554

Modeling, training, eval, and inference code for OLMo

Python

更新済み 2026年6月12日

open-instruct

★3,752

AllenAI's post-training codebase

Python

更新済み 2026年6月12日

RL4LMs

★2,388

A modular RL library to fine-tune language models to human preferences

Python

更新済み 2026年6月6日

longformer

★2,196

Longformer: The Long-Document Transformer

Python

更新済み 2026年6月5日

scispacy

★1,964

A full spaCy pipeline and models for scientific/biomedical documents.

Python

更新済み 2026年6月12日

ai2thor

★1,739

An open-source platform for Visual AI.

更新済み 2026年6月11日

scibert

★1,703

A BERT model for scientific text.

Python

更新済み 2026年6月10日

dolma

★1,508

Data and tools for generating and inspecting OLMo pre-training data.

Python

更新済み 2026年6月8日

objaverse-xl

★1,297

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Python

更新済み 2026年6月13日

OLMo-core

★1,289

PyTorch building blocks for the OLMo ecosystem

Python

更新済み 2026年6月13日

s2orc

★1,064

S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/

Python

更新済み 2026年6月10日

natural-instructions

★1,047

Expanding natural instructions

Python

更新済み 2026年6月10日

OLMoE

★1,026

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook

更新済み 2026年6月9日

molmo

★914

Code for the Molmo Vision-Language Model

Python

更新済み 2026年6月11日

XNOR-Net

★870

ImageNet classification using binary Convolutional Neural Networks

Lua

更新済み 2026年6月9日

papermage

★797

library supporting NLP and CV research on scientific papers

Python

更新済み 2026年6月8日

visprog

★773

Official code for VisProg (CVPR 2023 Best Paper!)

Python

更新済み 2026年6月8日

scitldr

★759

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月9日

pdffigures2

★748

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Scala

更新済み 2026年6月7日

reward-bench

★721

RewardBench: the first evaluation tool for reward models.

Python

更新済み 2026年6月12日

molmo2

★643

Code for the Molmo2 Vision-Language Model

Python

更新済み 2026年6月12日

molmoact2

★605

Official Repository for MolmoAct2

Python

更新済み 2026年6月13日

specter

★583

SPECTER: Document-level Representation Learning using Citation-informed Transformers

Python

更新済み 2026年6月13日

WildDet3D

★576

Allen Institute for AI: WildDet3D: Scaling Promptable 3D Detection in the Wild

Python

更新済み 2026年6月12日

molmoweb

★567

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月11日

allennlp-models

★563

Officially supported AllenNLP models

Python

更新済み 2026年6月9日

Holodeck

★553

CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.

Python

更新済み 2026年6月6日

dont-stop-pretraining

★543

Code associated with the Don't Stop Pretraining ACL 2020 paper

Python

更新済み 2026年6月5日

OLMoASR

★491

An open-source implementation of Whisper

Python

更新済み 2026年6月3日

s2orc-doc2json

★469

Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)

Python

更新済み 2026年6月6日

procthor

★441

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Python

更新済み 2026年6月12日

deep_qa

★403

A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)

Python

更新済み 2026年6月6日

allenact

★382

An open source framework for research in Embodied-AI from AI2.

Python

更新済み 2026年6月9日

olmes

★379

Reproducible, flexible LLM evaluations

Python

更新済み 2026年6月10日

molmoact

★369

Official Repository for MolmoAct

Python

更新済み 2026年6月12日

vla-evaluation-harness

★368

One framework to evaluate any VLA model on any robot simulation benchmark.

Python

更新済み 2026年6月12日

ScienceWorld

★363

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Scala

更新済み 2026年6月10日

molmospaces

★358

An end-to-end open ecosystem for robot learning

Python

更新済み 2026年6月12日

satlas-super-resolution

★341

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月10日

ai2-scholarqa-lib

★281

Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

Python

更新済み 2026年6月7日

satlas

★280

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月11日

s2-folks

★275

Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.

不明な言語

更新済み 2026年6月10日

scifact

★263

Data and models for the SciFact verification task.

Python

更新済み 2026年6月10日

WildBench

★254

Benchmarking LLMs with Challenging Tasks from Real Users

Python

更新済み 2026年6月8日

olmoearth_pretrain

★246

Earth system foundation model data, training, and eval

Python

更新済み 2026年6月12日

asta-paper-finder

★244

frozen-in-time version of our Paper Finder agent for reproducing evaluation results

Python

更新済み 2026年6月12日

real-toxicity-prompts

★233

このリポジトリに関する説明は提供されていません。

Jupyter Notebook

更新済み 2026年6月11日

discoveryworld

★215

A virtual environment for developing and evaluating automated scientific discovery agents.

Python

更新済み 2026年6月10日

hidden-networks

★198

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月8日

autodiscovery-neurips

★182

Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"

Python

更新済み 2026年6月4日

medicat

★176

Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references

Python

更新済み 2026年6月12日

pixmo-docs

★163

ACL 2025: Synthetic data generation pipelines for text-rich images.

Python

更新済み 2026年6月5日

discoverybench

★147

Discovering Data-driven Hypotheses in the Wild

Python

更新済み 2026年6月12日

SERA

★146

Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.

Python

更新済み 2026年6月13日

satlaspretrain_models

★144

このリポジトリに関する説明は提供されていません。

Jupyter Notebook

更新済み 2026年6月9日

IFBench

★142

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月11日

agent-baselines

★142

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月8日

SPECTER2

★136

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月5日

bolmo-core

★134

Code for Bolmo: Byteifying the Next Generation of Language Models

Python

更新済み 2026年6月10日

wildguard

★125

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python

更新済み 2026年6月12日

aokvqa

★116

Official repository for the A-OKVQA dataset

Python

更新済み 2026年6月5日

asta-bench

★109

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月13日

S2AND

★109

Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

Python

更新済み 2026年6月4日

infinigram-api

★101

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月12日

DecomP

★99

Repository for Decomposed Prompting

Python

更新済み 2026年6月9日

robothor-challenge

★99

RoboTHOR Challenge

Python

更新済み 2026年6月4日

MolmoBot

★90

Code and website for "MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation".

Python

更新済み 2026年6月10日

rslearn

★89

A tool for developing remote sensing datasets and models.

Python

更新済み 2026年6月11日

duplodocus

★85

Tooling for exact and MinHash deduplication of large-scale text datasets

Rust

更新済み 2026年6月5日

olmoearth_projects

★74

OlmoEarth projects

Python

更新済み 2026年6月12日

codenav

★69

CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries.

Python

更新済み 2026年6月6日

atlantes

★66

Efficient and low latency real-time global-scale GPS trajectory modeling

Python

更新済み 2026年6月10日

phone2proc

★63

📱👉🏠 Perform conditional procedural generation to generate houses like your own!

Python

更新済み 2026年6月10日

paper-embedding-public-apis

★60

Collection of public APIs for embedding scientific papers

不明な言語

更新済み 2026年6月7日

ruletaker

★55

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月7日

EMO

★42

このリポジトリに関する説明は提供されていません。

HTML

更新済み 2026年6月10日

fermi

★37

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月3日

artifact-linker

★36

ArtifactLinker: Linking Scientific Artifacts for Automatic State-of-the-Art Discovery

Python

更新済み 2026年6月10日

c4-documentation

★33

このリポジトリに関する説明は提供されていません。

不明な言語

更新済み 2026年6月6日

signal-and-noise

★30

Measuring the Signal to Noise Ratio in Language Model Evaluation

Python

更新済み 2026年6月12日

recoma

★30

Reasoning by Communicating with Agents

Python

更新済み 2026年6月5日

persona-bias

★29

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月9日

natural-instructions-v1

★28

Benchmarking Generalization to New Tasks from Natural Language Instructions

Python

更新済み 2026年6月11日

grobid

★23

A machine learning software for extracting information from scholarly documents

Java

更新済み 2026年6月12日

rslearn_projects

★22

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月9日

olmo-eval

★18

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月13日

twentyquestions

★17

A web application for playing 20 Questions to crowdsource common sense. 🤖

Python

更新済み 2026年6月7日

asta-plugins

★16

このリポジトリに関する説明は提供されていません。

Python

更新済み 2026年6月12日

MolmoPoint-GUISyn

★15

Synthetic GUI Pointing Data Generation

Python

更新済み 2026年6月6日

s6ui

★12

A fast AWS S3 browser, with inspiration from s5cmd

Rust

更新済み 2026年6月5日

layout-parser

★5

A Python Library for Document Layout Understanding

Python

更新済み 2026年6月4日

molmospaces-resources

★4

Resource manager for MolmoSpaces

Python

更新済み 2026年6月11日

skiff2-actions

★3

GitHub actions for skiff2 repositories.

TypeScript

更新済み 2026年6月8日

OlmoEarth-Feedback

★2

Repo for collection of feedback on OlmoEarth

不明な言語

更新済み 2026年6月5日

mujoco

★2

このリポジトリに関する説明は提供されていません。

C++

更新済み 2026年6月4日

personalized-scholarqa-eval

★2

Evaluation code for the paper "Language Models Don't Know What You Want: Evaluating Personalization in Deep Research Needs Real Users"

Python

更新済み 2026年6月3日

molmospaces_policy_zoo

★0

Policy zoo for data generation + evaluation in MolmoSpaces

Python

更新済み 2026年6月12日

fairseq

★0

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python

更新済み 2026年6月3日

よくある質問

allenaiはGitHubで何を構築していますか？

allenaiは、主に自然言語処理や機械学習に関するオープンソースプロジェクトを構築しています。特に、allennlpやolmocrなどのリポジトリは、研究や開発において重要な役割を果たしています。

allenaiが使用しているプログラミング言語は何ですか？

allenaiは、主にPythonを使用しており、Jupyter Notebook、Scala、Rust、C#、Luaなども活用しています。これにより、多様なプロジェクトを効率的に開発しています。

allenaiのリポジトリは公開されていますか？

はい、allenaiのリポジトリはすべて公開されています。これにより、他の開発者や研究者が彼らのツールやライブラリを利用し、貢献することが可能です。

この露出は意図的ですか？

RepoGuardでAi2を監視し、新しい公開リポジトリが現れた瞬間に警告を受け取ります。

このアカウントを監視する