Updated 4 hours ago

Organization

EleutherAI's public footprint on GitHub

@EleutherAI


The Internet

182

Public repositories

28,707

Total stars

4,327

Followers

EleutherAI is active on GitHub and maintains a wide range of public repositories. Its main languages are Python, Jupyter Notebook, and C++. Notable projects include lm-evaluation-harness, a framework for few-shot evaluation of language models, and gpt-neox, an implementation of model-parallel autoregressive transformers on GPUs.

Top languages

Python 53, Jupyter Notebook 18, C++ 2, JavaScript 2, Rust 1, Cuda 1, CMake 1

Public repositories

lm-evaluation-harness

★12,941

A framework for few-shot evaluation of language models.

Python

Updated Jun 13, 2026

gpt-neox

★7,442

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python

Updated Jun 12, 2026

pythia

★2,818

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook

Updated Jun 13, 2026

math-lm

★1,095

No description provided for this repository.

Python

Updated Jun 11, 2026

cookbook

★844

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python

Updated Jun 11, 2026

sparsify

★727

Sparsify transformers with SAEs and transcoders

Python

Updated Jun 12, 2026

polyglot

★487

Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

Unknown language

Updated Jun 11, 2026

vqgan-clip

★353

No description provided for this repository.

Jupyter Notebook

Updated Jun 11, 2026

concept-erasure

★255

Erasing concepts from neural representations with provable guarantees

Python

Updated Jun 13, 2026

elk

★220

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python

Updated Jun 11, 2026

nanoGPT-mup

★196

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python

Updated Jun 11, 2026

oslo

★175

OSLO: Open Source for Large-scale Optimization

Python

Updated Jun 11, 2026

aria

★106

Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)

Python

Updated Jun 11, 2026

dps

★93

Data processing system for polyglot

Python

Updated Jun 11, 2026

improved-t5

★76

Experiments for efforts to train a new and improved t5

Python

Updated Jun 11, 2026

minetest

★74

Minetest is an open source voxel game engine with easy modding and game creation

C++

Updated Jun 11, 2026

aria-amt

★70

Efficient and robust implementation of seq-to-seq automatic piano transcription.

Python

Updated Jun 11, 2026

bergson

★60

Mapping out the "memory" of neural nets with data attribution

Python

Updated Jun 13, 2026

magiCARP

★58

One stop shop for all things carp

Python

Updated Jun 11, 2026

semantic-memorization

★44

No description provided for this repository.

Jupyter Notebook

Updated Jun 11, 2026

features-across-time

★41

Understanding how features learned by neural networks evolve throughout training

Python

Updated Jun 11, 2026

hae-rae

★33

No description provided for this repository.

Unknown language

Updated Jun 11, 2026

rnngineering

★32

Engineering the state of RNN language models (Mamba, RWKV, etc.)

Jupyter Notebook

Updated Jun 11, 2026

elk-generalization

★31

Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard

Python

Updated Jun 11, 2026

steering-llama3

★30

No description provided for this repository.

Python

Updated Jun 11, 2026

tokengrams

★27

Efficiently computing & storing token n-grams from large corpora

Rust

Updated Jun 11, 2026

training-jacobian

★24

No description provided for this repository.

Jupyter Notebook

Updated Jun 11, 2026

w2s

★24

No description provided for this repository.

Python

Updated Jun 11, 2026

deep-ignorance

★19

No description provided for this repository.

Python

Updated Jun 11, 2026

polyglot-data

★19

data related codebase for polyglot project

Python

Updated Jun 11, 2026

pile_dedupe

★18

Pile Deduplication Code

Python

Updated Jun 11, 2026

latent-video-diffusion

★16

Latent video diffusion

Python

Updated Jun 11, 2026

NeMo

★16

NeMo: a toolkit for conversational AI

Python

Updated Jun 11, 2026

attribute

★15

No description provided for this repository.

Python

Updated Jun 11, 2026

exploring-contrastive-topology

★15

No description provided for this repository.

Jupyter Notebook

Updated Jun 11, 2026

polyapprox

★13

Closed-form polynomial approximations to neural networks

Python

Updated Jun 11, 2026

pilev2

★13

No description provided for this repository.

Python

Updated Jun 11, 2026

lm_dataformat

★11

No description provided for this repository.

Python

Updated Jun 11, 2026

transformer-reasoning

★10

Experiments in transformer knowledge and reasoning

Jupyter Notebook

Updated Jun 11, 2026

architecture-objective

★10

No description provided for this repository.

Python

Updated Jun 11, 2026

attention-probes

★8

Linear probes with attention weighting

Python

Updated Jun 11, 2026

equinox-llama

★8

Equinox implementation of llama3 and llama3.1

Python

Updated Jun 11, 2026

GPTeacher

★8

A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer

Unknown language

Updated Jun 11, 2026

minetest-baselines

★8

Baseline agents for Minetest tasks.

Python

Updated Jun 11, 2026

aria-utils

★6

MIDI tokenizers and pre-processing utils.

Python

Updated Jun 11, 2026

cupbearer

★6

A library for mechanistic anomaly detection

Jupyter Notebook

Updated Jun 11, 2026

weak-to-strong

★6

No description provided for this repository.

Python

Updated Jun 11, 2026

trlx

★6

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python

Updated Jun 11, 2026

minetest-interpretabilty-notebook

★6

Jupyter notebook for the interpretability section of the minetester blog post

Jupyter Notebook

Updated Jun 11, 2026

CodeCARP

★6

Data collection pipeline for CodeCARP. Includes PyCharm plugins.

Unknown language

Updated Jun 11, 2026

clearnets

★5

No description provided for this repository.

Python

Updated Jun 11, 2026

optax-galore

★5

Adds GaLore style projection wrappers to optax optimizers

Python

Updated Jun 11, 2026

architecture-experiments

★5

Repository to host architecture experiments and development using Paxml and Praxis

Python

Updated Jun 11, 2026

FLAN

★5

No description provided for this repository.

Python

Updated Jun 11, 2026

thonkenizers

★5

yes

Unknown language

Updated Jun 11, 2026

scalable-elicitation

★4

The code used in "Balancing Label Quantity and Quality for Scalable Elicitation"

Jupyter Notebook

Updated Jun 11, 2026

monkfish

★4

No description provided for this repository.

Python

Updated Jun 11, 2026

alignment-handbook

★4

Robust recipes to align language models with human and AI preferences

Unknown language

Updated Jun 11, 2026

Unpaired-Image-Generation

★4

Project Repo for Unpaired Image Generation project

Unknown language

Updated Jun 11, 2026

lm-scope

★4

No description provided for this repository.

Jupyter Notebook

Updated Jun 11, 2026

sae_overlap

★3

Accompanying code for our research on SAE feature overlap when trained on different seeds.

Jupyter Notebook

Updated Jun 11, 2026

variance-across-time

★3

Studying the variance in neural net predictions across training time

Python

Updated Jun 11, 2026

EvilModel

★3

A replication of "EvilModel 2.0: Bringing Neural Network Models into Malware Attacks"

Unknown language

Updated Jun 11, 2026

eai-prompt-gallery

★3

Library of interesting prompt generations

JavaScript

Updated Jun 11, 2026

gamescope

★2

Can interpretability methods confer an advantage in competitive games?

Python

Updated Jun 11, 2026

fmri

★2

Analogue of fMRI on artificial neural networks

Unknown language

Updated Jun 11, 2026

rtopk

★2

https://github.com/xiexi51/RTopK PyTorch wrapper

Cuda

Updated Jun 11, 2026

pd-books

★2

No description provided for this repository.

Jupyter Notebook

Updated Jun 11, 2026

tuned-lens

★2

Tools for understanding how transformer predictions are built layer-by-layer

Python

Updated Jun 11, 2026

tinydpo

★2

No description provided for this repository.

Unknown language

Updated Jun 11, 2026

eleutherai-instruct-dataset

★2

A large instruct dataset for open-source models (WIP).

Unknown language

Updated Jun 11, 2026

examples

★2

Mosaicml example benchmarks + LLM scripts

Python

Updated Jun 11, 2026

minetest_game

★2

Minetest Game - The default game for the Minetest engine [https://github.com/minetest/minetest/]

Unknown language

Updated Jun 11, 2026

groupoid-rl

★2

No description provided for this repository.

Jupyter Notebook

Updated Jun 11, 2026

truffaldino

★1

Investigating goal instability in RL

Python

Updated Jun 11, 2026

rllm

★1

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook

Updated Jun 11, 2026

bayesian-adam

★1

Exactly what it says on the tin

Python

Updated Jun 11, 2026

RWKV-LM

★1

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Python

Updated Jun 11, 2026

conceptual-constraints

★1

Applying LEACE to models during training

Jupyter Notebook

Updated Jun 11, 2026

aria.cpp

★1

GGML implementation of https://github.com/EleutherAI/aria

CMake

Updated Jun 11, 2026

classifier-latent-diffusion

★1

No description provided for this repository.

Python

Updated Jun 11, 2026

language-adaptation

★1

No description provided for this repository.

Unknown language

Updated Jun 11, 2026

maxtext

★1

A simple, performant and scalable Jax LLM!

Unknown language

Updated Jun 11, 2026

irrlicht

★1

Minetest's fork of Irrlicht

C++

Updated Jun 11, 2026

lm-evaulation-ui

★1

App for generating html table from LM evaluation JSONs

JavaScript

Updated Jun 11, 2026

gradient-routing

★0

No description provided for this repository.

Python

Updated Jun 11, 2026

rh-indicators

★0

No description provided for this repository.

Python

Updated Jun 11, 2026

hackable-bergson

★0

Simplified library for mapping out the "memory" of neural nets with data attribution

Unknown language

Updated Jun 11, 2026

vllm

★0

A high-throughput and memory-efficient inference and serving engine for LLMs

Unknown language

Updated Jun 11, 2026

verifiers

★0

Verifiers for LLM Reinforcement Learning

Python

Updated Jun 11, 2026

wmdp

★0

WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.

Jupyter Notebook

Updated Jun 11, 2026

Megatron-LM

★0

Ongoing research training transformer models at scale

Unknown language

Updated Jun 11, 2026

mixture-of-depths

★0

No description provided for this repository.

Unknown language

Updated Jun 11, 2026

llm-score-behavior

★0

No description provided for this repository.

Python

Updated Jun 11, 2026

TransformerEngine

★0

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Python

Updated Jun 11, 2026

Plenoxels_FreeNerf

★0

Implementation of Plenoxels radiance fields without neural networks, with free nerf strategy

Unknown language

Updated Jun 11, 2026

oslo-1

★0

OSLO: Open Source for Large-scale Optimization

Unknown language

Updated Jun 11, 2026

t-zero

★0

Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

Unknown language

Updated Jun 11, 2026

CommonLoopUtils

★0

[WIP] a version of CLU with WandB logging added.

Jupyter Notebook

Updated Jun 11, 2026

pytorch-fid

★0

Compute FID scores with PyTorch.

Unknown language

Updated Jun 11, 2026

Frequently asked questions

What does EleutherAI develop on GitHub?

EleutherAI develops projects related to language models and their interpretability. Its key repositories are lm-evaluation-harness, a framework for few-shot evaluation of language models, and gpt-neox, an implementation of model-parallel autoregressive transformers on GPUs.

Which programming languages does EleutherAI use?

EleutherAI's projects use several languages, including Python, Jupyter Notebook, C++, JavaScript, Rust, and Cuda. Python is the most common by a wide margin.

Are EleutherAI's repositories public?

Yes, every repository listed on this page is public. Researchers and developers can use these language-model and deep-learning projects and contribute to them.
