Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / EleutherAI 51 repositories

EleutherAI/concept-erasure

Erasing concepts from neural representations with provable guarantees

Language: Python - Size: 135 KB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 194 - Forks: 15

EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python - Size: 22.6 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 5,306 - Forks: 1,384

EleutherAI/w2s

Language: Python - Size: 292 KB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0

EleutherAI/cupbearer Fork of ejnnr/cupbearer

A library for mechanistic anomaly detection

Size: 7.92 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0

EleutherAI/aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.

Language: Python - Size: 91.7 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 6 - Forks: 4

EleutherAI/aria

Language: Python - Size: 314 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 36 - Forks: 6

EleutherAI/elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Language: Python - Size: 26 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 171 - Forks: 32

EleutherAI/pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language: Jupyter Notebook - Size: 487 MB - Last synced: 24 days ago - Pushed: 25 days ago - Stars: 2,051 - Forks: 150

EleutherAI/gpt-neo ๐Ÿ“ฆ

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Language: Python - Size: 1.56 MB - Last synced: 25 days ago - Pushed: about 2 years ago - Stars: 8,145 - Forks: 935

EleutherAI/semantic-memorization

Language: Jupyter Notebook - Size: 146 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 34 - Forks: 3

EleutherAI/architecture-experiments

Repository to host architecture experiments and development using Paxml and Praxis

Language: Python - Size: 10.7 KB - Last synced: 29 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 1

EleutherAI/FLAN Fork of google-research/FLAN

Language: Python - Size: 55.7 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 5 - Forks: 1

EleutherAI/examples Fork of mosaicml/examples

Mosaicml example benchmarks + LLM scripts

Language: Python - Size: 3.02 MB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1

EleutherAI/CommonLoopUtils Fork of google/CommonLoopUtils

[WIP] a version of CLU with WandB logging added.

Language: Jupyter Notebook - Size: 700 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

EleutherAI/trlx Fork of CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python - Size: 44.6 MB - Last synced: 29 days ago - Pushed: 9 months ago - Stars: 7 - Forks: 2

EleutherAI/math-lm

Language: Python - Size: 18.8 MB - Last synced: 29 days ago - Pushed: 3 months ago - Stars: 973 - Forks: 74

EleutherAI/knowledge-neurons

A library for finding knowledge neurons in pretrained transformer models.

Language: Python - Size: 11.6 MB - Last synced: 21 days ago - Pushed: over 2 years ago - Stars: 136 - Forks: 19

EleutherAI/DALLE-mtf

Open-AI's DALL-E for large scale training in mesh-tensorflow.

Language: Python - Size: 272 KB - Last synced: 29 days ago - Pushed: over 2 years ago - Stars: 436 - Forks: 48

EleutherAI/polyglot

Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

Size: 944 KB - Last synced: 25 days ago - Pushed: 9 months ago - Stars: 460 - Forks: 37

EleutherAI/improved-t5

Experiments for efforts to train a new and improved t5

Language: Python - Size: 42.7 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 73 - Forks: 5

EleutherAI/elk-generalization

Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard

Language: Python - Size: 30.7 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 19 - Forks: 3

EleutherAI/oslo

OSLO: Open Source for Large-scale Optimization

Language: Python - Size: 10.4 MB - Last synced: 25 days ago - Pushed: 9 months ago - Stars: 170 - Forks: 29

EleutherAI/best-download

URL downloader supporting checkpointing and continuous checksumming.

Language: Python - Size: 35.2 KB - Last synced: 26 days ago - Pushed: 6 months ago - Stars: 19 - Forks: 7

EleutherAI/variance-across-time

Studying the variance in neural net predictions across training time

Language: Python - Size: 58.6 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 0

EleutherAI/DeeperSpeed Fork of microsoft/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Language: Python - Size: 178 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 159 - Forks: 44

EleutherAI/website

New website for EleutherAI based on Hugo static site generator

Language: HTML - Size: 61.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 4 - Forks: 6

EleutherAI/rnngineering

Engineering the state of RNN language models (Mamba, RWKV, etc.)

Language: Jupyter Notebook - Size: 70.2 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 30 - Forks: 2

EleutherAI/mp_nerf

Massively-Parallel Natural Extension of Reference Frame

Language: Jupyter Notebook - Size: 42.9 MB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 26 - Forks: 2

EleutherAI/jusText Fork of miso-belica/jusText

Heuristic based boilerplate removal tool

Language: Python - Size: 1010 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 1 - Forks: 1

EleutherAI/grouch

Language: HTML - Size: 656 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 1 - Forks: 2

EleutherAI/reddit-comment-processing

Language: Python - Size: 7.81 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 2 - Forks: 3

EleutherAI/features-across-time

Understanding how features learned by neural networks evolve throughout training

Language: Python - Size: 7.93 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 24 - Forks: 1

EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language: Python - Size: 110 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 6,535 - Forks: 946

EleutherAI/alignment-handbook Fork of huggingface/alignment-handbook

Robust recipes for to align language models with human and AI preferences

Size: 128 KB - Last synced: 29 days ago - Pushed: 4 months ago - Stars: 3 - Forks: 1

EleutherAI/tokengrams

Efficiently computing & storing token n-grams from large corpora

Language: Rust - Size: 193 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7 - Forks: 0

EleutherAI/cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Language: Python - Size: 53.3 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 138 - Forks: 10

EleutherAI/tqdm-multiprocess

Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly through the main process. It offers similar functionality for python logging.

Language: Python - Size: 48.8 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 41 - Forks: 2

EleutherAI/weak-to-strong Fork of openai/weak-to-strong

Language: Python - Size: 12.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 1

EleutherAI/CAA Fork of nrimsky/CAA

Steering Llama 2 with Contrastive Activation Addition

Size: 1.14 GB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

EleutherAI/lm-evaulation-ui

App for generating html table from LM evaluation JSONs

Language: JavaScript - Size: 18.6 KB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

EleutherAI/eleuther-blog

here is the generated content for the EleutherAI blog. Source is from new-website repo

Language: HTML - Size: 27.8 MB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

EleutherAI/eai-prompt-gallery

Library of interesting prompt generations

Language: JavaScript - Size: 94 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 3 - Forks: 0

EleutherAI/alignment-reader

Search and filter through alignment literature

Language: JavaScript - Size: 292 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

EleutherAI/stackexchange-dataset

Python tools for processing the stackexchange data dumps into a text dataset for Language Models

Language: Python - Size: 49.8 KB - Last synced: 2 months ago - Pushed: 6 months ago - Stars: 64 - Forks: 15

EleutherAI/the-pile

Language: Python - Size: 259 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 1,362 - Forks: 116

EleutherAI/openwebtext2

Language: Python - Size: 5.32 MB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 80 - Forks: 15

EleutherAI/pyfra

Python Research Framework

Language: Python - Size: 725 KB - Last synced: 24 days ago - Pushed: over 1 year ago - Stars: 107 - Forks: 12

EleutherAI/RWKV-LM Fork of BlinkDL/RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python - Size: 16.9 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

EleutherAI/text-generation-testing-ui

Web app for demoing the EAI models

Language: JavaScript - Size: 1.06 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 16 - Forks: 12

EleutherAI/project-menu

See the issue board for the current status of active and prospective projects!

Size: 66.4 KB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 65 - Forks: 4

EleutherAI/conceptual-constraints

Applying LEACE to models during training

Language: Jupyter Notebook - Size: 3.59 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

EleutherAI/ccs

Language: Python - Size: 26.1 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 3 - Forks: 6

EleutherAI/aria.cpp

GGML implementation of https://github.com/EleutherAI/aria

Language: CMake - Size: 32.2 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

EleutherAI/hn-scraper

Language: Python - Size: 14.9 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 8 - Forks: 2

EleutherAI/vqgan-clip

Language: Jupyter Notebook - Size: 16.7 MB - Last synced: 4 months ago - Pushed: about 2 years ago - Stars: 339 - Forks: 40

EleutherAI/pd-books

Language: Jupyter Notebook - Size: 9.43 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 1

EleutherAI/polyglot-data

data related codebase for polyglot project

Language: Python - Size: 2.43 MB - Last synced: 25 days ago - Pushed: about 1 year ago - Stars: 19 - Forks: 10

EleutherAI/minetest Fork of minetest/minetest

Minetest is an open source voxel game engine with easy modding and game creation

Language: C++ - Size: 91.7 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 53 - Forks: 10

EleutherAI/TransformerEngine Fork of NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python - Size: 2.52 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

EleutherAI/common-llm-settings

Common LLM Settings App

Language: JavaScript - Size: 315 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

EleutherAI/tuned-lens Fork of AlignmentResearch/tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer

Size: 1.62 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0

EleutherAI/tinydpo Fork of cat-state/tinypar

Size: 292 KB - Last synced: 29 days ago - Pushed: 11 months ago - Stars: 2 - Forks: 0

EleutherAI/tagged-pile

Part-of-Speech Tagging for the Pile and RedPajama

Language: Python - Size: 6.84 KB - Last synced: about 1 month ago - Pushed: 12 months ago - Stars: 9 - Forks: 2

EleutherAI/classifier-latent-diffusion

Language: Python - Size: 8.79 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 1

EleutherAI/irrlicht Fork of minetest/irrlicht

Minetest's fork of Irrlicht

Language: C++ - Size: 18 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 1

EleutherAI/llemma-sample-explorer

Sample explorer tool for the Llemma models.

Language: HTML - Size: 731 KB - Last synced: 29 days ago - Pushed: 7 months ago - Stars: 5 - Forks: 0

EleutherAI/maxtext Fork of google/maxtext

A simple, performant and scalable Jax LLM!

Size: 262 KB - Last synced: 29 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 2

EleutherAI/prefix-free-tokenizer

A prefix free tokenizer

Language: Python - Size: 6.84 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

EleutherAI/truncated-gaussian

Method-of-moments estimation and sampling for truncated multivariate Gaussian distributions

Language: Python - Size: 7.81 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

EleutherAI/mup Fork of microsoft/mup

maximal update parametrization (ยตP)

Size: 16.5 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

EleutherAI/latent-video-diffusion

Latent video diffusion

Language: Python - Size: 39.1 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 3 - Forks: 2

EleutherAI/mdl

Minimum Description Length probing for neural network representations

Language: Python - Size: 164 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 11 - Forks: 1

EleutherAI/EvilModel

A replication of "EvilModel 2.0: Bringing Neural Network Models into Malware Attacks"

Size: 6.84 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

EleutherAI/dps

Data processing system for polyglot

Language: Python - Size: 7.67 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 62 - Forks: 20

EleutherAI/hae-rae

Size: 1.52 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 26 - Forks: 4

EleutherAI/pile-literotica

Download, parse, and filter data from Literotica. Data-ready for The-Pile.

Language: Python - Size: 3.91 KB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 6 - Forks: 2

EleutherAI/minetest-baselines

Baseline agents for Minetest tasks.

Language: Python - Size: 79.1 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 6 - Forks: 1

EleutherAI/minetest-interpretabilty-notebook

Jupyter notebook for the interpretablity section of the minetester blog post

Language: Jupyter Notebook - Size: 18.1 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

EleutherAI/poll_website_demo

Flask Based Polling Website Demo

Language: Python - Size: 798 KB - Last synced: almost 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

EleutherAI/eleutherai-instruct-dataset

A large instruct dataset for open-source models (WIP).

Size: 58 MB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

EleutherAI/pile_dedupe

Pile Deduplication Code

Language: Python - Size: 16.6 KB - Last synced: almost 1 year ago - Pushed: about 1 year ago - Stars: 3 - Forks: 0

EleutherAI/composer Fork of mosaicml/composer

Train neural networks up to 7x faster

Language: Python - Size: 8.04 MB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 3 - Forks: 2

EleutherAI/lm_perplexity

Language: Python - Size: 536 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 90 - Forks: 14

EleutherAI/exploring-contrastive-topology

Language: Jupyter Notebook - Size: 58 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 16 - Forks: 3

EleutherAI/magiCARP

One stop shop for all things carp

Language: Python - Size: 31.4 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 53 - Forks: 11

EleutherAI/multimodal-fid

Language: Python - Size: 67.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 5 - Forks: 0

EleutherAI/t-zero Fork of bigscience-workshop/t-zero

Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

Size: 158 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

EleutherAI/NeMo Fork of NVIDIA/NeMo

NeMo: a toolkit for conversational AI

Language: Python - Size: 124 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 10 - Forks: 2

EleutherAI/pile-website Fork of rajpurkar/SQuAD-explorer

Language: HTML - Size: 80.9 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 3

EleutherAI/isaac-mchorse

EleutherAI's discord bot

Language: Python - Size: 208 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 0

EleutherAI/github-downloader Fork of noanabeshima/github-downloader

Script for downloading GitHub.

Language: Python - Size: 4.78 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 46 - Forks: 22

EleutherAI/pile-cc Fork of leogao2/commoncrawl_downloader

Size: 32.2 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 11 - Forks: 1

EleutherAI/lm_dataformat Fork of leogao2/lm_dataformat

Language: Python - Size: 80.1 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 3

EleutherAI/pilev2

Language: Python - Size: 19.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 11 - Forks: 9

EleutherAI/megatron-3d ๐Ÿ“ฆ

Language: Python - Size: 521 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 3 - Forks: 3

EleutherAI/equivariance

A framework for implementing equivariant DL

Language: Jupyter Notebook - Size: 1.08 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 10 - Forks: 3

EleutherAI/pile-pubmedcentral

A script for collecting the PubMed Central dataset in a language modelling friendly format.

Language: Python - Size: 18.7 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 7 - Forks: 1

EleutherAI/pile-explorer

For exploring the data and documenting its limitations

Language: Python - Size: 39.1 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 4 - Forks: 3

EleutherAI/pile-allpoetry

Scraper to gather poems from allpoetry.com

Language: Python - Size: 41 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 1

EleutherAI/datasets Fork of huggingface/datasets

๐Ÿค— The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Size: 41.8 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 9 - Forks: 3