Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / EleutherAI 51 repositories
EleutherAI/concept-erasure
Erasing concepts from neural representations with provable guarantees
Language: Python - Size: 135 KB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 194 - Forks: 15
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python - Size: 22.6 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 5,306 - Forks: 1,384
EleutherAI/w2s
Language: Python - Size: 292 KB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0
EleutherAI/cupbearer Fork of ejnnr/cupbearer
A library for mechanistic anomaly detection
Size: 7.92 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0
EleutherAI/aria-amt
Efficient and robust implementation of seq-to-seq automatic piano transcription.
Language: Python - Size: 91.7 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 6 - Forks: 4
EleutherAI/aria
Language: Python - Size: 314 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 36 - Forks: 6
EleutherAI/elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
Language: Python - Size: 26 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 171 - Forks: 32
EleutherAI/pythia
The hub for EleutherAI's work on interpretability and learning dynamics
Language: Jupyter Notebook - Size: 487 MB - Last synced: 24 days ago - Pushed: 25 days ago - Stars: 2,051 - Forks: 150
EleutherAI/gpt-neo ๐ฆ
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Language: Python - Size: 1.56 MB - Last synced: 25 days ago - Pushed: about 2 years ago - Stars: 8,145 - Forks: 935
EleutherAI/semantic-memorization
Language: Jupyter Notebook - Size: 146 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 34 - Forks: 3
EleutherAI/architecture-experiments
Repository to host architecture experiments and development using Paxml and Praxis
Language: Python - Size: 10.7 KB - Last synced: 29 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 1
EleutherAI/FLAN Fork of google-research/FLAN
Language: Python - Size: 55.7 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 5 - Forks: 1
EleutherAI/examples Fork of mosaicml/examples
Mosaicml example benchmarks + LLM scripts
Language: Python - Size: 3.02 MB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1
EleutherAI/CommonLoopUtils Fork of google/CommonLoopUtils
[WIP] a version of CLU with WandB logging added.
Language: Jupyter Notebook - Size: 700 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
EleutherAI/trlx Fork of CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python - Size: 44.6 MB - Last synced: 29 days ago - Pushed: 9 months ago - Stars: 7 - Forks: 2
EleutherAI/math-lm
Language: Python - Size: 18.8 MB - Last synced: 29 days ago - Pushed: 3 months ago - Stars: 973 - Forks: 74
EleutherAI/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models.
Language: Python - Size: 11.6 MB - Last synced: 21 days ago - Pushed: over 2 years ago - Stars: 136 - Forks: 19
EleutherAI/DALLE-mtf
Open-AI's DALL-E for large scale training in mesh-tensorflow.
Language: Python - Size: 272 KB - Last synced: 29 days ago - Pushed: over 2 years ago - Stars: 436 - Forks: 48
EleutherAI/polyglot
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
Size: 944 KB - Last synced: 25 days ago - Pushed: 9 months ago - Stars: 460 - Forks: 37
EleutherAI/improved-t5
Experiments for efforts to train a new and improved t5
Language: Python - Size: 42.7 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 73 - Forks: 5
EleutherAI/elk-generalization
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard
Language: Python - Size: 30.7 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 19 - Forks: 3
EleutherAI/oslo
OSLO: Open Source for Large-scale Optimization
Language: Python - Size: 10.4 MB - Last synced: 25 days ago - Pushed: 9 months ago - Stars: 170 - Forks: 29
EleutherAI/best-download
URL downloader supporting checkpointing and continuous checksumming.
Language: Python - Size: 35.2 KB - Last synced: 26 days ago - Pushed: 6 months ago - Stars: 19 - Forks: 7
EleutherAI/variance-across-time
Studying the variance in neural net predictions across training time
Language: Python - Size: 58.6 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 0
EleutherAI/DeeperSpeed Fork of microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Language: Python - Size: 178 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 159 - Forks: 44
EleutherAI/website
New website for EleutherAI based on Hugo static site generator
Language: HTML - Size: 61.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 4 - Forks: 6
EleutherAI/rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
Language: Jupyter Notebook - Size: 70.2 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 30 - Forks: 2
EleutherAI/mp_nerf
Massively-Parallel Natural Extension of Reference Frame
Language: Jupyter Notebook - Size: 42.9 MB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 26 - Forks: 2
EleutherAI/jusText Fork of miso-belica/jusText
Heuristic based boilerplate removal tool
Language: Python - Size: 1010 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 1 - Forks: 1
EleutherAI/grouch
Language: HTML - Size: 656 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 1 - Forks: 2
EleutherAI/reddit-comment-processing
Language: Python - Size: 7.81 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 2 - Forks: 3
EleutherAI/features-across-time
Understanding how features learned by neural networks evolve throughout training
Language: Python - Size: 7.93 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 24 - Forks: 1
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language: Python - Size: 110 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 6,535 - Forks: 946
EleutherAI/alignment-handbook Fork of huggingface/alignment-handbook
Robust recipes for to align language models with human and AI preferences
Size: 128 KB - Last synced: 29 days ago - Pushed: 4 months ago - Stars: 3 - Forks: 1
EleutherAI/tokengrams
Efficiently computing & storing token n-grams from large corpora
Language: Rust - Size: 193 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7 - Forks: 0
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Language: Python - Size: 53.3 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 138 - Forks: 10
EleutherAI/tqdm-multiprocess
Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly through the main process. It offers similar functionality for python logging.
Language: Python - Size: 48.8 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 41 - Forks: 2
EleutherAI/weak-to-strong Fork of openai/weak-to-strong
Language: Python - Size: 12.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 1
EleutherAI/CAA Fork of nrimsky/CAA
Steering Llama 2 with Contrastive Activation Addition
Size: 1.14 GB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
EleutherAI/lm-evaulation-ui
App for generating html table from LM evaluation JSONs
Language: JavaScript - Size: 18.6 KB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
EleutherAI/eleuther-blog
here is the generated content for the EleutherAI blog. Source is from new-website repo
Language: HTML - Size: 27.8 MB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
EleutherAI/eai-prompt-gallery
Library of interesting prompt generations
Language: JavaScript - Size: 94 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 3 - Forks: 0
EleutherAI/alignment-reader
Search and filter through alignment literature
Language: JavaScript - Size: 292 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
EleutherAI/stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
Language: Python - Size: 49.8 KB - Last synced: 2 months ago - Pushed: 6 months ago - Stars: 64 - Forks: 15
EleutherAI/the-pile
Language: Python - Size: 259 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 1,362 - Forks: 116
EleutherAI/openwebtext2
Language: Python - Size: 5.32 MB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 80 - Forks: 15
EleutherAI/pyfra
Python Research Framework
Language: Python - Size: 725 KB - Last synced: 24 days ago - Pushed: over 1 year ago - Stars: 107 - Forks: 12
EleutherAI/RWKV-LM Fork of BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language: Python - Size: 16.9 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
EleutherAI/text-generation-testing-ui
Web app for demoing the EAI models
Language: JavaScript - Size: 1.06 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 16 - Forks: 12
EleutherAI/project-menu
See the issue board for the current status of active and prospective projects!
Size: 66.4 KB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 65 - Forks: 4
EleutherAI/conceptual-constraints
Applying LEACE to models during training
Language: Jupyter Notebook - Size: 3.59 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
EleutherAI/ccs
Language: Python - Size: 26.1 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 3 - Forks: 6
EleutherAI/aria.cpp
GGML implementation of https://github.com/EleutherAI/aria
Language: CMake - Size: 32.2 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
EleutherAI/hn-scraper
Language: Python - Size: 14.9 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 8 - Forks: 2
EleutherAI/vqgan-clip
Language: Jupyter Notebook - Size: 16.7 MB - Last synced: 4 months ago - Pushed: about 2 years ago - Stars: 339 - Forks: 40
EleutherAI/pd-books
Language: Jupyter Notebook - Size: 9.43 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 1
EleutherAI/polyglot-data
data related codebase for polyglot project
Language: Python - Size: 2.43 MB - Last synced: 25 days ago - Pushed: about 1 year ago - Stars: 19 - Forks: 10
EleutherAI/minetest Fork of minetest/minetest
Minetest is an open source voxel game engine with easy modding and game creation
Language: C++ - Size: 91.7 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 53 - Forks: 10
EleutherAI/TransformerEngine Fork of NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language: Python - Size: 2.52 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
EleutherAI/common-llm-settings
Common LLM Settings App
Language: JavaScript - Size: 315 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
EleutherAI/tuned-lens Fork of AlignmentResearch/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
Size: 1.62 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0
EleutherAI/tinydpo Fork of cat-state/tinypar
Size: 292 KB - Last synced: 29 days ago - Pushed: 11 months ago - Stars: 2 - Forks: 0
EleutherAI/tagged-pile
Part-of-Speech Tagging for the Pile and RedPajama
Language: Python - Size: 6.84 KB - Last synced: about 1 month ago - Pushed: 12 months ago - Stars: 9 - Forks: 2
EleutherAI/classifier-latent-diffusion
Language: Python - Size: 8.79 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 1
EleutherAI/irrlicht Fork of minetest/irrlicht
Minetest's fork of Irrlicht
Language: C++ - Size: 18 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 1
EleutherAI/llemma-sample-explorer
Sample explorer tool for the Llemma models.
Language: HTML - Size: 731 KB - Last synced: 29 days ago - Pushed: 7 months ago - Stars: 5 - Forks: 0
EleutherAI/maxtext Fork of google/maxtext
A simple, performant and scalable Jax LLM!
Size: 262 KB - Last synced: 29 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 2
EleutherAI/prefix-free-tokenizer
A prefix free tokenizer
Language: Python - Size: 6.84 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0
EleutherAI/truncated-gaussian
Method-of-moments estimation and sampling for truncated multivariate Gaussian distributions
Language: Python - Size: 7.81 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
EleutherAI/mup Fork of microsoft/mup
maximal update parametrization (ยตP)
Size: 16.5 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
EleutherAI/latent-video-diffusion
Latent video diffusion
Language: Python - Size: 39.1 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 3 - Forks: 2
EleutherAI/mdl
Minimum Description Length probing for neural network representations
Language: Python - Size: 164 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 11 - Forks: 1
EleutherAI/EvilModel
A replication of "EvilModel 2.0: Bringing Neural Network Models into Malware Attacks"
Size: 6.84 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0
EleutherAI/dps
Data processing system for polyglot
Language: Python - Size: 7.67 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 62 - Forks: 20
EleutherAI/hae-rae
Size: 1.52 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 26 - Forks: 4
EleutherAI/pile-literotica
Download, parse, and filter data from Literotica. Data-ready for The-Pile.
Language: Python - Size: 3.91 KB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 6 - Forks: 2
EleutherAI/minetest-baselines
Baseline agents for Minetest tasks.
Language: Python - Size: 79.1 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 6 - Forks: 1
EleutherAI/minetest-interpretabilty-notebook
Jupyter notebook for the interpretablity section of the minetester blog post
Language: Jupyter Notebook - Size: 18.1 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
EleutherAI/poll_website_demo
Flask Based Polling Website Demo
Language: Python - Size: 798 KB - Last synced: almost 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
EleutherAI/eleutherai-instruct-dataset
A large instruct dataset for open-source models (WIP).
Size: 58 MB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0
EleutherAI/pile_dedupe
Pile Deduplication Code
Language: Python - Size: 16.6 KB - Last synced: almost 1 year ago - Pushed: about 1 year ago - Stars: 3 - Forks: 0
EleutherAI/composer Fork of mosaicml/composer
Train neural networks up to 7x faster
Language: Python - Size: 8.04 MB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 3 - Forks: 2
EleutherAI/lm_perplexity
Language: Python - Size: 536 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 90 - Forks: 14
EleutherAI/exploring-contrastive-topology
Language: Jupyter Notebook - Size: 58 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 16 - Forks: 3
EleutherAI/magiCARP
One stop shop for all things carp
Language: Python - Size: 31.4 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 53 - Forks: 11
EleutherAI/multimodal-fid
Language: Python - Size: 67.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 5 - Forks: 0
EleutherAI/t-zero Fork of bigscience-workshop/t-zero
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
Size: 158 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
EleutherAI/NeMo Fork of NVIDIA/NeMo
NeMo: a toolkit for conversational AI
Language: Python - Size: 124 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 10 - Forks: 2
EleutherAI/pile-website Fork of rajpurkar/SQuAD-explorer
Language: HTML - Size: 80.9 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 3
EleutherAI/isaac-mchorse
EleutherAI's discord bot
Language: Python - Size: 208 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 0
EleutherAI/github-downloader Fork of noanabeshima/github-downloader
Script for downloading GitHub.
Language: Python - Size: 4.78 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 46 - Forks: 22
EleutherAI/pile-cc Fork of leogao2/commoncrawl_downloader
Size: 32.2 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 11 - Forks: 1
EleutherAI/lm_dataformat Fork of leogao2/lm_dataformat
Language: Python - Size: 80.1 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 3
EleutherAI/pilev2
Language: Python - Size: 19.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 11 - Forks: 9
EleutherAI/megatron-3d ๐ฆ
Language: Python - Size: 521 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 3 - Forks: 3
EleutherAI/equivariance
A framework for implementing equivariant DL
Language: Jupyter Notebook - Size: 1.08 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 10 - Forks: 3
EleutherAI/pile-pubmedcentral
A script for collecting the PubMed Central dataset in a language modelling friendly format.
Language: Python - Size: 18.7 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 7 - Forks: 1
EleutherAI/pile-explorer
For exploring the data and documenting its limitations
Language: Python - Size: 39.1 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 4 - Forks: 3
EleutherAI/pile-allpoetry
Scraper to gather poems from allpoetry.com
Language: Python - Size: 41 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 1
EleutherAI/datasets Fork of huggingface/datasets
๐ค The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Size: 41.8 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 9 - Forks: 3