GitHub / bigcode-project 26 Repositories
BigCode Project is an open scientific collaboration run by Hugging Face and ServiceNow Research, focused on open and responsible development of LLMs for code.
bigcode-project/bigcode-dataset
Language: Jupyter Notebook - Size: 3.8 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 473 - Forks: 79

bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
Language: Python - Size: 67.4 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 7,449 - Forks: 527

bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language: Python - Size: 872 KB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 978 - Forks: 251

bigcode-project/starcoder2
Home of StarCoder2!
Language: Python - Size: 44.9 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 1,967 - Forks: 186

bigcode-project/selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
Language: Python - Size: 254 KB - Last synced at: 24 days ago - Pushed at: 7 months ago - Stars: 313 - Forks: 23

bigcode-project/Megatron-LM Fork of NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language: Python - Size: 6.37 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 393 - Forks: 52

bigcode-project/bigcodebench
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
Language: Python - Size: 6.52 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 416 - Forks: 52

bigcode-project/octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: 24 days ago - Pushed at: 8 months ago - Stars: 472 - Forks: 27

bigcode-project/bigcode-analysis
Repository for analysis and experiments in the BigCode project.
Language: Jupyter Notebook - Size: 17.9 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 124 - Forks: 21

bigcode-project/the-stack-v2
Code for the curation of The Stack v2 and StarCoder2 training data
Language: Jupyter Notebook - Size: 189 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 113 - Forks: 9

bigcode-project/starcoder.cpp
C++ implementation for 💫StarCoder
Language: C - Size: 7.12 MB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 456 - Forks: 39

bigcode-project/astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
Language: Jupyter Notebook - Size: 68.4 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 62 - Forks: 2

bigcode-project/pii-lib
Code for PII detection and redaction in code datasets
Language: Python - Size: 2.26 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 3

bigcode-project/bigcodebench-annotation
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Language: Jupyter Notebook - Size: 90.5 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 11

bigcode-project/jupytercoder
Language: JavaScript - Size: 771 KB - Last synced at: 24 days ago - Pushed at: almost 2 years ago - Stars: 141 - Forks: 14

bigcode-project/bigcode-website
Source of the website of the BigCode project.
Language: HTML - Size: 4.44 MB - Last synced at: 24 days ago - Pushed at: 25 days ago - Stars: 21 - Forks: 4

bigcode-project/bigcode-inference-benchmark
Language: Python - Size: 4.62 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 19 - Forks: 4

bigcode-project/opt-out-v2
Repository for opt-out requests.
Size: 6.84 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 2

bigcode-project/transformers
Language: Python - Size: 141 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 26 - Forks: 8

bigcode-project/bigcode-tokenizer
Language: Jupyter Notebook - Size: 128 KB - Last synced at: 24 days ago - Pushed at: almost 2 years ago - Stars: 15 - Forks: 3

bigcode-project/bigcode-encoder
Language: Python - Size: 2.63 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 29 - Forks: 3

bigcode-project/bigcode-data-mix
Language: Python - Size: 16.6 KB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

bigcode-project/bigcode-notebooks
Language: Jupyter Notebook - Size: 3.09 MB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

bigcode-project/text-generation-inference Fork of huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language: Python - Size: 723 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

bigcode-project/Megatron-LM-deprecated
Language: Python - Size: 3.32 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

bigcode-project/admin
A place for generic issues and administrative things.
Size: 9.77 KB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 0

bigcode-project/search
Language: Python - Size: 11.7 KB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bigcode-project/bigcode-demo
A place to build and share model demos
Size: 3.91 KB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
