An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ai4code

microsoft/multilspy

multilspy is a lsp client library in Python intended to be used to build applications around language servers.

Language: Python - Size: 265 KB - Last synced at: 2 days ago - Pushed at: 27 days ago - Stars: 327 - Forks: 58

microsoft/monitors4codegen

Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multispy` is a lsp client library in Python intended to be used to build applications around language servers.

Language: Python - Size: 6.18 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 258 - Forks: 32

deep-symbolic-mathematics/llm-srbench

[ICML2025 Spotlight] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

Language: Python - Size: 1.32 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 18 - Forks: 1

deep-symbolic-mathematics/LLM-SR

[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on Scientific Equation Discovery and Symbolic Regression with Large Language Models

Language: Python - Size: 8.82 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 108 - Forks: 13

salesforce/CodeTF 📦

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

Language: Python - Size: 10.7 MB - Last synced at: 3 days ago - Pushed at: 16 days ago - Stars: 1,477 - Forks: 99

ise-uiuc/magicoder

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

Language: Python - Size: 2.4 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 2,012 - Forks: 165

replit/ReplitLM 📦

Inference code and configs for the ReplitLM model family

Language: Python - Size: 460 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 970 - Forks: 93

FSoft-AI4Code/TheVault

[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation

Language: Jupyter Notebook - Size: 9.44 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 92 - Forks: 9

saltudelft/ml4se

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

Size: 573 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 708 - Forks: 93

FSoft-AI4Code/CodeCapybara

Open-source Self-Instruction Tuning Code LLM

Language: Python - Size: 922 KB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 170 - Forks: 11

JY0284/code_completion_as_human_action_prediction

This repository contains the core methods and models described in the paper “Represent Code as Action Sequence for Predicting Next Method Call.” It uses action sequence modeling to predict method calls in Python code based on developer intentions, treating code editing as a sequence of human-like actions.

Language: Python - Size: 4.3 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 14 - Forks: 0

FSoft-AI4Code/CodeFlow

Predicting Program Behavior with Dynamic Dependencies Learning

Language: Python - Size: 2.49 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 22 - Forks: 0

wyt2000/InverseCoder

The official code of the paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct"(https://arxiv.org/abs/2407.05700).

Language: Python - Size: 464 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

GhabiX/SRepair

âś…SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug

Language: Python - Size: 2.19 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 25 - Forks: 0

Alex-Mathai-98/Monolith-to-Microservices

This paper explores the idea of using heterogeneous graph neural networks (Het-GNN) to partition old legacy monoliths into candidate microservices. We additionally take membership constraints that come from a subject matter expert who has deep domain knowledge of the application.

Language: Python - Size: 261 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

ALFA-group/adversarial-code-generation

[ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shiyu Chang, Quanfu Fan, Gaoyuan Zhang, and Una-May O'Reilly

Language: Python - Size: 16.2 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 4

ALFA-group/CLAW-SAT Fork of OPTML-Group/CLAW-SAT

[SANER 2023] "CLAWSAT: Towards Both Robust and Accurate Code Models" by Jinghan Jia*, Shashank Srikant*, Tamara Mitrovska, Chuang Gan, Shiyu Chang, Sijia Liu, Una-May O'Reilly

Size: 27.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0