An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: code-understanding

unoplat/unoplat-code-confluence

Maintain a live, pluggable context layer per repo that renders and updates Agents.md

Language: Python - Size: 55 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 71 - Forks: 6

ioncakephper/repo-description

An AI-powered CLI tool that automatically generates clear, natural-language descriptions for every file within a given repository, enhancing code understanding and documentation.

Language: JavaScript - Size: 132 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

akashs101199/ask-my-code-gen-ai

Conversational AI code assistant powered by Mistral & RAG. Explore codebases through natural languageβ€”ask questions, find functions, understand logic, and generate documentation. Uses vector embeddings for semantic search. Runs locally with Ollama for complete privacy. Zero API costs, your code never leaves your machine.

Language: Python - Size: 5.86 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

QuantLet/Encode-the-Qode

Towards Code Summarization for Scientific Domain Experts on Scarce Data (Code accompanying the research paper)

Language: Jupyter Notebook - Size: 123 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

Alireza29675/yar-agent

Yar β€” Understand Complex Codebases Fast ⚑ πŸš€

Language: TypeScript - Size: 9.52 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 0

SecurityLab-UCD/TF-Bench

[NeurIPS'25] TF-Bench: Evaluating Program Semantics Reasoning with Type Inference in System F

Language: Python - Size: 1.23 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

FSoft-AI4Code/HyperAgent

Generalist Software Agents to Solve Soware Engineering Tasks

Language: Python - Size: 42.3 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 220 - Forks: 20

Kodezi/Chronos

Kodezi Chronos Debugging-first language model achieving 65.3% autonomous bug fixing (6-7x better than GPT-4). Research, benchmarks & evaluation framework. Model available Q1 2026 via Kodezi OS.

Language: Java - Size: 17.2 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 2,447 - Forks: 28

CRJFisher/code-charter

Visual summaries for code repositories

Language: TypeScript - Size: 3.08 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 0

salesforce/CodeTF πŸ“¦

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

Language: Python - Size: 10.7 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1,479 - Forks: 99

evdcush/fart

you cannot fart without art

Language: Python - Size: 85.9 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 14 - Forks: 1

salesforce/CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

Language: Python - Size: 10.7 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 2,999 - Forks: 458

wjn1996/HugNLP

HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 HugNLP will released to @HugAILab

Language: Python - Size: 3.71 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 247 - Forks: 13

boredom1234/CodeCraft

A powerful CLI tool using vector embeddings and LLMs to help developers understand codebases through natural language. Ask questions in plain English, get context-aware responses, analyze GitHub repos, and generate documentation. Your AI coding companion for quick codebase exploration.

Language: Python - Size: 1.09 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

J-McNamara/smoosh

Snapshot an entire repo or directory as plaintext on the clipboard and paste to your favorite AI tool!

Language: Python - Size: 72.3 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

wala/graph4code

GraphGen4Code: a toolkit for creating code knowledge graphs based on WALA code analysis and extraction of documentation and forum content.

Language: Jupyter Notebook - Size: 175 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 233 - Forks: 35

williamfzc/srctag

Tag source files with real-world stories.

Language: Python - Size: 197 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

GNOEYHEAT/CodeSim_cpp

μ½”λ“œ μœ μ‚¬μ„± νŒλ‹¨ μ‹œμ¦Œ2 AI κ²½μ§„λŒ€νšŒ, DACON (2024.03.04 ~ 2024.04.01)

Language: Python - Size: 618 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

RepoMining/RepoSim4Py

A project for determining the similarity of python repositories based on embedding approach

Language: Jupyter Notebook - Size: 95.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

HugAILab/HugNLP

CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊

Language: Python - Size: 4.16 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 370 - Forks: 45

Jaso1024/Semantic-Code-Embeddings

IEEE 2023 | SCALE: Semantic Code Analysis via Learned Embeddings

Language: Python - Size: 24.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

RepoAnalysis/RepoSnipy

Neural search engine for discovering semantically similar Python repositories on GitHub

Language: Python - Size: 56.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

RepoAnalysis/RepoSim

This repository contains experiments on comparing the similarity of Python repositories using ML models.

Language: Jupyter Notebook - Size: 7.67 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Related Keywords
code-understanding 23 transformers 6 language-model 5 llm 5 code-analysis 5 natural-language-processing 4 codebert 3 code-generation 3 developer-tools 3 ai 3 large-language-models 3 code 3 prompt-based-learning 2 pytorch 2 few-shot-learning 2 deep-learning 2 benchmark 2 code-summarization 2 code-intelligence 2 cli 2 python 2 code-documentation 1 ast-parsing 1 ai-code-assistant 1 supervised-learning 1 semi-supervised-learning 1 neural-network 1 pre-trained-language-models 1 knowledge-enhancement 1 github-repository-search 1 neural-search-engine 1 streamlit-application 1 toilet 1 text-box 1 style 1 smells-good 1 readability 1 productivity 1 font 1 figlet-fonts 1 semantic-search 1 figlet 1 fart 1 cultured 1 banner 1 auteur 1 ascii-art 1 ascii 1 type-inference 1 semantic-analysis 1 semantic-similarity 1 git 1 knowledge-graph 1 summari 1 hugchat 1 pip-package 1 llms 1 developer 1 code-analyzer 1 chatbot 1 vector-embeddings 1 information-extraction 1 together-ai 1 tech-onboarding 1 semantic-code-search 1 instruction-tuning 1 relevance-scoring 1 python-cli-tools 1 natural-language-code-analysis 1 code-similarity 1 monorepo-management 1 llm-applications 1 legacy-code-analysis 1 github-integration 1 knowledge-representation 1 faiss-vector-search 1 developer-productivity 1 codebase-navigation 1 code-exploration 1 system-f 1 program-semantics 1 haskell 1 yar 1 claude-agent-sdk 1 agent 1 quantitative-methods 1 ml 1 finance 1 code-analysis-true 1 vector 1 sematic-search 1 rag 1 open-source 1 ollama 1 mistral 1 langchain 1 genai 1 embeddings 1 code-assistant 1 chromadb 1