An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: repository-mining

adamtornhill/code-maat

A command line tool to mine and analyze data from version-control systems

Language: Clojure - Size: 1.35 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 2,461 - Forks: 229

dreamyguy/gitlogg

💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'

Language: JavaScript - Size: 16.8 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 134 - Forks: 27

codingwithshawnyt/GitAnalyzer

Dive deep into the history of any Git repository. This Python framework provides comprehensive tools for analyzing source code changes, commit metadata, and developer contributions at a granular level.

Language: Python - Size: 31.6 MB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 5 - Forks: 2

VariantSync/DiffDetective

Library for Variability-Aware Differencing and the Analysis of Edits to Preprocessor-Based Software Product Lines

Language: Java - Size: 58.5 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 9 - Forks: 5

SAP-samples/security-research-commit-anomaly-detection

SAP Security Research sample code and data to reproduce the research done in our paper "Detecting Backdoors in Collaboration Graphs of Software Repositories".

Language: Python - Size: 157 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 9 - Forks: 0

Andrei-Constantin-Programmer/TDD_In_Practice

⛏️ Repository containing scripts and other resources for finding out how many projects use TDD in practice, based on Git history

Language: Python - Size: 1.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

XIMDEX/xdam-laravel 📦

Backend for Ximdex's Document Asset Management Service (XDAM)

Language: PHP - Size: 2.06 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

jalalmostafa/mining-afxdp

Mining Linux Kernel Repositories to extract knowledge about AF_XDP

Language: Python - Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

xserban/GraphRepo

GitHub repo to Neo4j (and back)

Language: Python - Size: 362 KB - Last synced at: 7 days ago - Pushed at: almost 4 years ago - Stars: 19 - Forks: 5

CESEL/RelationalGit

RelationalGit extracts valuable information about commits, blame, changes, devs, and pull requests out of git's data structure and imports it into a relational database such as Microsoft SQL Server. This data can serve as a basis for further source code mining analysis.

Language: C# - Size: 461 KB - Last synced at: 10 days ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 6

jrfaller/diggit

An agile tool to analyze Git repositories

Language: Ruby - Size: 239 KB - Last synced at: 18 days ago - Pushed at: almost 4 years ago - Stars: 20 - Forks: 5

philippedeb/susie

Susie checks GitHub repositories for sustainability and provides interesting knowledge for developers regarding sustainable software development.

Language: TypeScript - Size: 4.8 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 1

DeGraciaMathieu/php-git-insights-analyzer

Analyze the Git history of your PHP files

Language: PHP - Size: 17.9 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 0

EricTRL/todo-bot-repo-identifier

Identifies and clones repositories in which a given bot or user has created at least one issue.

Language: Python - Size: 17.6 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 2

yuzhimanhua/HiGitClass

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)

Language: Python - Size: 2.76 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 65 - Forks: 2

deeplook/reposheet

Language: Python - Size: 233 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

abilian/gitstats (fork of ErikBjare/gitstats)

Git history statistics generator

Language: Python - Size: 394 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

DouglasGiordano/extract-github-repository-v4

Extract certain data from GitHub repositories using GitHub's own v4 (GraphQL) API.

Language: JavaScript - Size: 6.06 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

cirovitale/TRACE

The TRACE tool, developed as part of my Bachelor's thesis in Computer Science at the University of Salerno, is designed to analyze GitHub repositories, with the main purpose of calculating the Cultural Dispersion in a community of developers.

Language: Python - Size: 114 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ciselab/CPS_repo_mining

Repository mining for searching Performance Antipatterns in Cyber-Physical System Projects.

Language: Python - Size: 15.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

softvis-research/Getaviz

With Getaviz you can solve software engineering problems visually by exploring software artifacts in 2D, 3D, and virtual reality.

Language: Java - Size: 267 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 49 - Forks: 49

jackwadden/GitHub-Clone-Scraper

Python script that scrapes GitHub repositories to keep track of total clone counts. This is useful for projects that are NSF funded, where "impact" (total downloads) is required to be reported.

Language: Python - Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 3

correiajoao/jmt

JMT - A multi-language tool to collect code elements impacted by commits.

Language: Python - Size: 3.35 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

dreamyguy/gitinsight

📊 👁 🧘 Visualise your git log in new and exciting ways!

Language: JavaScript - Size: 26.2 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

DLR-SC/Hackathon

Results and exchange files of the Repository Mining Hackathon.

Language: Jupyter Notebook - Size: 1.45 MB - Last synced at: 26 days ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 0

ekincanufuktepe/change-instepector-java

This project is on inspecting change types from commits (QRS2021)

Language: Java - Size: 2.95 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

XIMDEX/xsir_v0

Ximdex Semantic Information Repository

Language: PHP - Size: 89.8 KB - Last synced at: 20 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

mohayemin/param-diff

Mines Java Git repositories and finds commits that added a parameter to a method

Language: Java - Size: 464 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

xserban/GraphRepo-Demo

A demo Jupyter Notebook for the GraphRepo Project

Language: Jupyter Notebook - Size: 1.09 MB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

DLR-SC/Gitlab2Graph

A pipeline processor to extract data from GitLab and transform it into a graph representation

Language: Python - Size: 12.7 KB - Last synced at: 26 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

ayaankazerouni/incremental-testing

Repo mining and analysis code to measure adherence to incremental testing.

Language: Java - Size: 1.01 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

bdrupieski/GitOmega

Analyze Git repository activity

Language: C# - Size: 13.7 KB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Keywords
repository-mining (32), git (9), github-api (4), java (3), python (3), statistics (3), pydriller (2), software-development (2), software-engineering (2), neo4j (2), repository-utilities (2), github (2), repository (2), git-log (2), data-mining (2), technical-debt (2), clone (1), software-visualization (1), software-quality (1), software-analytics (1), program-comprehension (1), scraper (1), performance (1), cyber-physical-systems (1), antipatterns (1), libgit2sharp (1), antipattern (1), culture (1), cultural-engineering (1), cultural-dispersion (1), analyzer (1), extract-data (1), api-client (1), git-mining (1), scan-tool (1), jupyterlab (1), jupyter (1), binder (1), weakly-supervised-learning (1), text-classification (1), metadata (1), csharp (1), software-engineering-research (1), r (1), incremental-development (1), cs-education (1), graph-representation (1), gitlab (1), graphrepo (1), program-analysis (1), semantic-web (1), semantic-repository (1), repository-service (1), repository-management (1), metadata-management (1), jcr (1), graphql (1), graph-database (1), content-services (1), content-repository (1), cmis-server (1), cmis (1), parsing (1), parse-tree (1), change-type-detection (1), bug-fix-changes (1), visualisation (1), react (1), gitlogg (1), software-engineering-resear (1), abstract-syntax-tree (1), ximdex (1), microservice (1), json-api (1), indexing (1), digital-asset-management (1), backend (1), university-project (1), unit-testing (1), tdd (1), matplotlib (1), apache (1), sample-code (1), sample (1), reproducible-research (1), backdoor (1), anomaly-detection (1), variability-analysis (1), variability (1), software-product-lines (1), software-evolution (1), diffing (1), differencing (1), git-analysis (1), code-analysis (1), multiple-repositories (1), json-parser (1), json (1), code-analysis-tool (1), behavioral-code-analysis (1)