An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: code-similarity

jplag/JPlag

State-of-the-Art Source Code Plagiarism & Collusion Detection. Check for plagiarism in a set of programs.

Language: Java - Size: 63.3 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 1,576 - Forks: 345

dodona-edu/dolos

:detective: Source code plagiarism detection

Language: TypeScript - Size: 43.1 MB - Last synced at: 2 days ago - Pushed at: 7 days ago - Stars: 291 - Forks: 42

jorge-martinez-gil/graphcodebert-interpretability

Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks

Language: Python - Size: 9.71 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

OSLL/code-plagiarism

Program for finding plagiarism in the source code written in Python 3, C, and C++ based on comparing AST metadata.

Language: Python - Size: 946 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 11 - Forks: 3

BojanStipic/resava

Plagiarism detection for source code

Language: Rust - Size: 225 KB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 6 - Forks: 2

danielplohmann/mcrit

The MinHash-based Code Relationship & Investigation Toolkit (MCRIT) is a framework created to simplify the application of the MinHash algorithm in the context of code similarity.

Language: Python - Size: 1.01 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 91 - Forks: 13

jorge-martinez-gil/ensemble-codesim

Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures

Language: Java - Size: 38.1 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0

fyrestone/pycode_similar

A simple plagiarism detection tool for python code

Language: Python - Size: 34.2 KB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 185 - Forks: 38

patois/HexraysToolbox

Hexrays Toolbox - Find code patterns within the Hexrays ctree

Language: Python - Size: 247 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 455 - Forks: 44

JackHCC/Awesome-Binary-Code-Similarity-Detection-2021

Awesome list for Binary Code Similarity Detection in 2021

Size: 1.95 KB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 0

nikita715/moss-api

API for Moss plagiarism analyzer

Language: Kotlin - Size: 76.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 2

jorge-martinez-gil/graphcodebert-feature-integration

Improving Source Code Similarity Detection with GraphCodeBERT and Additional Feature Integration

Language: Python - Size: 82 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

fanghon/antiplag

作业查重软件,它实现了程序代码、文档文本、图片之间的相似度检查。a code-similarity, text-similarity and image-similarity computation software for the codes, documents and images of assignment.

Language: Java - Size: 52.6 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 379 - Forks: 61

eren23/semantic-code-searcher

Basic example for searching code semantically in github profiles. In python

Language: Python - Size: 44 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

StardustDL/codesim

A similarity measurer on two programming assignments on Online Judge.

Language: Python - Size: 32.2 KB - Last synced at: 30 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

kylin-zhou/binary-sim

binary similarity using Deep learning

Language: Python - Size: 86.9 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

BK-SCOSS/scoss

A Source Code Similarity System - SCOSS

Language: Python - Size: 30.9 MB - Last synced at: 9 days ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 1

jorge-martinez-gil/crosslingual-clone-detection

Transcending Language Barriers in Software Engineering with Crosslingual Code Clone Detection

Language: Java - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

taabishm2/CoSi

Code Similarity detection for Python files and Jupyter Notebooks

Language: Jupyter Notebook - Size: 36.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Jaso1024/Semantic-Code-Embeddings

IEEE 2023 | SCALE: Semantic Code Analysis via Learned Embeddings

Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

island255/source2binary_dataset_construction

This is the repository for the paper "One to One or One to many? What function inline brings to binary similarity analysis"

Language: Python - Size: 50.8 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 4

JackHCC/Pcode-Similarity

二进制代码相似性检测算法(Algorithm for calculating similarity between function and library function.)

Language: Java - Size: 23.5 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 0

nikita715/mossclient

Simple client for Moss plagiarism analyzer

Language: Kotlin - Size: 115 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

nikita715/gitplag

Plagiarism analyser for git educational repositories

Language: Kotlin - Size: 5.94 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

rwehresmann/moss_ruby

Ruby gem to submit files to MOSS.

Language: Ruby - Size: 20.5 KB - Last synced at: 2 months ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

kariminf/trishalna

An anti-fraud program that I didn't finish. May be some day :p

Language: Java - Size: 61.5 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

Related Keywords
code-similarity 26 plagiarism-detection 14 moss 7 plagiarism 7 clone-detection 4 source-code-analysis 4 semantic-similarity 4 similarity-measures 4 plagiarism-checker 4 text-similarity 3 codebert 3 graphcodebert 3 education 3 reverse-engineering 2 binary-analysis 2 clone-coding 2 ast 2 neural-network 2 large-language-models 2 jplag 2 binary 2 learn-to-code 2 online-learning 2 plagiarism-checking 2 plagiarism-detector 2 plagiarism-prevention 2 software-plagiarism 2 academic-dishonesty 2 collusion-detection 2 code-duplication 1 assignment 1 search-algorithm 1 search 1 roberta 1 openai 1 java 1 accademic 1 llm 1 fraud-detection 1 phash 1 embeddings 1 cosine-similarity 1 cosine-distance 1 bert-embeddings 1 bert 1 cross-linguistic-data 1 code-understanding 1 knowledge-representation 1 scoss 1 gnn 1 algorithm 1 deep-learning 1 binary-similarity 1 ruby 1 nju-cs 1 nju 1 cpp 1 stanford 1 code-copying 1 sentence-transformers 1 sentence-embeddings 1 semantic 1 awesome 1 code-clones 1 disassembly 1 string-similarity 1 python 1 umap 1 pca-analysis 1 interpretability 1 code-analysis 1 code 1 hacktoberfest 1 fuzzy-matching 1 dodona 1 source-code-plagiarism 1 programming-education 1 program-analysis 1 plagiarism-check 1 cs-education 1 computer-science 1 vulnerability-scanner 1 variant-analysis 1 pattern-matching 1 loops 1 idapython-script 1 idapython 1 ida-pro 1 hexrays-toolbox 1 hexrays-decompiler 1 hexrays 1 hex-rays 1 decompiler 1 ctree 1 code-pattern-matching 1 code-comparison 1 bug-finding 1 semantic-similarity-measures 1 code-intelligence 1