Topic: "ml4code"
saltudelft/ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Size: 573 KB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 708 - Forks: 93

ml4code/ml4code.github.io
Website for "A Survey of Machine Learning for Big Code and Naturalness"
Language: CSS - Size: 24.5 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 291 - Forks: 87

danielzuegner/code-transformer
Implementation of the paper "Language-agnostic representation learning of source code from structure and context".
Language: Python - Size: 2.65 MB - Last synced at: 8 months ago - Pushed at: about 3 years ago - Stars: 166 - Forks: 31

mast-group/convolutional-attention 📦
Repository for the code of the "A Convolutional Attention Network for Extreme Summarization of Source Code" paper
Language: HTML - Size: 1.59 MB - Last synced at: 3 months ago - Pushed at: almost 9 years ago - Stars: 119 - Forks: 31

JetBrains-Research/code2seq
PyTorch's implementation of the code2seq model.
Language: Python - Size: 6.14 MB - Last synced at: 19 days ago - Pushed at: 11 months ago - Stars: 62 - Forks: 18

JetBrains-Research/psiminer
A Tool for Mining Rich Abstract Syntax Trees from Code
Language: Kotlin - Size: 753 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 58 - Forks: 12

mast-group/api-mining 📦
Probabilistic API Mining
Language: Java - Size: 21.1 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 53 - Forks: 16

mast-group/tassal 📦
Tree-based Autofolding Software Summarization Algorithm
Language: Java - Size: 206 KB - Last synced at: 3 months ago - Pushed at: almost 9 years ago - Stars: 42 - Forks: 7

bentrevett/code2vec 📦
A PyTorch implementation of `code2vec: Learning Distributed Representations of Code` (Alon et al., 2018)
Language: Python - Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 31 - Forks: 8

SunLab-GMU/GraphSPD
The official repository of "GraphSPD: Graph-Based Security Patch Detection with Enriched Code Semantics". The paper will appear in the IEEE Symposium on Security and Privacy (S&P), San Francisco, CA, May 22-26, 2023.
Language: Shell - Size: 90.4 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 27 - Forks: 6

JetBrains-Research/embeddings-for-trees
Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.
Language: Python - Size: 733 KB - Last synced at: 27 days ago - Pushed at: over 3 years ago - Stars: 22 - Forks: 4

ALFA-group/adversarial-code-generation
[ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shiyu Chang, Quanfu Fan, Gaoyuan Zhang, and Una-May O'Reilly
Language: Python - Size: 16.2 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 4

naturalness/sensibility
Fixes Java syntax errors with LSTM neural networks! [proof-of-concept]
Language: Python - Size: 1.39 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 18 - Forks: 5

tud-ccc/compy-learn
ComPy-Learn is a framework for exploring program representations for ML4CODE tasks.
Language: Python - Size: 326 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 10

bentrevett/extreme-summarization-of-source-code 📦
Implementation of 'A Convolutional Attention Network for Extreme Summarization of Source Code' in PyTorch using TorchText
Language: Python - Size: 42 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 13 - Forks: 7

EngineeringSoftware/time-segmented-evaluation
Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.
Language: Python - Size: 75.2 KB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 1

saltudelft/type4py-vscode-ext
VSCode Extension of Type4Py
Language: TypeScript - Size: 5.94 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 0

msintaha/BugClassificationWithGNN
A graph based bug classifier using the dgl library and DeepBugs dataset
Language: Python - Size: 188 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

hadary-ai/php-code2seq-extractor
Extracts code2seq compatible datasets from PHP source files.
Size: 2.93 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

eladn/NDFA
Neural Data-Flow Analysis: A tool for solving program-related tasks which involve data-flow analysis using deep neural networks
Language: Python - Size: 1.26 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ALFA-group/CLAW-SAT Fork of OPTML-Group/CLAW-SAT
[SANER 2023] "CLAWSAT: Towards Both Robust and Accurate Code Models" by Jinghan Jia*, Shashank Srikant*, Tamara Mitrovska, Chuang Gan, Shiyu Chang, Sijia Liu, Una-May O'Reilly
Size: 27.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

makhshari/BugClassificationWithGNN Fork of msintaha/BugClassificationWithGNN
A graph based bug classifier using the dgl library and DeepBugs dataset
Language: Python - Size: 188 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

maxkvant/bert-on-source-code
Language: Jupyter Notebook - Size: 333 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
