An open API service providing repository metadata for many open source software ecosystems.

Topic: "codebert"

neulab/code-bert-score

CodeBERTScore: an automatic metric for code generation, based on BERTScore

Language: Jupyter Notebook - Size: 24.6 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 188 - Forks: 16

dessertlab/EVIL

EVIL (Exploiting software VIa natural Language) is an approach to automatically generate software exploits in assembly/Python language from descriptions in natural language. The approach leverages Neural Machine Translation (NMT) techniques and a dataset that we developed for this work.

Language: Python - Size: 1.23 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 15 - Forks: 1

EhsanMashhadi/ISSRE2023-BugSeverityPrediction

Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.

Language: Java - Size: 14.1 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 2

RepoAnalysis/RepoSnipy

Neural search engine for discovering semantically similar Python repositories on GitHub

Language: Python - Size: 56.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 2

dessertlab/Targeted-Data-Poisoning-Attacks

This repository contains the code, the dataset and the experimental results related to the paper "Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks" accepted for publication at The 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024).

Language: Python - Size: 3.41 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 2

ML4SE2022/Group4

Fine-tuning CodeBERT with AST-based Vectors for Code Translation

Language: C# - Size: 11.4 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

jorge-martinez-gil/graphcodebert-interpretability

Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks

Language: Python - Size: 9.71 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

jorge-martinez-gil/ensemble-codesim

Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures

Language: Java - Size: 38.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

RepoMining/RepoSim4Py

A project for determining the similarity of python repositories based on embedding approach

Language: Jupyter Notebook - Size: 95.2 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 2

daimakram/Bug-Detection-Code-Summarization

Performs Code Summarization, Bug Detection, Bug Removal using different Natural language processing models including Garph CodeBERT, GREAT, GNN, CoText etc.

Language: Jupyter Notebook - Size: 95.7 KB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

jorge-martinez-gil/graphcodebert-feature-integration

Improving Source Code Similarity Detection with GraphCodeBERT and Additional Feature Integration

Language: Python - Size: 82 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

philippnormann/malicious-payload-detection

🕵️‍♂️ ML project to identify malicious web payloads, aimed at boosting the effectiveness of WAFs and IDSs.

Language: Jupyter Notebook - Size: 3.91 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

sarvagyakrcs/s0.dev

The modern web development landscape is plagued by a peculiar paradox: despite the abundance of UI components and design systems, developers still spend countless hours reimplementing similar interfaces. S0 addresses this challenge by introducing a novel approach that combines advanced vector search capabilities.

Language: Python - Size: 2.08 MB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

aleksibovellan/ai-python-code-validator

AI/ML Trained Python Code Validator with Gradio Web Interface

Language: Python - Size: 62.5 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AndrewDarnall/The-Code-Unmasker

SpringBoot-based microserviced web app which unmasks, using CodeBERT MLM, a code prompt

Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Ahmedfir/java-business-locations

extracts business-logic code locations.

Language: Java - Size: 229 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

hishamp3/codeDetection

Django implementation of CodeBERT for detecting vulnerable code.

Language: Python - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RepoAnalysis/RepoSim

This repository contains experiments on comparing the similarity of Python repositories using ML models.

Language: Jupyter Notebook - Size: 7.67 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

GianRomani/Neural_search_engine

Neural search engine for questions/answers from StackOverflow

Language: Jupyter Notebook - Size: 389 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

yegmor/CoCLR-ML_Reproducibility_Challenge_2021 Fork of Jun-jie-Huang/CoCLR

Reproducibility report ofCoSQA: 20,000+ Web Queries for Code Search and QuestionAnswering for ML Reproducibility Challenge 2021

Language: Python - Size: 7.66 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Related Topics
language-model 4 bert 4 transformers 4 clone-detection 3 code-understanding 3 code-similarity 3 graphcodebert 3 code 3 semantic-similarity 3 similarity-measures 3 python 2 nmt 2 ast 2 machine-learning 2 dataset 2 source-code-analysis 2 code-analysis 1 interpretability 1 pca-analysis 1 umap 1 webinterface 1 spring-boot 1 microservices 1 masked-language-models 1 docker 1 vulnerabilities 1 software-security-assessment 1 data-poisoning-attacks 1 code-generation 1 semantic-analysis 1 naturalness 1 mutation-testing 1 business-logic-component 1 web-interface 1 validator 1 torch 1 python3 1 ml 1 microsoft 1 gradio-interface 1 gradio 1 datasets 1 ai 1 score 1 codebertscore 1 code-bertscore 1 code-bert-score 1 bertscore 1 similarity-search 1 retrival-augmented-generation 1 pg-vector 1 nextjs 1 multimodal-embeddings 1 fastapi 1 bun 1 semantic-similarity-measures 1 code-intelligence 1 code-clones 1 clone-coding 1 code-search 1 code-question-answering 1 coclr 1 software-exploitation 1 shellcode 1 seq2seq 1 linux 1 exploit 1 encoder 1 decoder 1 assembly 1 semantic-search 1 stackoverflow 1 search-engine 1 pytorch 1 neural-search 1 jina-search 1 jina 1 information-retrieval 1 huggingface-transformers 1 huggingface 1 data-mining 1 bert-embeddings 1 software-engineering 1 natural-language-processing 1 web-security 1 web-application-firewall 1 random-forest 1 payload-detection 1 intrusion-detection-system 1 feature-engineering 1 cybersecurity 1 streamlit-application 1 neural-search-engine 1 github-repository-search 1 llm-fine-tuning 1 large-language-models 1 html-css 1 django-framework 1 severity-prediction 1 largelanguagemodel 1