GitHub / giganticode / codeprep
A toolkit for pre-processing large source code corpora
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/giganticode%2Fcodeprep
Stars: 47
Forks: 11
Open issues: 8
License: None
Language: Python
Size: 1.56 MB
Created at: about 6 years ago
Updated at: 4 months ago
Pushed at: over 2 years ago
Last synced at: 15 days ago
Topics: language-modeling, mining-software-repositories, natural-language-processing, source-code-analysis, word-segmentation