GitHub topics: wikiextractor
JunKuang-algo/Agriculture-KnowledgeGraph-Data
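Crawler and data-processing scripts for the Wikidata knowledge base; scripts for aligning triple relations to the corpus; scripts for obtaining knowledge graph data.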
Language: JavaScript - Size: 137 MB - Last synced at: 18 days ago - Pushed at: almost 4 years ago - Stars: 272 - Forks: 115

shyamupa/wikidump_preprocessing
Extracting useful metadata from Wikipedia dumps in any language.
Language: Python - Size: 82 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 25 - Forks: 5

TomerAberbach/wikipedia-ngrams
📚 A Kotlin project which extracts ngram counts from Wikipedia data dumps.
Language: Kotlin - Size: 13.7 KB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

studerw/wiki-dump-parser
Java tool to parse Wikimedia dumps into Java Article POJOs for test or fake data.
Language: Java - Size: 1.14 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0
