Topic: "data-for-robots"
david-smejkal/wiki2txt
A tool to extract plain (unformatted) multilingual / language-agnostic text, redirects, links and categories from wikipedia backups (dumps). Designed to prepare clean training data for AI Training / Machine Learning software.
Language: Python - Size: 215 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 6 - Forks: 1

Related Topics
ai-learning
1
ai-learning-tool
1
ai-training
1
data-parser-for-ai
1
machine-learning
1
machine-learning-tool
1
plaintext-data-for-ai
1
tool-for-ai
1
training-data
1
wiki-parser
1
wiki-to-plaintext
1
wiki-to-text
1
wiki-to-txt
1
wiki2plaintext
1
wikidump-parser
1
wikidump-to-plaintext
1
wikidump-to-txt
1
wikidumps-parser
1
wikipedia-to-txt
1