GitHub topics: wiki-parser
david-smejkal/wiki2txt
A tool to extract plain (unformatted) multilingual / language-agnostic text, redirects, links and categories from wikipedia backups (dumps). Designed to prepare clean training data for AI Training / Machine Learning software.
Language: Python - Size: 215 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 1

dmayilyan/kp_layout_analysis
Analysis of human typing and keyboard layout efficiency.
Language: Python - Size: 497 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1

Epicfisher/touhou-wiki-arrange-parser
A Touhou Wiki parser that returns a list of Touhou Arranges plus their Circles and Albums, including in HTML using GitHub Pages.
Language: Python - Size: 2.83 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

Epicfisher/touhoudex-parser
A Touhou Wiki parser that returns the "Touhou Puppet Play" Touhoudex with every Touhoumon and their Stats.
Language: Python - Size: 26.4 KB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0
