Topic: "wiki-parser"
david-smejkal/wiki2txt
A tool to extract plain (unformatted) multilingual / language-agnostic text, redirects, links and categories from wikipedia backups (dumps). Designed to prepare clean training data for AI Training / Machine Learning software.
Language: Python - Size: 215 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 1

Epicfisher/touhoudex-parser
A Touhou Wiki parser that returns the "Touhou Puppet Play" Touhoudex with every Touhoumon and their Stats.
Language: Python - Size: 26.4 KB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Epicfisher/touhou-wiki-arrange-parser
A Touhou Wiki parser that returns a list of Touhou Arranges plus their Circles and Albums, including in HTML using GitHub Pages.
Language: Python - Size: 2.83 MB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

dmayilyan/kp_layout_analysis
Analysis of human typing and keyboard layout efficiency.
Language: Python - Size: 497 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 1

thisisayushg/archean
Archean is a tool to extract information from Wikipedia dumps in a JSON format
Last synced at: 3 months ago - Stars: 0 - Forks: 1