Topic: "html-parser"
ariary/JSextractor
Quickly gather all JavaScript from a URL (CLI + TUI)
Language: Go - Size: 6.76 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 2

youhan26/dom-parser
DOM parser using regex; supports browser, Node, React Native, and so on
Language: JavaScript - Size: 60.5 KB - Last synced at: 26 days ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 3

duncan3dc/domparser
Wrappers for the PHP DomDocument class to provide extra functionality for HTML/XML parsing
Language: PHP - Size: 209 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 4

danny1113/html-parser-builder
A result builder that builds an HTML parser and transforms HTML elements into strongly-typed results, inspired by RegexBuilder.
Language: Swift - Size: 77.1 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 9 - Forks: 0

rnantes/swift-html-parser
Parse plaintext HTML into an object and easily search it to find elements
Language: HTML - Size: 874 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

imelgrat/feed-finder
A PHP class for extracting the URLs of RSS (1.0 and 2.0) and ATOM feeds associated with a page, as well as OPML outline documents.
Language: PHP - Size: 646 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 3

beliantech/boilertext
Extract content from HTML by removing unwanted boilerplate text.
Language: Go - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 9 - Forks: 1

algunion/HTMLForge.jl
Flexible HTML parsing and manipulation in Julia Programming Language
Language: Julia - Size: 52.7 KB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 8 - Forks: 2

sunshineplan/node
HTML parsing library, the alternative to BeautifulSoup in Golang.
Language: Go - Size: 85.9 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 0

creeperyang/html-parser-lite
A lightweight HTML parser and more.
Language: JavaScript - Size: 107 KB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 3

divyeshmakwana96/Pochta
Testing MJML/HTML emails simplified. A serverless CLI application for multiple integrations.
Language: JavaScript - Size: 776 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

willianantunes/pyfriends
Let's research all the seasons of the sitcom Friends and try to get some insights from them 🕵
Language: HTML - Size: 35.7 MB - Last synced at: 5 days ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 0

victornpb/benchmark-html-parser-libraries
A benchmark of JavaScript libraries for parsing HTML (CPU/RAM)
Language: HTML - Size: 4.47 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 1

genius257/AutoIt-HTML-Parser
Yet another HTML Parser written in AutoIt
Language: AutoIt - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 2

RohitAwate/DOMEngine
DOM manipulation engine written in C++.
Language: C++ - Size: 190 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 2

hean01/domx
HTML parser and DOM tree builder for Rust
Language: Rust - Size: 14.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 6 - Forks: 0

TRowbotham/PHPDOM
A modern alternative to PHP's built-in DOM classes.
Language: PHP - Size: 3.17 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 6 - Forks: 2

f34nk/elixir_html_tools
Overview of available HTML tools in Elixir
Language: HTML - Size: 4.36 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

imingyu/forgiving-xml-parser
An XML/HTML parser and serializer for JavaScript.
Language: TypeScript - Size: 16.3 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

akinemreyazici/read_news_app
This is an example project for an HTML parser in Kotlin
Language: Kotlin - Size: 107 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

Bystroushaak/pyDHTMLParser 📦
Lightweight HTML/XML parser for quick and dirty web scraping.
Language: Python - Size: 252 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 3

Epicfisher/TouhouDiscordBot
A Work-In-Progress Discord bot based on the largely popular Touhou series by ZUN.
Language: Python - Size: 329 KB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

rsharifnasab/telegram_export_analyzer
This script can analyze the number of Telegram messages over time
Language: Python - Size: 17.6 KB - Last synced at: 17 days ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

raymccrae/swift-htmlsaxparser
Swift wrapper around the libxml2 HTML parser to provide SAX-style HTML parsing
Language: Swift - Size: 101 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 2

PopFlamingo/MyHTML
A Swift wrapper for MyHTML, a fast, pure-C, HTML 5 parsing library
Language: Swift - Size: 48.8 KB - Last synced at: 28 days ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 0

emmanuelroecker/php-simply-html
Add, delete, modify, and get HTML tags, text, and links using CSS selectors
Language: PHP - Size: 53.7 KB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 6

luxcem/apifier
Apifier is a very simple HTML parser written in Python based on CSS selectors
Language: HTML - Size: 76.2 KB - Last synced at: 10 months ago - Pushed at: over 8 years ago - Stars: 6 - Forks: 1

esign-consulting/postdenuncia
Software project for citizens to report urban problems.
Language: Java - Size: 3.18 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 5 - Forks: 0

Project-OtaWilma/WilmaAPI
Provides access to Wilma's public API without the need for an API key. Mainly used for the functionality of the OtaWilma web client. Moderately stable, although it supports only one school. Not maintained to be production-ready.
Language: JavaScript - Size: 11 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

lrswss/akal2ical 📦
Perl script to retrieve waste collection dates from AfA Karlsruhe for the specified street and save them as an iCal file
Language: Perl - Size: 25.4 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

kevinhermawan/markup2json
A library for converting HTML and XML into JSON
Language: TypeScript - Size: 384 KB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1

ubbeg2000/pars
A simple package for parsing HTML files into DOM trees
Language: Go - Size: 27.3 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 2

iamlockelightning/BaiduProcess
Extract Baidu Baike Pages from HTML
Language: Java - Size: 44.3 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

ocramz/twelve
Like @11ty, but this goes up to 12
Language: Haskell - Size: 70.3 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 1

jedmitten/humble_catalog
A script to parse the saved Humble Bundle library HTML
Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 2

vitkarpov/fast-xml-parser
🚀 A fast XML parser in TypeScript with zero dependencies
Language: TypeScript - Size: 103 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

askonomm/dompa
A zero-dependency HTML5 document parser.
Language: Rust - Size: 147 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

Solrikk/DataDigger
DataDigger is a powerful and intuitive web application designed to extract and analyze data from web pages.
Language: Go - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

markuspoerschke/extractum
Extractum is a PHP library that extracts information from web pages.
Language: PHP - Size: 1.15 MB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 4 - Forks: 1

roblillack/gockl
Minimal XML processor for Go that does not fuck with your markup.
Language: Go - Size: 33.2 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 1

nix1707/WebScrapper-BrowserExtension
Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.
Language: TypeScript - Size: 70.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

1623311678/html-parser-server
Parses HTML into JSON; usable on both the server side and the front end
Language: JavaScript - Size: 24.4 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

tbjgolden/deno-htmlparser2
Deno port of `htmlparser2`
Language: TypeScript - Size: 79.1 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 3

cwsky0221/CWRichTextView
Android TextView that displays rich text, with support for custom HTML tags
Language: Java - Size: 126 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

talhashraf/parsel
Parsel is a Java library for parsing HTML and XML to extract data using XPath selectors.
Language: Java - Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 2

romagny13/html-parser
TypeScript/JavaScript HTML Parser
Language: TypeScript - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: over 8 years ago - Stars: 4 - Forks: 2

karambir/ugc-colleges
Python script to extract college names from the UGC, India website.
Language: Python - Size: 2.18 MB - Last synced at: 3 days ago - Pushed at: almost 13 years ago - Stars: 4 - Forks: 1

jgarber623/micromicro 📦
A Ruby gem for extracting microformats2-encoded data from HTML documents.
Language: Ruby - Size: 359 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 3 - Forks: 2

vborovikov/readability
A C# version of the readability lib
Language: C# - Size: 7.13 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

hashim21223445/ai-chatbot Fork of vercel/ai-chatbot
A full-featured, hackable Next.js AI chatbot built by Vercel
Language: TypeScript - Size: 2.31 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

thomasthiebaud/htmlstring-to-react
Convert a string containing html tags to an array of React elements. Light and secure
Language: TypeScript - Size: 1.19 MB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 4

staticka/staticka
Yet another static site generator in PHP.
Language: PHP - Size: 217 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

graetz23/xmlcc
An ANSI C++ XML library that keeps a SAX interface and an XML / DOM tree
Language: C++ - Size: 418 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 2

rohanasan/rohanasantml
Rohanasantml: an easy alternative to HTML!
Language: Rust - Size: 29.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

silenzzz/RuTracker4j
RuTracker Java library
Language: Java - Size: 81.1 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

wellloy1/strops-js
Provides simple and useful methods for string operations in JavaScript / Node.js
Language: JavaScript - Size: 97.7 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

glebliutsko/PriceParser
Parser for prices from websites
Language: C# - Size: 39.1 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

coderosh/docpa
A simple library that I use for web scraping. Uses htmlparser2 to parse the DOM.
Language: TypeScript - Size: 69.3 KB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

AntoData/on_page_basic_SEO_checker
This project provides methods and utilities for basic SEO checks on a page, using either the page's URL or a WebDriver instance that is currently browsing that page
Language: Python - Size: 7.87 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

Anikeshpatel/dompy
JavaScript DOM API for Python: an HTML parser and web scraping tool in Python
Language: HTML - Size: 470 KB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

1bk/gdelt_headline_analysis
News Headlines Analysis of (two) Websites - Using GDELT 2.0 Event Database
Language: HTML - Size: 715 KB - Last synced at: 4 months ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 2

majdsalloum/mate
Maté Browser
Language: Java - Size: 3.72 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

betrixed/dlang-xml
XML & HTML parser for the D programming language (2019 - D2)
Language: D - Size: 15.2 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

xnerhu/HTML-Parser
📄 An HTML parser prototype
Language: C# - Size: 82 KB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

luanmuniz/shorio
DOM manipulation for Node.js with a jQuery-like API
Language: JavaScript - Size: 43.9 KB - Last synced at: 24 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

iamareebjamal/get_results
Python script to download the results of a whole class/branch by providing an attendance Excel file.
Language: Python - Size: 346 KB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 4

jiangxianli/SimpleHtml
A PHP package for HTML DOM parsing with jQuery-like syntax
Language: PHP - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 0

ghuls-apps/github-calendar-api
An HTML parser to get data about the GitHub Profile Contributions Calendar
Language: Ruby - Size: 16.6 KB - Last synced at: 1 day ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 1

omar2535/BioLife-AU-01-attendance-parser
Biolife-AU-01 time clock parsing program
Language: Python - Size: 23.4 KB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

firas-codes1/spwder
A simple HTML form password brute-forcing tool written in Python.
Language: Python - Size: 25.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

zavzyatiy/drom_archive_parser
A fully automatic pipeline for parsing data from auto.drom.ru/archive
Language: Jupyter Notebook - Size: 2.44 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

luminati-io/curl_cffi-web-scraping
Use curl_cffi in Python to mimic browser TLS fingerprints for reliable and stealthy web scraping.
Size: 1.17 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

KTBsomen/httl-s
HTML, but as a templating language: Hyper Text Templating Language
Language: JavaScript - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

dinaraparanid/Star-Wars-Travel-KMP
Travel app set in the Star Wars universe
Language: Kotlin - Size: 27.3 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

jaijs/jai-body-parser
Simple and fast Node.js module for parsing HTTP request bodies. Part of the Jai.js ecosystem. Built without any third-party dependencies.
Language: JavaScript - Size: 55.7 KB - Last synced at: 9 days ago - Pushed at: 11 months ago - Stars: 2 - Forks: 1

tleino/purehtml
Reusable, purely HTML-only parser that can be used for developing crawlers, converters, or simple clients for consuming HTML.
Language: C - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ShiftHackZ/Joy-Reactor-Android
Android client app for website joyreactor.cc
Language: Kotlin - Size: 423 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

cacing69/cquery
Cquery is an acronym for Crawl Query. It's a PHP scraper with language expressions that can be used to scrape data from websites that use JavaScript or AJAX
Language: PHP - Size: 662 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 3

dinaraparanid/genius_track_number_in_album
Fetches a track's number in its album using its Genius URL
Language: Rust - Size: 26.4 KB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

dinaraparanid/genius_lyrics
Fetches the lyrics of a song from Genius by its URL
Language: Rust - Size: 20.5 KB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

MichaelE919/ncaa-stats-webscraper
Python webscraping module for NCAA Basketball Stats
Language: Python - Size: 352 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 6

MatheusTGP/HTML-Collector-Tkinter
Convert a web page into an HTML file. The software comes with a basic graphical interface built entirely in Python Tkinter.
Size: 18.6 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

yogendratamang48/parse_utils
Easy HTML/JSON parser for web scraping
Language: Python - Size: 22.5 KB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

jy0529/simple-html-parser
A simple HTML parser written in TypeScript
Language: TypeScript - Size: 195 KB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

bredbrains/tthk-app 📦
Unofficial mobile application for Tallinna Tööstushariduskeskus
Language: C# - Size: 2.04 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

enveezee/urearl
U R Earl is an abstraction of Python standard libraries for extracting and returning stuff from URLs
Language: Python - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

bersoy12/UniversityStudentAnalysis
Data Mining and Analysis
Language: PHP - Size: 7.8 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

gurkankaymak/gosoup
Lightweight Go library for pulling data out of HTML, inspired by BeautifulSoup and Jsoup
Language: Go - Size: 7.81 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

AntoData/WebScraperAllMusic
Simple example of a web scraper using Python. In this case, we ask the user via the console for the name of a band/artist and, using Selenium WebDriver and BeautifulSoup, print information about the discography of that artist/band
Language: Python - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

timolinn/nginB
[WIP] This is a hobbyist browser engine written in Go
Language: Go - Size: 6.84 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

millerlogic/htmlstrip
Strips HTML from the input and outputs plain text, streamed in real time without preloading the whole document
Language: Go - Size: 6.84 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

linkonoid/objhtml
Library for generating and rendering HTML elements in native Go using an object model.
Language: Go - Size: 3.38 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

sentialx/html-parser
An HTML parser prototype written in C# and .NET Core
Language: C# - Size: 25.4 KB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 1

codexguy/xSkrape.APIWrapper.REST
Data extraction, identification, and shaping functionality. This repository is for the REST-based interfaces to xSkrape.com, which offers these parsing services in a hosted environment.
Language: C# - Size: 84 KB - Last synced at: 22 days ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 2

akiya64/AmazonScrapingExcelFile
Scraping Amazon Catalog in Excel Book
Language: C# - Size: 71.3 KB - Last synced at: 4 months ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 1

azazar/uncaring-html-parser
HTML parser that intends to be fast, but hasn't been benchmarked or optimized yet
Language: Java - Size: 84 KB - Last synced at: 3 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

Couchtr26/Bookmark_Cleaner
Bookmark_Cleaner is a Python tool designed to automatically scan, organize, and clean your browser bookmarks. It detects and removes duplicates, dead links, and empty folders, making your bookmark collection tidy and efficient. Ideal for anyone who wants to maintain a well-organized set of online resources.
Language: JavaScript - Size: 3.91 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

vborovikov/brackets
Resilient markup parser library
Language: C# - Size: 1.07 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

flaviu22/domtree
C++ HTML parser
Language: C++ - Size: 6.68 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

DEmodoriGatsuO/minisoup-html-parser
A lightweight HTML parsing library inspired by Beautiful Soup, providing capabilities for HTML element analysis and extraction
Language: C# - Size: 69.3 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0
