GitHub topics: htmlparser
lokesh144/HTMLer
A Minimal HTML Parser and Renderer written in CPP.
Language: C - Size: 9.29 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

Erkmik/best-python-html-parsers
The top Python HTML parsers for web scraping, including Beautiful Soup, lxml, PyQuery, Scrapy, and more.
Size: 6.84 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

cheeriojs/cheerio
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
Language: TypeScript - Size: 17.8 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 29,336 - Forks: 1,669

KiraLT/isomorphic-htmlparser
HTML parser that works both in JavaScript and NodeJS with TypeScript support
Language: TypeScript - Size: 180 KB - Last synced at: 7 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 1

chatnoir-eu/chatnoir-resiliparse
A robust web archive analytics toolkit
Language: Cython - Size: 1.86 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 101 - Forks: 15

ajordan2984/HtmlToText
A compact library written in C# to parse out all the text from news articles.
Language: C# - Size: 8.79 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

shuerguo999/gogoAST
The simplest tool to parse/transform/generate code on ast
Language: JavaScript - Size: 473 KB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 1

willforde/python-htmlement Fork of marmelo/python-htmlparser
Pure-Python HTML parser with ElementTree XPath support.
Language: Python - Size: 908 KB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 4

okwilkins/Web-Crawler
This program will crawl through entire domains, exporting every link it can find into a txt file.
Language: Python - Size: 232 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

liulinboyi/HTMLParser
HTMLParser 解析HTML 欢迎参考 HTMLParser Parsing HTML Welcome to the reference
Language: TypeScript - Size: 823 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 14 - Forks: 1

sukhcha-in/dart_web_scraper
Config-based, reusable web scraper for web and API scraping. Scrape multiple pages or APIs without writing parsers or scraping logic, using simple configurations for efficient scraping.
Language: Dart - Size: 86.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 7 - Forks: 1

LoboEvolution/CobraEvolution
CobraEvolution HTML render and parser
Language: Java - Size: 3.85 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 3

Rct567/DomQuery
PHP library for easy 'jQuery like' DOM traversing and manipulation.
Language: PHP - Size: 297 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 134 - Forks: 40

umijs/niddle
A super fast nodejs addon for html parsing and manipulation written in rust.
Language: JavaScript - Size: 2.73 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

dipiro/HackerNewsHeadlines
iOS App for Hacker News
Language: Swift - Size: 375 KB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

CorlyDream/html2obj
纯 js 解析 html,html 转 对象,对象转html,抽取文本
Language: HTML - Size: 647 KB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

Aghajari/JSSoup
JSSoup: the JavaScript HTML DOM parser for node.js
Language: JavaScript - Size: 77.1 KB - Last synced at: about 15 hours ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 2

EIGHTFINITE/cheerio
📦 Cheerio drop in replacement. Always mirrors the latest version. Patched to use the Lodash isArrayLike function instead of the flawed implementation in the Cheerio source. — `npm install cheerio@github:EIGHTFINITE/cheerio#main` — https://github.com/cheeriojs/cheerio
Language: TypeScript - Size: 261 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

marc-ed-raffalli/confluence-to-jekyll
Confluence (HTML) to Jekyll (Markdown) converter script in JS to facilitate IBM Loopback documentation migration
Language: JavaScript - Size: 13.7 KB - Last synced at: 20 days ago - Pushed at: over 8 years ago - Stars: 10 - Forks: 2

ayush-129/BLOG-PLATFORM
A user friendly blogging platform where user can read others' posts and post/edit/delete own posts.
Language: JavaScript - Size: 140 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

wopehq/muninn
Muninn is a fast and flexible HTML parsing tool that simplifies the process of extracting data from HTMLs.
Language: TypeScript - Size: 946 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 127 - Forks: 4

shobhit45/Blog-Sphere
Blog Sphere is your ultimate destination for insightful and engaging thoughts to share
Language: JavaScript - Size: 68.4 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

anarmhr/autodidacts
Software Engineering course.
Language: Python - Size: 257 KB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

timolinn/html
[WIP] HTML Parser written in Go
Language: Go - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

rearc-data/who-covid-19-cases-deaths
Coronavirus (COVID-19) Cases and Deaths | World Health Organization (WHO)
Language: Shell - Size: 39.1 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

rearc-data/nyt-states-reopen-status-covid-19
COVID-19 United States Reopen and Shut Down Status by State | NY Times
Language: Python - Size: 89.8 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

rearc-data/google-covid-19-community-mobility-reports
Google COVID-19 Community Mobility Reports
Language: Python - Size: 47.9 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

rearc-data/covid-19-chicago
Chicago COVID-19 Update Data | City of Chicago
Language: Shell - Size: 56.3 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

lost22git/apk_dl
Language: Nim - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

313801120/xiyueta
xiyueta.js库可以快速解析html字符串,遍历网页dom结构的JavaScript库。它通过与jQuery语法使用一致的 API 使 html文档遍历和处理更加简单。xiyueta.js库是先解析网页html文本再遍历html网页dom,xiyueta.js库可以在WEB浏览器里使用,也可以在ASP程序里使用,也可以在nodejs里使用
Language: HTML - Size: 2.25 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 2

shuxiaoyuan/SanQiuBooksSpider
练习使用 Python 内置的 urllib 和 HTMLParser 库爬取三秋书屋电子书的百度网盘链接和提取码
Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

Anikeshpatel/dompy
JavaScript Dom Api for Python, Html Parser and a Web scraping tool in python
Language: HTML - Size: 470 KB - Last synced at: 9 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

witjem/feedpls
Generate RSS feeds from websites
Language: Go - Size: 79.1 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 2

svenhsia/html2tree
A python script to build a tree structure from a raw HTML text.
Language: Python - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

genius257/AutoIt-HTML-Parser
Yet another HTML Parser written in AutoIt
Language: AutoIt - Size: 24.4 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 2

rscottlundgren/csci-e10b_Term-Project
My Term Project submission for CSCI-E10b - a desktop, online "Job Description" parser for descriptions with a `boards.greenhouse.io` host.
Language: Java - Size: 529 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

BunyaminKiremit/android_haber_app
Haberler mobil uygulaması
Language: Kotlin - Size: 320 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

orhanucr/NewsApp
Orhan Uçar Ödev 4
Language: Kotlin - Size: 104 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

FlightPriceAnalysis/Flight-Price-Analysis
A java-based Web Crawler application for data extraction from flight booking website and recommending the best flight to users by giving 90% better results.
Language: HTML - Size: 77.1 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Soumya117/finnazurenotebook
Language: Jupyter Notebook - Size: 30.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

BryceRussell/HTMLRx
Select and manipulate elements in a HTML string without an AST
Language: TypeScript - Size: 156 KB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

min050410/Python_crawling
파이썬 크롤링 실습
Language: Python - Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

wenqi73/mini-html-parser
Language: JavaScript - Size: 6.84 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

Nelson-Gon/pycite
Python Citations Generator
Language: Python - Size: 151 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 3

gigaturbo/ici-grenoble-caldav
Script pour intégrer les événements de www.ici-grenoble.org dans un calendrier CalDAV
Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mesutde/Country.List.Interest.Rate.Net6.API
interest rate country list Web Api
Language: C# - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

DanArmor/CF-parser-cross-platform
CodeForces parser, that works on Mac/Linux/Windows. Was created with Python
Language: Python - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

itsRajatkumar/PythonWebScraper
Language: Python - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Agenty/Agenty.TestData
This project contains the publc test data set to try and learn how to use cloud-based agents in Agenty.
Language: HTML - Size: 6.71 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

siddharth17196/Personalized-news-feed
Random stuff ++
Language: Python - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

chuksoo/bs4-web-scraper
This repository contains code on how to build a job website scraper with request and Beautiful Soup.
Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

yuriy-logosha/myutils
Utils for python based services.
Language: Python - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

goyaldhara/news_scrapping
A script written in python to get latest news headlines from CNBC
Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

LeeCenY/LipstickHTMLParser
使用 Ono 开源库解析 HTML 页面获取数据
Language: Objective-C - Size: 17.6 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

SagarGaniga/Scrape-Websites-with-Python
Well commented Python code to scrape websites with the help of Beautiful Soup to parse HTML
Language: Python - Size: 14.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

moovs/scrappy
Parsing html data and their further organize and analyze for better perception and understanding.
Language: Python - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

goodwinfame/portal-plus
Read html attributes to generates edit form
Language: JavaScript - Size: 283 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

dhruvp-8/Word-Detection
A Web Application which on input of sentence gives the info of POS Tagger of the different words
Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0
