An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: htmlparser

lokesh144/HTMLer

A Minimal HTML Parser and Renderer written in CPP.

Language: C - Size: 9.29 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

Erkmik/best-python-html-parsers

The top Python HTML parsers for web scraping, including Beautiful Soup, lxml, PyQuery, Scrapy, and more.

Size: 6.84 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

cheeriojs/cheerio

The fast, flexible, and elegant library for parsing and manipulating HTML and XML.

Language: TypeScript - Size: 17.8 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 29,336 - Forks: 1,669

KiraLT/isomorphic-htmlparser

HTML parser that works both in JavaScript and NodeJS with TypeScript support

Language: TypeScript - Size: 180 KB - Last synced at: 7 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 1

chatnoir-eu/chatnoir-resiliparse

A robust web archive analytics toolkit

Language: Cython - Size: 1.86 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 101 - Forks: 15

ajordan2984/HtmlToText

A compact library written in C# to parse out all the text from news articles.

Language: C# - Size: 8.79 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

shuerguo999/gogoAST

The simplest tool to parse/transform/generate code on ast

Language: JavaScript - Size: 473 KB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 1

willforde/python-htmlement Fork of marmelo/python-htmlparser

Pure-Python HTML parser with ElementTree XPath support.

Language: Python - Size: 908 KB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 4

okwilkins/Web-Crawler

This program will crawl through entire domains, exporting every link it can find into a txt file.

Language: Python - Size: 232 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

liulinboyi/HTMLParser

HTMLParser 解析HTML 欢迎参考 HTMLParser Parsing HTML Welcome to the reference

Language: TypeScript - Size: 823 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 14 - Forks: 1

sukhcha-in/dart_web_scraper

Config-based, reusable web scraper for web and API scraping. Scrape multiple pages or APIs without writing parsers or scraping logic, using simple configurations for efficient scraping.

Language: Dart - Size: 86.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 7 - Forks: 1

LoboEvolution/CobraEvolution

CobraEvolution HTML render and parser

Language: Java - Size: 3.85 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 3

Rct567/DomQuery

PHP library for easy 'jQuery like' DOM traversing and manipulation.

Language: PHP - Size: 297 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 134 - Forks: 40

umijs/niddle

A super fast nodejs addon for html parsing and manipulation written in rust.

Language: JavaScript - Size: 2.73 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

dipiro/HackerNewsHeadlines

iOS App for Hacker News

Language: Swift - Size: 375 KB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

CorlyDream/html2obj

纯 js 解析 html,html 转 对象,对象转html,抽取文本

Language: HTML - Size: 647 KB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

Aghajari/JSSoup

JSSoup: the JavaScript HTML DOM parser for node.js

Language: JavaScript - Size: 77.1 KB - Last synced at: about 15 hours ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 2

EIGHTFINITE/cheerio

📦 Cheerio drop in replacement. Always mirrors the latest version. Patched to use the Lodash isArrayLike function instead of the flawed implementation in the Cheerio source. — `npm install cheerio@github:EIGHTFINITE/cheerio#main` — https://github.com/cheeriojs/cheerio

Language: TypeScript - Size: 261 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

marc-ed-raffalli/confluence-to-jekyll

Confluence (HTML) to Jekyll (Markdown) converter script in JS to facilitate IBM Loopback documentation migration

Language: JavaScript - Size: 13.7 KB - Last synced at: 20 days ago - Pushed at: over 8 years ago - Stars: 10 - Forks: 2

ayush-129/BLOG-PLATFORM

A user friendly blogging platform where user can read others' posts and post/edit/delete own posts.

Language: JavaScript - Size: 140 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

wopehq/muninn

Muninn is a fast and flexible HTML parsing tool that simplifies the process of extracting data from HTMLs.

Language: TypeScript - Size: 946 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 127 - Forks: 4

shobhit45/Blog-Sphere

Blog Sphere is your ultimate destination for insightful and engaging thoughts to share

Language: JavaScript - Size: 68.4 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

anarmhr/autodidacts

Software Engineering course.

Language: Python - Size: 257 KB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

timolinn/html

[WIP] HTML Parser written in Go

Language: Go - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

rearc-data/who-covid-19-cases-deaths

Coronavirus (COVID-19) Cases and Deaths | World Health Organization (WHO)

Language: Shell - Size: 39.1 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

rearc-data/nyt-states-reopen-status-covid-19

COVID-19 United States Reopen and Shut Down Status by State | NY Times

Language: Python - Size: 89.8 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

rearc-data/google-covid-19-community-mobility-reports

Google COVID-19 Community Mobility Reports

Language: Python - Size: 47.9 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

rearc-data/covid-19-chicago

Chicago COVID-19 Update Data | City of Chicago

Language: Shell - Size: 56.3 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

lost22git/apk_dl

Language: Nim - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

313801120/xiyueta

xiyueta.js库可以快速解析html字符串,遍历网页dom结构的JavaScript库。它通过与jQuery语法使用一致的 API 使 html文档遍历和处理更加简单。xiyueta.js库是先解析网页html文本再遍历html网页dom,xiyueta.js库可以在WEB浏览器里使用,也可以在ASP程序里使用,也可以在nodejs里使用

Language: HTML - Size: 2.25 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 2

shuxiaoyuan/SanQiuBooksSpider

练习使用 Python 内置的 urllib 和 HTMLParser 库爬取三秋书屋电子书的百度网盘链接和提取码

Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

Anikeshpatel/dompy

JavaScript Dom Api for Python, Html Parser and a Web scraping tool in python

Language: HTML - Size: 470 KB - Last synced at: 9 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

witjem/feedpls

Generate RSS feeds from websites

Language: Go - Size: 79.1 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 2

svenhsia/html2tree

A python script to build a tree structure from a raw HTML text.

Language: Python - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

genius257/AutoIt-HTML-Parser

Yet another HTML Parser written in AutoIt

Language: AutoIt - Size: 24.4 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 2

rscottlundgren/csci-e10b_Term-Project

My Term Project submission for CSCI-E10b - a desktop, online "Job Description" parser for descriptions with a `boards.greenhouse.io` host.

Language: Java - Size: 529 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

BunyaminKiremit/android_haber_app

Haberler mobil uygulaması

Language: Kotlin - Size: 320 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

orhanucr/NewsApp

Orhan Uçar Ödev 4

Language: Kotlin - Size: 104 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

FlightPriceAnalysis/Flight-Price-Analysis

A java-based Web Crawler application for data extraction from flight booking website and recommending the best flight to users by giving 90% better results.

Language: HTML - Size: 77.1 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Soumya117/finnazurenotebook

Language: Jupyter Notebook - Size: 30.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

BryceRussell/HTMLRx

Select and manipulate elements in a HTML string without an AST

Language: TypeScript - Size: 156 KB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

min050410/Python_crawling

파이썬 크롤링 실습

Language: Python - Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

wenqi73/mini-html-parser

Language: JavaScript - Size: 6.84 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

Nelson-Gon/pycite

Python Citations Generator

Language: Python - Size: 151 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 3

gigaturbo/ici-grenoble-caldav

Script pour intégrer les événements de www.ici-grenoble.org dans un calendrier CalDAV

Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mesutde/Country.List.Interest.Rate.Net6.API

interest rate country list Web Api

Language: C# - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

DanArmor/CF-parser-cross-platform

CodeForces parser, that works on Mac/Linux/Windows. Was created with Python

Language: Python - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

itsRajatkumar/PythonWebScraper

Language: Python - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Agenty/Agenty.TestData

This project contains the publc test data set to try and learn how to use cloud-based agents in Agenty.

Language: HTML - Size: 6.71 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

siddharth17196/Personalized-news-feed

Random stuff ++

Language: Python - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

chuksoo/bs4-web-scraper

This repository contains code on how to build a job website scraper with request and Beautiful Soup.

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

yuriy-logosha/myutils

Utils for python based services.

Language: Python - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

goyaldhara/news_scrapping

A script written in python to get latest news headlines from CNBC

Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

LeeCenY/LipstickHTMLParser

使用 Ono 开源库解析 HTML 页面获取数据

Language: Objective-C - Size: 17.6 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

SagarGaniga/Scrape-Websites-with-Python

Well commented Python code to scrape websites with the help of Beautiful Soup to parse HTML

Language: Python - Size: 14.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

moovs/scrappy

Parsing html data and their further organize and analyze for better perception and understanding.

Language: Python - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

goodwinfame/portal-plus

Read html attributes to generates edit form

Language: JavaScript - Size: 283 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

dhruvp-8/Word-Detection

A Web Application which on input of sentence gives the info of POS Tagger of the different words

Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0