An open API service providing repository metadata for many open source software ecosystems.

Topic: "html-parser"

ariary/JSextractor

Fastly gather all JavaScript from url (CLi+TUI)

Language: Go - Size: 6.76 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 2

youhan26/dom-parser

dom parser using regex support browser, node, react-native and so on

Language: JavaScript - Size: 60.5 KB - Last synced at: 26 days ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 3

duncan3dc/domparser

Wrappers for the PHP DomDocument class to provide extra functionality for html/xml parsing

Language: PHP - Size: 209 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 4

danny1113/html-parser-builder

A result builder that build HTML parser and transform HTML elements to strongly-typed result, inspired by RegexBuilder.

Language: Swift - Size: 77.1 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 9 - Forks: 0

rnantes/swift-html-parser

Parse plaintext HTML into an object and easily search it to find elements

Language: HTML - Size: 874 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

imelgrat/feed-finder

A PHP class for extracting the URLs of RSS (1.0 and 2.0) and ATOM feeds associated to a page, as well as OPML outline documents.

Language: PHP - Size: 646 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 3

beliantech/boilertext

Extract content from HTML by removing unwanted boilerplate text.

Language: Go - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 9 - Forks: 1

algunion/HTMLForge.jl

Flexible HTML parsing and manipulation in Julia Programming Language

Language: Julia - Size: 52.7 KB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 8 - Forks: 2

sunshineplan/node

HTML parsing library, the alternative to BeautifulSoup in Golang.

Language: Go - Size: 85.9 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 0

creeperyang/html-parser-lite

A light weight html parser and more.

Language: JavaScript - Size: 107 KB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 3

divyeshmakwana96/Pochta

Testing MJML/HTML emails simplified. A serverless cli application solution for multiple integrations.

Language: JavaScript - Size: 776 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

willianantunes/pyfriends

Let's research over all the seasons of Friends sitcom and try to get some insights from it 🕵

Language: HTML - Size: 35.7 MB - Last synced at: 5 days ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 0

victornpb/benchmark-html-parser-libraries

A Benchmark of javascript libraries for parsing HTML (CPU/RAM)

Language: HTML - Size: 4.47 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 1

genius257/AutoIt-HTML-Parser

Yet another HTML Parser written in AutoIt

Language: AutoIt - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 2

RohitAwate/DOMEngine

DOM manipulation engine written in C++.

Language: C++ - Size: 190 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 2

hean01/domx

HTML parser and DOM tree builder for rust

Language: Rust - Size: 14.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 6 - Forks: 0

TRowbotham/PHPDOM

A modern alternative to PHP's built in DOM classes.

Language: PHP - Size: 3.17 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 6 - Forks: 2

f34nk/elixir_html_tools

Overview of available html tools in Elixir

Language: HTML - Size: 4.36 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

imingyu/forgiving-xml-parser

An XML/HTML parser and serializer for JavaScript.

Language: TypeScript - Size: 16.3 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

akinemreyazici/read_news_app

This is a example project for HTLM Parser in Kotlin

Language: Kotlin - Size: 107 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

Bystroushaak/pyDHTMLParser 📦

Lightweight HTML/XML parser for quick and dirty web scraping.

Language: Python - Size: 252 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 3

Epicfisher/TouhouDiscordBot

A Work-In-Progress Discord bot based on the largely popular Touhou series by ZUN.

Language: Python - Size: 329 KB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

rsharifnasab/telegram_export_analyzer

this script can analyze number of telegram messages by time

Language: Python - Size: 17.6 KB - Last synced at: 17 days ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

raymccrae/swift-htmlsaxparser

Swift wrapper around libxml2 HTML Parser to provide SAX style HTML Parsing

Language: Swift - Size: 101 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 2

PopFlamingo/MyHTML

A Swift wrapper for MyHTML, a fast, pure-C, HTML 5 parsing library

Language: Swift - Size: 48.8 KB - Last synced at: 28 days ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 0

emmanuelroecker/php-simply-html

Add, delete, modify, get html tags, text, links by using css selector

Language: PHP - Size: 53.7 KB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 6

luxcem/apifier

Apifier is a very simple HTML parser written in Python based on CSS selectors

Language: HTML - Size: 76.2 KB - Last synced at: 10 months ago - Pushed at: over 8 years ago - Stars: 6 - Forks: 1

esign-consulting/postdenuncia

Projeto de software para cidadãos denunciarem problemas urbanos.

Language: Java - Size: 3.18 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 5 - Forks: 0

Project-OtaWilma/WilmaAPI

Provides access to Wilma's public-api without need for an api-key. Mainly used for the functionality of OtaWilma web-client. Moderately stable, altough provides support for only one school. Not maintained to be production ready

Language: JavaScript - Size: 11 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

lrswss/akal2ical 📦

Perl-Skript um Abfuhrtermine des AfA Karlsruhe für den angegebenen Straßenzug abzurufen und als iCal-Datei zu speichern

Language: Perl - Size: 25.4 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

kevinhermawan/markup2json

A library for converting HTML and XML into JSON

Language: TypeScript - Size: 384 KB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1

ubbeg2000/pars

a simple package for parsing html files into dom trees

Language: Go - Size: 27.3 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 2

iamlockelightning/BaiduProcess

Extract Baidu Baike Pages from HTML

Language: Java - Size: 44.3 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

ocramz/twelve

Like @11ty , but this goes up to 12

Language: Haskell - Size: 70.3 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 1

jedmitten/humble_catalog

A script to parse the saved Humble Bundle library HTML

Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 2

vitkarpov/fast-xml-parser

🚀 Is a fast XML parser in TypeScript with zero dependencies

Language: TypeScript - Size: 103 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

askonomm/dompa

A zero-dependency HTML5 document parser.

Language: Rust - Size: 147 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

Solrikk/DataDigger

DataDigger is a powerful and intuitive web application designed to extract and analyze data from web pages.

Language: Go - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

markuspoerschke/extractum

Extractum is a PHP library that extracts information from web pages.

Language: PHP - Size: 1.15 MB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 4 - Forks: 1

roblillack/gockl

Minimal XML processor for Go that does not to fuck with your markup.

Language: Go - Size: 33.2 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 1

nix1707/WebScrapper-BrowserExtension

Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.

Language: TypeScript - Size: 70.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

1623311678/html-parser-server

将html解析成json,服务端,前端都可以用

Language: JavaScript - Size: 24.4 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

tbjgolden/deno-htmlparser2

Deno port of `htmlparser2`

Language: TypeScript - Size: 79.1 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 3

cwsky0221/CWRichTextView

Android textview显示富文本,支持自定义html标签

Language: Java - Size: 126 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

talhashraf/parsel

Parsel is a Java library for parsing HTML and XML to extract data using XPath selector.

Language: Java - Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 2

romagny13/html-parser

TypeScript/JavaScript HTML Parser

Language: TypeScript - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: over 8 years ago - Stars: 4 - Forks: 2

karambir/ugc-colleges

Python Script to extract college names from UGC, India website.

Language: Python - Size: 2.18 MB - Last synced at: 3 days ago - Pushed at: almost 13 years ago - Stars: 4 - Forks: 1

jgarber623/micromicro 📦

A Ruby gem for extracting microformats2-encoded data from HTML documents.

Language: Ruby - Size: 359 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 3 - Forks: 2

vborovikov/readability

A C# version of the readability lib

Language: C# - Size: 7.13 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

hashim21223445/ai-chatbot Fork of vercel/ai-chatbot

A full-featured, hackable Next.js AI chatbot built by Vercel

Language: TypeScript - Size: 2.31 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

thomasthiebaud/htmlstring-to-react

Convert a string containing html tags to an array of React elements. Light and secure

Language: TypeScript - Size: 1.19 MB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 4

staticka/staticka

Yet-another static site generator in PHP.

Language: PHP - Size: 217 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

graetz23/xmlcc

an ANSI C++ XML library keeping SAX interface and XML / DOM tree

Language: C++ - Size: 418 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 2

rohanasan/rohanasantml

Rohanasantml an easy alternative to html!

Language: Rust - Size: 29.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

silenzzz/RuTracker4j

RuTracker java library

Language: Java - Size: 81.1 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

wellloy1/strops-js

Provides simple and the most useful methods to string operations in JavaScript / Node.js

Language: JavaScript - Size: 97.7 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

glebliutsko/PriceParser

Парсер цен с сайтов

Language: C# - Size: 39.1 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

coderosh/docpa

A simple library that I use for web scraping. Uses htmlparser2 to parse dom.

Language: TypeScript - Size: 69.3 KB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

AntoData/on_page_basic_SEO_checker

This project provides methods and utils to make basic checks in the SEO of an instance of a page using the URL of this page or a webdriver instance that is browsing that page at the moment

Language: Python - Size: 7.87 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

Anikeshpatel/dompy

JavaScript Dom Api for Python, Html Parser and a Web scraping tool in python

Language: HTML - Size: 470 KB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

1bk/gdelt_headline_analysis

News Headlines Analysis of (two) Websites - Using GDELT 2.0 Event Database

Language: HTML - Size: 715 KB - Last synced at: 4 months ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 2

majdsalloum/mate

Maté Browser

Language: Java - Size: 3.72 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

betrixed/dlang-xml

Xml & Html parser for the D programming language (2019 - D2)

Language: D - Size: 15.2 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

xnerhu/HTML-Parser

:page_facing_up: A HTML parser prototype

Language: C# - Size: 82 KB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

luanmuniz/shorio

Dom Manipulation for Node.JS with an jQuery like API

Language: JavaScript - Size: 43.9 KB - Last synced at: 24 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

iamareebjamal/get_results

Python Script to download results of whole class/branch by providing attendance Excel file.

Language: Python - Size: 346 KB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 4

jiangxianli/SimpleHtml

一款类似于Jquery语法的HTML DOM解析PHP扩展包

Language: PHP - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 0

ghuls-apps/github-calendar-api

An HTML parser to get data about the GitHub Profile Contributions Calendar

Language: Ruby - Size: 16.6 KB - Last synced at: 1 day ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 1

omar2535/BioLife-AU-01-attendance-parser

Biolife-AU-01 打卡鐘解析程序

Language: Python - Size: 23.4 KB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

firas-codes1/spwder

A simple HTML form password bruteforcing tool written in python.

Language: Python - Size: 25.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

zavzyatiy/drom_archive_parser

A full automatic pipeline for parsing data from auto.drom.ru/archive

Language: Jupyter Notebook - Size: 2.44 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

luminati-io/curl_cffi-web-scraping

Use curl_cffi in Python to mimic browser TLS fingerprints for reliable and stealthy web scraping.

Size: 1.17 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

KTBsomen/httl-s

html but templating language, hyper text templating language

Language: JavaScript - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

dinaraparanid/Star-Wars-Travel-KMP

Traveling App in Star Wars Universe

Language: Kotlin - Size: 27.3 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

jaijs/jai-body-parser

Simple and fast Node.js module for parsing Http request body. Part of Jai.js ecosystem. Built without any third part dependency.

Language: JavaScript - Size: 55.7 KB - Last synced at: 9 days ago - Pushed at: 11 months ago - Stars: 2 - Forks: 1

tleino/purehtml

Reusable purely HTML-only parser that can be used for developing crawlers, converters or simple clients for consuming HTML.

Language: C - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ShiftHackZ/Joy-Reactor-Android

Android client app for website joyreactor.cc

Language: Kotlin - Size: 423 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

cacing69/cquery

Cquery is an acronym for Crawl Query, its a PHP Scraper with language expression, could be used to scrape data from a website that uses javascript or ajax

Language: PHP - Size: 662 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 3

dinaraparanid/genius_track_number_in_album

Fetches track's number in album using its Genius URL

Language: Rust - Size: 26.4 KB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

dinaraparanid/genius_lyrics

Fetches lyrics of song from genius by its URL

Language: Rust - Size: 20.5 KB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

MichaelE919/ncaa-stats-webscraper

Python webscraping module for NCAA Basketball Stats

Language: Python - Size: 352 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 6

MatheusTGP/HTML-Collector-Tkinter

Converta uma página na Web para um arquivo em HTML, o software vem com uma interface gráfica básica produzida Totalmente em Python Tkinter.

Size: 18.6 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

yogendratamang48/parse_utils

Easy html/json parser for webscraping

Language: Python - Size: 22.5 KB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

jy0529/simple-html-parser

a simple html parser by Typescript

Language: TypeScript - Size: 195 KB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

bredbrains/tthk-app 📦

Tallinna Tööstushariduskeskus unofficial mobile application

Language: C# - Size: 2.04 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

enveezee/urearl

U R Earl is an abstraction of python standard libraries for extracting and returning stuff from URLs

Language: Python - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

bersoy12/UniversityStudentAnalysis

Data Mining and Analysis

Language: PHP - Size: 7.8 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

gurkankaymak/gosoup

Lightweight go library for pulling data out of HTML, inspired by BeautifulSoup and Jsoup

Language: Go - Size: 7.81 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

AntoData/WebScraperAllMusic

Simple example of a web scrapper using python. In this case, we ask the user using the console for the name of a band/artist and using selenium webdriver and beautifulsoup we print information about the discography of that artist/band

Language: Python - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

timolinn/nginB

[WIP] This is a hobbyist browser engine written in Go

Language: Go - Size: 6.84 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

millerlogic/htmlstrip

Strips HTML from the input, outputs plain text, streamed in realtime without preloading the whole document

Language: Go - Size: 6.84 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

linkonoid/objhtml

Library for generating and outputs render html-elements on native Golang with application of object model.

Language: Go - Size: 3.38 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

sentialx/html-parser

A HTML parser prototype written in C# and .NET Core

Language: C# - Size: 25.4 KB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 1

codexguy/xSkrape.APIWrapper.REST

Data extraction, identification, and shaping functionality. This repository is for the REST-based interfaces to xSkrape.com, which offers these parsing services in a hosted environment.

Language: C# - Size: 84 KB - Last synced at: 22 days ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 2

akiya64/AmazonScrapingExcelFile

Scraping Amazon Catalog in Excel Book

Language: C# - Size: 71.3 KB - Last synced at: 4 months ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 1

azazar/uncaring-html-parser

HTML parser that intend to be fast, but wasn't benchmarked or optimized yet

Language: Java - Size: 84 KB - Last synced at: 3 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

Couchtr26/Bookmark_Cleaner

Bookmark_Cleaner is a Python tool designed to automatically scan, organize, and clean your browser bookmarks. It detects and removes duplicates, dead links, and empty folders, making your bookmark collection tidy and efficient. Ideal for anyone who wants to maintain a well-organized set of online resources.

Language: JavaScript - Size: 3.91 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

vborovikov/brackets

Resilient markup parser library

Language: C# - Size: 1.07 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

flaviu22/domtree

C++ html parser

Language: C++ - Size: 6.68 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

DEmodoriGatsuO/minisoup-html-parser

A lightweight HTML parsing library inspired by Beautiful Soup, providing capabilities for HTML element analysis and extraction

Language: C# - Size: 69.3 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0