An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: html-parsing

azazar/uncaring-html-parser

HTML parser that intend to be fast, but wasn't benchmarked or optimized yet

Language: Java - Size: 81.1 KB - Last synced at: about 20 hours ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

1sand0s/Mouser2MD

JSoup based HTML parser for parsing orders from Mouser.com into BOM for use in Markdown

Language: Java - Size: 5.86 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

AshishYUO/html_parser

A sample C++ program to parse HTML files.

Language: C++ - Size: 11.7 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

kvpz/Parts-a-la-carte

Extract information from HTML documents using C. Applied to a website supplying television parts.

Language: C - Size: 89.8 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

whimtrip/jwht-scrapper

Fully Featured Java Scrapping Framework, highly pluggable and customizable

Language: Java - Size: 213 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 4

AntoData/WebScraperAllMusic

Simple example of a web scrapper using python. In this case, we ask the user using the console for the name of a band/artist and using selenium webdriver and beautifulsoup we print information about the discography of that artist/band

Language: Python - Size: 16.6 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

eivankin/gachiparser

Script for extracting data from site "dop.edu.ru"

Language: Python - Size: 44.9 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

sichkar-valentyn/Processing_html_files_in_Python

Examples on how to process html files in Python

Language: Python - Size: 6.84 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

reevesba/tana2tree

This packages parses a Tanagra html description into usable formats.

Language: Python - Size: 135 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

SAINIAbhishek/bankist-website

A website that demonstrate the Events and DOM manipulation using the JavaScript. It's a course project. 'The Complete JavaScript Course 2020: From Zero to Expert!' on the udemy. It provided me an opportunity to learn different concepts related to the events and dom like scrolling effects, lazy loading images building a slider and many more. It is only for learning process.

Language: HTML - Size: 7.64 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

burzum/custom-html-parser

Turns your custom HTML tags into valid HTML

Language: PHP - Size: 24.4 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

InsonusK/dns-shop-data-grabber

grab data from dns shop

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

joshADE/Timetable-Transfer

Helps to transfer school timetable in a form that can be put in google calendar

Language: C# - Size: 3.14 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

tamboto2000/htmltojson

HTMLtoJSON is a HTML parser, based on net/html package. This package actually just to simplify HTML parsing. If you need more complex HTML processing, please use net/html as its offer more features. The package name is actually is not really fitting for this package purpose, but I use this package for my scraper engines, so I don't really want to bother with changing the package name...

Language: Go - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

lalabuy948/webprev

Golang package provides easy extraction of website previews. Generic, Facebook and Twitter cards.

Language: Go - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

PhilipSteinkellner/Charts_to_Spotify_Playlist

Fetches the current charts from the Welle 1 website and creates or updates a Spotify playlist.

Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mtsgi/writedown

writing html will be more fun!

Language: JavaScript - Size: 50.8 KB - Last synced at: 20 days ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

Tsovak/AndroidHtmlAgilityPack

Fix more problem with Android and building dll

Language: C# - Size: 320 KB - Last synced at: 2 months ago - Pushed at: over 10 years ago - Stars: 1 - Forks: 1

Develop-Packt/Web-Scraping-with-Jupyter-Notebooks

Analyze and parse HTML responses, programmatically scrape web data, and utilize Pandas DataFrames to store, transform, and merge tables.

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2

jansanz/Rgdemo

Example on parsing HTML on iOS.

Language: Objective-C - Size: 122 KB - Last synced at: 3 months ago - Pushed at: over 12 years ago - Stars: 1 - Forks: 1

rgladwell/microtesia

Simple microdata parsing library for Scala.

Language: Scala - Size: 1.3 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

OneFrameLink/Ofl.Html

Utilities for working with HTML content.

Language: C# - Size: 15.6 KB - Last synced at: 11 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

OpenBookPublishers/geturls

Extact all URLs from anchor and image tags within a html/xhtml page and its children.

Language: Shell - Size: 3.91 KB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 2

pubblic/htmlquery

Language: Go - Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

IonutCiuta/edu-crawler

Crawler for bacalaureat.edu.ro for 2018 results. HTML parsing & caching, content stored in MongoDB. Built with Java, SpringBoot and Jsoup.

Language: Java - Size: 54.7 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

emmanuelroecker/php-simply-html

Add, delete, modify, get html tags, text, links by using css selector

Language: PHP - Size: 53.7 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 6

mnjl1/webdataparsing

test task

Language: Java - Size: 16.6 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

gusenov/excel-functions

Функции Excel.

Language: Shell - Size: 527 KB - Last synced at: 16 days ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

bradmontgomery/django-janitor

django-janitor allows you to use bleach to clean HTML stored in a Model's field.

Language: Python - Size: 514 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 3

varocarbas/FeedAny

Perl application which creates (RSS) web feeds from random webpages via HTML parsing.

Language: Perl - Size: 80.1 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

mazid1/InterviewTest

Experimental app for testing json, xml and html parsing, custom view and write in external storage device.

Language: Java - Size: 443 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

naens/am-scripts

naens anime and manga scripts

Language: Common Lisp - Size: 16.6 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

ednunezg/CSOM_Printer_Notifier

Desktop Java App developed during my sophomore year while working for Carlson School of Management IT. This app parses HTML text form printer servers, interprets whether printers have hit critical levels, and notifies the user via email or text

Language: Java - Size: 801 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 1