An open API service providing repository metadata for many open source software ecosystems.

Topic: "parsing"

FranxYao/Deep-Generative-Models-for-Natural-Language-Processing

DGMs for NLP. A roadmap.

Size: 32 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 390 - Forks: 32

rep-movsd/see-phit

A C++ HTML template engine that uses compile time HTML parsing

Language: C++ - Size: 133 KB - Last synced at: 27 days ago - Pushed at: about 6 years ago - Stars: 390 - Forks: 18

pointfreeco/swift-url-routing

A bidirectional router with more type safety and less fuss.

Language: Swift - Size: 6.83 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 372 - Forks: 30

viraptor/phply

PHP parser written in Python using PLY

Language: Python - Size: 265 KB - Last synced at: 22 days ago - Pushed at: about 2 years ago - Stars: 369 - Forks: 101

stalwartlabs/mail-parser

Fast and robust e-mail parsing library for Rust

Language: Rust - Size: 2.21 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 363 - Forks: 45

tpolecat/atto

friendly little parsers

Language: Scala - Size: 2.38 MB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 359 - Forks: 58

macmade/ClangKit

ClangKit provides an Objective-C frontend to LibClang. Source tokenization, diagnostics and fix-its are actually implemented.

Language: C - Size: 15.2 MB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 359 - Forks: 46

martijnversluis/ChordSheetJS

A JavaScript library for parsing and formatting chords and chord sheets

Language: TypeScript - Size: 6.58 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 356 - Forks: 53

rflechner/ScrapySharp

Rebirth of https://bitbucket.org/rflechner/scrapysharp

Language: C# - Size: 751 KB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 352 - Forks: 76

vlasovskikh/funcparserlib

Recursive descent parsing library for Python based on functional combinators

Language: Python - Size: 2.08 MB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 351 - Forks: 38

juntaoy/biaffine-ner

Named Entity Recognition as Dependency Parsing

Language: Python - Size: 52.7 KB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 348 - Forks: 39

analang/como-lang-ng

como-lang-ng is now ana-lang, located at https://github.com/analang/ana

Language: C - Size: 317 KB - Last synced at: about 22 hours ago - Pushed at: over 5 years ago - Stars: 342 - Forks: 36

nilproject/NiL.JS

JavaScript engine for .NET written in C#.

Language: JavaScript - Size: 71.5 MB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 339 - Forks: 47

jstockwin/py-pdf-parser

A Python tool to help extract information from structured PDFs.

Language: Python - Size: 1.28 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 335 - Forks: 41

kataw/kataw

A 100% spec-compliant ES2022 JavaScript toolchain

Language: TypeScript - Size: 224 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 334 - Forks: 6

Alexhuszagh/rust-lexical

Fast numeric to- and from-string conversion routines.

Language: Rust - Size: 124 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 326 - Forks: 40

jalajthanaki/NLPython

This repository contains code for natural language processing in Python. All of the code relates to my book, "Python Natural Language Processing".

Language: Jupyter Notebook - Size: 131 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 322 - Forks: 207

saferwall/pe

A ⚡ lightweight Go package to parse, analyze and extract metadata from Portable Executable (PE) binaries. Designed for malware analysis tasks and robust against PE malformations.

Language: Go - Size: 28.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 321 - Forks: 48

sirkon/ldetool

Code generator for fast log file parsers

Language: Go - Size: 823 KB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 319 - Forks: 22

SimGus/Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

Language: Python - Size: 16.1 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 319 - Forks: 56

bodoni/svg

Composer and parser for SVG

Language: Rust - Size: 330 KB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 316 - Forks: 45

rgrove/parse-xml

A fast, safe, compliant XML parser for Node.js and browsers.

Language: JavaScript - Size: 1.57 MB - Last synced at: 20 days ago - Pushed at: 6 months ago - Stars: 307 - Forks: 16

5j9/wikitextparser

A Python library to parse MediaWiki WikiText

Language: Python - Size: 1.72 MB - Last synced at: 20 days ago - Pushed at: 7 months ago - Stars: 305 - Forks: 22

pyfisch/cbor 📦

CBOR support for serde.

Language: Rust - Size: 343 KB - Last synced at: about 20 hours ago - Pushed at: almost 3 years ago - Stars: 304 - Forks: 100

Duff89/parser_avito

Finds items on avito.ru by keyword

Language: Python - Size: 88.9 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 301 - Forks: 95

tree-sitter/swift-tree-sitter

Swift API for the tree-sitter incremental parsing system

Language: Swift - Size: 411 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 299 - Forks: 34

nHapiNET/nHapi

nHapi is the .NET port of the original Java project HAPI.

Language: C# - Size: 53.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 295 - Forks: 163

GregRos/parjs

JavaScript parser-combinator library

Language: TypeScript - Size: 6.55 MB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 295 - Forks: 20

emacs-tree-sitter/tree-sitter-langs

Language bundle for Emacs's tree-sitter package

Language: Emacs Lisp - Size: 1.34 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 292 - Forks: 136

satya-das/cppparser

A library to parse C/C++ source as AST

Language: C++ - Size: 19.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 291 - Forks: 38

mcmtroffaes/inipp

Simple C++ ini parser.

Language: C++ - Size: 101 KB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 290 - Forks: 57

RReverser/serde-xml-rs

xml-rs based deserializer for Serde (compatible with 1.0+)

Language: Rust - Size: 253 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 289 - Forks: 97

likexian/whois-parser

WHOIS parser for domain WHOIS information in Go (Golang).

Language: Go - Size: 1.04 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 289 - Forks: 90

StegSchreck/RatS

Movie Ratings Synchronization with Python

Language: HTML - Size: 4.25 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 287 - Forks: 31

ml-archive/Serpent 📦

A protocol to serialize Swift structs and classes for encoding and decoding.

Language: Swift - Size: 3.16 MB - Last synced at: 19 days ago - Pushed at: about 3 years ago - Stars: 286 - Forks: 13

indy-singh/StringsAreEvil

Reducing memory allocations from 7.5GB to 32KB

Language: C# - Size: 404 KB - Last synced at: 6 months ago - Pushed at: almost 5 years ago - Stars: 280 - Forks: 43

vincentneo/CoreGPX

A library for parsing and creation of GPX location files. Purely Swift.

Language: Swift - Size: 2.55 MB - Last synced at: 26 days ago - Pushed at: 7 months ago - Stars: 273 - Forks: 57

decorators-squad/eo-yaml

YAML for Java 8 and above. A user-friendly OOP library. Previously known as "Camel".

Language: Java - Size: 1.8 MB - Last synced at: 26 days ago - Pushed at: 6 months ago - Stars: 268 - Forks: 53

kstenerud/concise-encoding

The secure data format for a modern world

Language: ANTLR - Size: 3.22 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 268 - Forks: 4

jgontrum/spacy-api-docker

spaCy REST API, wrapped in a Docker container.

Language: Python - Size: 356 KB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 267 - Forks: 99

textX/Arpeggio

Parser interpreter based on PEG grammars written in Python http://textx.github.io/Arpeggio/

Language: Python - Size: 8.97 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 264 - Forks: 55

zeek/spicy

C++ parser generator for dissecting protocols & files.

Language: C++ - Size: 14.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 261 - Forks: 41

metasoarous/semantic-csv

Higher level tools for working with CSV data and files

Language: Clojure - Size: 419 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 257 - Forks: 22

d0c-s4vage/gramfuzz

gramfuzz is a grammar-based fuzzer that lets one define complex grammars to generate text and binary data formats.

Language: Python - Size: 6.19 MB - Last synced at: 28 days ago - Pushed at: over 4 years ago - Stars: 257 - Forks: 39

mike01/pypacker

📦 The fastest and simplest packet manipulation lib for Python

Language: Python - Size: 3.37 MB - Last synced at: 2 days ago - Pushed at: about 5 years ago - Stars: 256 - Forks: 43

propensive/contextual

Statically-checked string interpolation in Scala

Language: Scala - Size: 4.66 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 251 - Forks: 23

Kronuz/esprima-python

ECMAScript parsing infrastructure for multipurpose analysis

Language: Python - Size: 7.24 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 246 - Forks: 47

ThaDafinser/UserAgentParser

UserAgent parsing done right

Language: PHP - Size: 4.75 MB - Last synced at: about 22 hours ago - Pushed at: over 4 years ago - Stars: 245 - Forks: 47

tokay-lang/tokay

Tokay is a programming language designed for ad-hoc parsing, inspired by awk.

Language: Rust - Size: 1.64 MB - Last synced at: 8 days ago - Pushed at: 11 days ago - Stars: 242 - Forks: 7

fox-it/dissect.cstruct_legacy

A no-nonsense C-like structure parsing library for Python

Language: Python - Size: 38.1 KB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 239 - Forks: 25

iabudiab/HTMLKit

An Objective-C framework for your everyday HTML needs.

Language: HTML - Size: 2.45 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 236 - Forks: 26

iuccio/csvToJson

Convert CSV file to JSON

Language: JavaScript - Size: 529 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 231 - Forks: 43

felixSchl/neodoc 📦

Beautiful, hand-crafted commandline interfaces for node.js

Language: PureScript - Size: 2.47 MB - Last synced at: 7 days ago - Pushed at: about 4 years ago - Stars: 229 - Forks: 9

howardroark/pollinate

Template your base files and generate new projects from Git(Hub).

Language: JavaScript - Size: 1.68 MB - Last synced at: 10 days ago - Pushed at: over 5 years ago - Stars: 228 - Forks: 14

BLLIP/bllip-parser Fork of dmcc/bllip-parser

BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser). See http://pypi.python.org/pypi/bllipparser/ for the Python module.

Language: GAP - Size: 47.8 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 227 - Forks: 53

redco/goose-parser

Universal scraping tool, which allows you to extract data using multiple environments

Language: JavaScript - Size: 933 KB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 227 - Forks: 13

parsify-dev/desktop

Next-generation notepad-based calculator, built with extensibility and privacy in mind.

Size: 2.51 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 226 - Forks: 3

mdaines/grammophone

A tool for analyzing and transforming context-free grammars.

Language: JavaScript - Size: 2.02 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 218 - Forks: 24

xpl/stacktracey

Parses call stacks. Reads sources. Clean & filtered output. Sourcemaps. Node & browsers.

Language: JavaScript - Size: 813 KB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 218 - Forks: 22

Captainarash/CaptCC

A tiny C compiler written purely in JavaScript.

Language: JavaScript - Size: 106 KB - Last synced at: 26 days ago - Pushed at: almost 6 years ago - Stars: 213 - Forks: 35

joanllenas/ts.data.json Fork of aische/JsonDecoder

A lightweight JSON decoding library for TypeScript

Language: TypeScript - Size: 677 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 209 - Forks: 16

spdx/tools-python

A Python library to parse, validate and create SPDX documents.

Language: Python - Size: 3.39 MB - Last synced at: 8 days ago - Pushed at: 17 days ago - Stars: 208 - Forks: 138

yakaz/yamerl

YAML 1.2 and JSON parser in pure Erlang

Language: Erlang - Size: 827 KB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 208 - Forks: 53

ehmicky/safe-json-value

⛑️ JSON serialization should never fail

Language: JavaScript - Size: 5.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 206 - Forks: 6

jimmiebergmann/mini-yaml

Single header YAML 1.0 C++11 serializer/deserializer.

Language: C++ - Size: 896 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 204 - Forks: 38

briansmith/untrusted

Safe, fast, zero-panic, zero-crashing, zero-allocation parsing of untrusted inputs in Rust.

Language: Rust - Size: 102 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 201 - Forks: 24

aappleby/matcheroni

A minimalist single-header library for building pattern-matchers, lexers, and parsers.

Language: C++ - Size: 7.31 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 200 - Forks: 5

bykof/gostradamus

Gostradamus: Better DateTimes for Go 🕰️

Language: Go - Size: 348 KB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 200 - Forks: 6

netheril96/StaticJSON

Fast, direct and statically typed parsing of JSON with C++

Language: C++ - Size: 702 KB - Last synced at: about 16 hours ago - Pushed at: over 2 years ago - Stars: 200 - Forks: 35

bitwalker/combine

A parser combinator library for Elixir projects

Language: Elixir - Size: 243 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 199 - Forks: 19

rcdilorenzo/filtrex

A library for performing and validating complex filters from a client (e.g. smart filters)

Language: Elixir - Size: 234 KB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 198 - Forks: 28

colis-anr/morbig

A static parser for POSIX Shell

Language: OCaml - Size: 1.01 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 194 - Forks: 8

Cargill/OpenSIEM-Logstash-Parsing

SIEM Logstash parsing for more than a hundred technologies

Language: Python - Size: 4.18 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 184 - Forks: 40

mck89/peast

JavaScript parser written in PHP that generates AST from your code according to ECMAScript specification

Language: PHP - Size: 1.72 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 182 - Forks: 20

simongray/clojure-dsl-resources

A curated list of Clojure resources for dealing with domain-specific languages.

Size: 109 KB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 182 - Forks: 5

maxime1992/pizza-sync

Pizza-Sync is a web app built with Angular and ngrx on the frontend and Nest on the backend. It lets you and your friends/colleagues choose a pizza before placing a group order. Built using https://github.com/maxime1992/angular-ngrx-starter

Language: TypeScript - Size: 57.7 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 168 - Forks: 49

RohanNagar/jmail

A modern and lightweight library for working with email addresses in Java

Language: Java - Size: 761 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 165 - Forks: 9

bijington/expressive

Expressive is a cross-platform expression parsing and evaluation framework. The cross-platform nature is achieved through compiling for .NET Standard so it will run on practically any platform.

Language: C# - Size: 3.74 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 164 - Forks: 25

untitaker/html5gum

A WHATWG-compliant HTML5 tokenizer and tag soup parser

Language: Rust - Size: 576 KB - Last synced at: about 22 hours ago - Pushed at: 2 months ago - Stars: 160 - Forks: 10

Anwarvic/Dan-Jurafsky--Chris-Manning--NLP

My solutions to the Natural Language Processing course taught by Dan Jurafsky and Chris Manning in Winter 2012.

Language: Java - Size: 49.7 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 157 - Forks: 55

ashutoshbsathe/yacv

Yet Another Compiler Visualizer

Language: Python - Size: 7.83 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 157 - Forks: 9

Krusen/BencodeNET

.NET library for encoding/decoding bencode and reading/writing torrent files

Language: C# - Size: 1.06 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 156 - Forks: 29

SoftCreatR/JSONPath

JSONPath implementation for PHP.

Language: PHP - Size: 267 KB - Last synced at: 25 days ago - Pushed at: about 1 month ago - Stars: 154 - Forks: 28

NiuTrans/NiuTrans.SMT

NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from the NLP Lab at Northeastern University and the NiuTrans Team. The NiuTrans system is developed entirely in C++, so it runs fast and uses less memory. Currently it supports phrase-based, hierarchical phrase-based and syntax-based (string-to-tree, tree-to-string and tree-to-tree) models for research-oriented studies.

Language: C++ - Size: 94 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 154 - Forks: 36

soulverteam/SoulverStringParsing

Learn about how SoulverCore can give Swift "better than regex" data parsing features (for many common tasks)

Language: Swift - Size: 44.9 KB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 154 - Forks: 3

dogoncouch/logdissect

CLI utility and Python module for analyzing log files and other data.

Language: Python - Size: 655 KB - Last synced at: 24 days ago - Pushed at: 9 months ago - Stars: 150 - Forks: 23

buehler/node-typescript-parser

Parser for TypeScript (and JavaScript) files that compiles those files and generates a human-understandable AST.

Language: TypeScript - Size: 618 KB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 148 - Forks: 46

nerdsupremacist/Syntax

Write value-driven parsers quickly in Swift with an intuitive SwiftUI-like DSL

Language: Swift - Size: 2.7 MB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 148 - Forks: 11

scraperai/scraperai

ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.

Language: HTML - Size: 15.8 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 147 - Forks: 15

jamiejennings/rosie-pattern-language

Rosie Pattern Language (RPL) and the Rosie Pattern Engine have MOVED!

Size: 18.9 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 145 - Forks: 20

masak/alma

ALgoloid with MAcros -- a language with Algol-family syntax where macros take center stage

Language: Raku - Size: 2.31 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 140 - Forks: 15

0x2a-42/lelwel

Resilient LL(1) parser generator for Rust

Language: Rust - Size: 673 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 139 - Forks: 4

igordejanovic/parglare

A pure Python LR/GLR parser - http://www.igordejanovic.net/parglare/

Language: Python - Size: 13.5 MB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 139 - Forks: 32

aclindsa/ofxgo

Golang library for querying and parsing OFX

Language: Go - Size: 379 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 139 - Forks: 27

r-lib/evaluate

A version of eval for R that returns more information about what happened

Language: R - Size: 2.57 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 137 - Forks: 35

dcavar/python-tutorial-notebooks

Python tutorials as Jupyter Notebooks for NLP, ML, AI

Language: Jupyter Notebook - Size: 64.9 MB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 137 - Forks: 89

valayDave/arxiv-miner

arxiv_miner is a toolkit for mining research papers on CS ArXiv.

Language: Python - Size: 159 KB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 134 - Forks: 8

rodrigosetti/master-plan

Project Management for Hackers

Language: Haskell - Size: 1.1 MB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 134 - Forks: 6

WolframResearch/codeparser

Parse Wolfram Language source code as abstract syntax trees (ASTs) or concrete syntax trees (CSTs)

Language: Rust - Size: 6.85 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 133 - Forks: 14

breandan/galoisenne

🕸️ Graphs, finite fields and discrete dynamical systems in Kotlin

Language: Kotlin - Size: 121 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 130 - Forks: 11