Topic: "parsing"
FranxYao/Deep-Generative-Models-for-Natural-Language-Processing
DGMs for NLP. A roadmap.
Size: 32 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 390 - Forks: 32

rep-movsd/see-phit
A C++ HTML template engine that uses compile time HTML parsing
Language: C++ - Size: 133 KB - Last synced at: 27 days ago - Pushed at: about 6 years ago - Stars: 390 - Forks: 18

pointfreeco/swift-url-routing
A bidirectional router with more type safety and less fuss.
Language: Swift - Size: 6.83 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 372 - Forks: 30

viraptor/phply
PHP parser written in Python using PLY
Language: Python - Size: 265 KB - Last synced at: 22 days ago - Pushed at: about 2 years ago - Stars: 369 - Forks: 101

stalwartlabs/mail-parser
Fast and robust e-mail parsing library for Rust
Language: Rust - Size: 2.21 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 363 - Forks: 45

tpolecat/atto
friendly little parsers
Language: Scala - Size: 2.38 MB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 359 - Forks: 58

macmade/ClangKit
ClangKit provides an Objective-C frontend to LibClang. Source tokenization, diagnostics and fix-its are actually implemented.
Language: C - Size: 15.2 MB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 359 - Forks: 46

martijnversluis/ChordSheetJS
A JavaScript library for parsing and formatting chords and chord sheets
Language: TypeScript - Size: 6.58 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 356 - Forks: 53

rflechner/ScrapySharp
reborn of https://bitbucket.org/rflechner/scrapysharp
Language: C# - Size: 751 KB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 352 - Forks: 76

vlasovskikh/funcparserlib
Recursive descent parsing library for Python based on functional combinators
Language: Python - Size: 2.08 MB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 351 - Forks: 38

juntaoy/biaffine-ner
Named Entity Recognition as Dependency Parsing
Language: Python - Size: 52.7 KB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 348 - Forks: 39

analang/como-lang-ng
como-lang-ng is now ana-lang, located at https://github.com/analang/ana
Language: C - Size: 317 KB - Last synced at: about 22 hours ago - Pushed at: over 5 years ago - Stars: 342 - Forks: 36

nilproject/NiL.JS
JavaScript engine for .NET written in C#.
Language: JavaScript - Size: 71.5 MB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 339 - Forks: 47

jstockwin/py-pdf-parser
A Python tool to help extracting information from structured PDFs.
Language: Python - Size: 1.28 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 335 - Forks: 41

kataw/kataw
An 100% spec compliant ES2022 JavaScript toolchain
Language: TypeScript - Size: 224 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 334 - Forks: 6

Alexhuszagh/rust-lexical
Fast numeric to- and from-string conversion routines.
Language: Rust - Size: 124 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 326 - Forks: 40

jalajthanaki/NLPython
This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Language: Jupyter Notebook - Size: 131 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 322 - Forks: 207

saferwall/pe
A :zap: lightweight Go package to parse, analyze and extract metadata from Portable Executable (PE) binaries. Designed for malware analysis tasks and robust against PE malformations.
Language: Go - Size: 28.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 321 - Forks: 48

sirkon/ldetool
Code generator for fast log file parsers
Language: Go - Size: 823 KB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 319 - Forks: 22

SimGus/Chatette
A powerful dataset generator for Rasa NLU, inspired by Chatito
Language: Python - Size: 16.1 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 319 - Forks: 56

bodoni/svg
Composer and parser for SVG
Language: Rust - Size: 330 KB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 316 - Forks: 45

rgrove/parse-xml
A fast, safe, compliant XML parser for Node.js and browsers.
Language: JavaScript - Size: 1.57 MB - Last synced at: 20 days ago - Pushed at: 6 months ago - Stars: 307 - Forks: 16

5j9/wikitextparser
A Python library to parse MediaWiki WikiText
Language: Python - Size: 1.72 MB - Last synced at: 20 days ago - Pushed at: 7 months ago - Stars: 305 - Forks: 22

pyfisch/cbor 📦
CBOR support for serde.
Language: Rust - Size: 343 KB - Last synced at: about 20 hours ago - Pushed at: almost 3 years ago - Stars: 304 - Forks: 100

Duff89/parser_avito
Находит товары на avito.ru по ключевым словам
Language: Python - Size: 88.9 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 301 - Forks: 95

tree-sitter/swift-tree-sitter
Swift API for the tree-sitter incremental parsing system
Language: Swift - Size: 411 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 299 - Forks: 34

nHapiNET/nHapi
nHapi is the .Net port of the original Java project HAPI.
Language: C# - Size: 53.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 295 - Forks: 163

GregRos/parjs
JavaScript parser-combinator library
Language: TypeScript - Size: 6.55 MB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 295 - Forks: 20

emacs-tree-sitter/tree-sitter-langs
Language bundle for Emacs's tree-sitter package
Language: Emacs Lisp - Size: 1.34 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 292 - Forks: 136

satya-das/cppparser
A library to parse C/C++ source as AST
Language: C++ - Size: 19.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 291 - Forks: 38

mcmtroffaes/inipp
Simple C++ ini parser.
Language: C++ - Size: 101 KB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 290 - Forks: 57

RReverser/serde-xml-rs
xml-rs based deserializer for Serde (compatible with 1.0+)
Language: Rust - Size: 253 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 289 - Forks: 97

likexian/whois-parser
Whois parser for domain whois information parsing in Go(Golang).
Language: Go - Size: 1.04 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 289 - Forks: 90

StegSchreck/RatS
Movie Ratings Synchronization with Python
Language: HTML - Size: 4.25 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 287 - Forks: 31

ml-archive/Serpent 📦
A protocol to serialize Swift structs and classes for encoding and decoding.
Language: Swift - Size: 3.16 MB - Last synced at: 19 days ago - Pushed at: about 3 years ago - Stars: 286 - Forks: 13

indy-singh/StringsAreEvil
Reducing memory allocations from 7.5GB to 32KB
Language: C# - Size: 404 KB - Last synced at: 6 months ago - Pushed at: almost 5 years ago - Stars: 280 - Forks: 43

vincentneo/CoreGPX
A library for parsing and creation of GPX location files. Purely Swift.
Language: Swift - Size: 2.55 MB - Last synced at: 26 days ago - Pushed at: 7 months ago - Stars: 273 - Forks: 57

decorators-squad/eo-yaml
YAML for Java 8 and above. A user-friendly OOP library. Previously known as "Camel".
Language: Java - Size: 1.8 MB - Last synced at: 26 days ago - Pushed at: 6 months ago - Stars: 268 - Forks: 53

kstenerud/concise-encoding
The secure data format for a modern world
Language: ANTLR - Size: 3.22 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 268 - Forks: 4

jgontrum/spacy-api-docker
spaCy REST API, wrapped in a Docker container.
Language: Python - Size: 356 KB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 267 - Forks: 99

textX/Arpeggio
Parser interpreter based on PEG grammars written in Python http://textx.github.io/Arpeggio/
Language: Python - Size: 8.97 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 264 - Forks: 55

zeek/spicy
C++ parser generator for dissecting protocols & files.
Language: C++ - Size: 14.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 261 - Forks: 41

metasoarous/semantic-csv
Higher level tools for working with CSV data and files
Language: Clojure - Size: 419 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 257 - Forks: 22

d0c-s4vage/gramfuzz
gramfuzz is a grammar-based fuzzer that lets one define complex grammars to generate text and binary data formats.
Language: Python - Size: 6.19 MB - Last synced at: 28 days ago - Pushed at: over 4 years ago - Stars: 257 - Forks: 39

mike01/pypacker
:package: The fastest and simplest packet manipulation lib for Python
Language: Python - Size: 3.37 MB - Last synced at: 2 days ago - Pushed at: about 5 years ago - Stars: 256 - Forks: 43

propensive/contextual
Statically-checked string interpolation in Scala
Language: Scala - Size: 4.66 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 251 - Forks: 23

Kronuz/esprima-python
ECMAScript parsing infrastructure for multipurpose analysis
Language: Python - Size: 7.24 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 246 - Forks: 47

ThaDafinser/UserAgentParser
UserAgent parsing done right
Language: PHP - Size: 4.75 MB - Last synced at: about 22 hours ago - Pushed at: over 4 years ago - Stars: 245 - Forks: 47

tokay-lang/tokay
Tokay is a programming language designed for ad-hoc parsing, inspired by awk.
Language: Rust - Size: 1.64 MB - Last synced at: 8 days ago - Pushed at: 11 days ago - Stars: 242 - Forks: 7

fox-it/dissect.cstruct_legacy
A no-nonsense c-like structure parsing library for Python
Language: Python - Size: 38.1 KB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 239 - Forks: 25

iabudiab/HTMLKit
An Objective-C framework for your everyday HTML needs.
Language: HTML - Size: 2.45 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 236 - Forks: 26

iuccio/csvToJson
Convert CSV file to JSON
Language: JavaScript - Size: 529 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 231 - Forks: 43

felixSchl/neodoc 📦
Beautiful, hand-crafted commandline interfaces for node.js
Language: PureScript - Size: 2.47 MB - Last synced at: 7 days ago - Pushed at: about 4 years ago - Stars: 229 - Forks: 9

howardroark/pollinate
Template your base files and generate new projects from Git(Hub).
Language: JavaScript - Size: 1.68 MB - Last synced at: 10 days ago - Pushed at: over 5 years ago - Stars: 228 - Forks: 14

BLLIP/bllip-parser Fork of dmcc/bllip-parser
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/bllipparser/ for Python module.
Language: GAP - Size: 47.8 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 227 - Forks: 53

redco/goose-parser
Universal scraping tool, which allows you to extract data using multiple environments
Language: JavaScript - Size: 933 KB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 227 - Forks: 13

parsify-dev/desktop
Next generation notepad-based calculator, built with extendibility and privacy in mind.
Size: 2.51 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 226 - Forks: 3

mdaines/grammophone
A tool for analyzing and transforming context-free grammars.
Language: JavaScript - Size: 2.02 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 218 - Forks: 24

xpl/stacktracey
Parses call stacks. Reads sources. Clean & filtered output. Sourcemaps. Node & browsers.
Language: JavaScript - Size: 813 KB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 218 - Forks: 22

Captainarash/CaptCC
A tiny C compiler written purely in JavaScript.
Language: JavaScript - Size: 106 KB - Last synced at: 26 days ago - Pushed at: almost 6 years ago - Stars: 213 - Forks: 35

joanllenas/ts.data.json Fork of aische/JsonDecoder
A lightweight JSON decoding library for TypeScript
Language: TypeScript - Size: 677 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 209 - Forks: 16

spdx/tools-python
A Python library to parse, validate and create SPDX documents.
Language: Python - Size: 3.39 MB - Last synced at: 8 days ago - Pushed at: 17 days ago - Stars: 208 - Forks: 138

yakaz/yamerl
YAML 1.2 and JSON parser in pure Erlang
Language: Erlang - Size: 827 KB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 208 - Forks: 53

ehmicky/safe-json-value
⛑️ JSON serialization should never fail
Language: JavaScript - Size: 5.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 206 - Forks: 6

jimmiebergmann/mini-yaml
Single header YAML 1.0 C++11 serializer/deserializer.
Language: C++ - Size: 896 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 204 - Forks: 38

briansmith/untrusted
Safe, fast, zero-panic, zero-crashing, zero-allocation parsing of untrusted inputs in Rust.
Language: Rust - Size: 102 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 201 - Forks: 24

aappleby/matcheroni
A minimalist single-header library for building pattern-matchers, lexers, and parsers.
Language: C++ - Size: 7.31 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 200 - Forks: 5

bykof/gostradamus
Gostradamus: Better DateTimes for Go 🕰️
Language: Go - Size: 348 KB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 200 - Forks: 6

netheril96/StaticJSON
Fast, direct and static typed parsing of JSON with C++
Language: C++ - Size: 702 KB - Last synced at: about 16 hours ago - Pushed at: over 2 years ago - Stars: 200 - Forks: 35

bitwalker/combine
A parser combinator library for Elixir projects
Language: Elixir - Size: 243 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 199 - Forks: 19

rcdilorenzo/filtrex
A library for performing and validating complex filters from a client (e.g. smart filters)
Language: Elixir - Size: 234 KB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 198 - Forks: 28

colis-anr/morbig
A static parser for POSIX Shell
Language: OCaml - Size: 1.01 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 194 - Forks: 8

Cargill/OpenSIEM-Logstash-Parsing
SIEM Logstash parsing for more than hundred technologies
Language: Python - Size: 4.18 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 184 - Forks: 40

mck89/peast
JavaScript parser written in PHP that generates AST from your code according to ECMAScript specification
Language: PHP - Size: 1.72 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 182 - Forks: 20

simongray/clojure-dsl-resources
A curated list of Clojure resources for dealing with domain-specific languages.
Size: 109 KB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 182 - Forks: 5

maxime1992/pizza-sync
Pizza-Sync is a web app built on the frontend with angular, ngrx and on the backend with Nest. It let you and your friends/colleagues choose a pizza before placing a group order. Built using https://github.com/maxime1992/angular-ngrx-starter
Language: TypeScript - Size: 57.7 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 168 - Forks: 49

RohanNagar/jmail
A modern and lightweight library for working with email addresses in Java
Language: Java - Size: 761 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 165 - Forks: 9

bijington/expressive
Expressive is a cross-platform expression parsing and evaluation framework. The cross-platform nature is achieved through compiling for .NET Standard so it will run on practically any platform.
Language: C# - Size: 3.74 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 164 - Forks: 25

untitaker/html5gum
A WHATWG-compliant HTML5 tokenizer and tag soup parser
Language: Rust - Size: 576 KB - Last synced at: about 22 hours ago - Pushed at: 2 months ago - Stars: 160 - Forks: 10

Anwarvic/Dan-Jurafsky--Chris-Manning--NLP
My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Language: Java - Size: 49.7 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 157 - Forks: 55

ashutoshbsathe/yacv
Yet Another Compiler Visualizer
Language: Python - Size: 7.83 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 157 - Forks: 9

Krusen/BencodeNET
.NET library for encoding/decoding bencode and reading/writing torrent files
Language: C# - Size: 1.06 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 156 - Forks: 29

SoftCreatR/JSONPath
JSONPath implementation for PHP.
Language: PHP - Size: 267 KB - Last synced at: 25 days ago - Pushed at: about 1 month ago - Stars: 154 - Forks: 28

NiuTrans/NiuTrans.SMT
NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. The NiuTrans system is fully developed in C++ language. So it runs fast and uses less memory. Currently it supports phrase-based, hierarchical phrase-based and syntax-based (string-to-tree, tree-to-string and tree-to-tree) models for research-oriented studies.
Language: C++ - Size: 94 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 154 - Forks: 36

soulverteam/SoulverStringParsing
Learn about how SoulverCore can give Swift "better than regex" data parsing features (for many common tasks)
Language: Swift - Size: 44.9 KB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 154 - Forks: 3

dogoncouch/logdissect
CLI utility and Python module for analyzing log files and other data.
Language: Python - Size: 655 KB - Last synced at: 24 days ago - Pushed at: 9 months ago - Stars: 150 - Forks: 23

buehler/node-typescript-parser
Parser for typescript (and javascript) files, that compiles those files and generates a human understandable AST.
Language: TypeScript - Size: 618 KB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 148 - Forks: 46

nerdsupremacist/Syntax
Write value-driven parsers quickly in Swift with an intuitive SwiftUI-like DSL
Language: Swift - Size: 2.7 MB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 148 - Forks: 11

scraperai/scraperai
ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.
Language: HTML - Size: 15.8 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 147 - Forks: 15

jamiejennings/rosie-pattern-language
Rosie Pattern Language (RPL) and the Rosie Pattern Engine have MOVED!
Size: 18.9 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 145 - Forks: 20

masak/alma
ALgoloid with MAcros -- a language with Algol-family syntax where macros take center stage
Language: Raku - Size: 2.31 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 140 - Forks: 15

0x2a-42/lelwel
Resilient LL(1) parser generator for Rust
Language: Rust - Size: 673 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 139 - Forks: 4

igordejanovic/parglare
A pure Python LR/GLR parser - http://www.igordejanovic.net/parglare/
Language: Python - Size: 13.5 MB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 139 - Forks: 32

aclindsa/ofxgo
Golang library for querying and parsing OFX
Language: Go - Size: 379 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 139 - Forks: 27

r-lib/evaluate
A version of eval for R that returns more information about what happened
Language: R - Size: 2.57 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 137 - Forks: 35

dcavar/python-tutorial-notebooks
Python tutorials as Jupyter Notebooks for NLP, ML, AI
Language: Jupyter Notebook - Size: 64.9 MB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 137 - Forks: 89

valayDave/arxiv-miner
arxiv_miner is a toolkit for mining research papers on CS ArXiv.
Language: Python - Size: 159 KB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 134 - Forks: 8

rodrigosetti/master-plan
Project Management for Hackers
Language: Haskell - Size: 1.1 MB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 134 - Forks: 6

WolframResearch/codeparser
Parse Wolfram Language source code as abstract syntax trees (ASTs) or concrete syntax trees (CSTs)
Language: Rust - Size: 6.85 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 133 - Forks: 14

breandan/galoisenne
🕸️ Graphs, finite fields and discrete dynamical systems in Kotlin
Language: Kotlin - Size: 121 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 130 - Forks: 11
