Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / kuhumcst 61 repositories
kuhumcst/danish-semantic-reasoning-benchmark
A Danish semantic reasoning benchmark compiled from lexical semantic resources
Size: 645 KB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 2 - Forks: 0
kuhumcst/GEHM_zoom_corpus
Size: 1.83 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0
kuhumcst/DanNet
The Danish WordNet as an RDF graph.
Language: Clojure - Size: 18.2 MB - Last synced: 12 days ago - Pushed: 13 days ago - Stars: 19 - Forks: 0
kuhumcst/korp-setups
Docker setups for all Korp installations maintained by NorS.
Language: JavaScript - Size: 23.9 MB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0
kuhumcst/texton
Text Tonsorium - a toolbox that automatically arranges NLP tools in workflows and enacts them with user's inputs
Language: PHP - Size: 7.72 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 4 - Forks: 0
kuhumcst/texton-linguistic-resources
Linguistic resources for several of the tools included in the Text Tonsorium
Language: Roff - Size: 553 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
kuhumcst/Voyant Fork of sgsinclair/Voyant
Fork of Voyant Tools
Language: JavaScript - Size: 118 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
kuhumcst/qname 📦
A QName record and conversions between QNames, Keywords, and IRI strings.
Language: Clojure - Size: 1.95 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
kuhumcst/WordnetLoom Fork of CLARIN-PL/Depracticated-WordnetLoom
Wordnet Visual Editor (client/server application) developed by CLARIN-PL
Language: Java - Size: 44 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
kuhumcst/VLO-mapping Fork of clarin-eric/VLO-mapping
Mapping definitions and vocabularies for the Virtual Language Observatory
Language: Shell - Size: 9.66 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
kuhumcst/SPF-SPs-metadata Fork of clarin-eric/SPF-SPs-metadata
Metadata sources for all service providers in the CLARIN Service Provider Federation
Language: Shell - Size: 1.96 MB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
kuhumcst/texton-bin
Binary executable files used by services in the Text Tonsorium.
Size: 31.9 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
kuhumcst/semdax Fork of coastalcph/semdax
Sense-annotated corpora from the Semantic Processing Across Domains project
Size: 22.8 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
kuhumcst/stucco 📦
An experimental adaptive UI toolkit.
Language: Clojure - Size: 1000 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 31 - Forks: 1
kuhumcst/schemas Fork of globalwordnet/schemas
WordNet-LMF formats
Size: 603 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
kuhumcst/tinylemmatizer Fork of BartJongejan/tinylemmatizer
Simple lemmatizer for inclusion in Python programs that uses the same lemmatization rules as CSTlemma.
Language: C - Size: 37.1 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
kuhumcst/timeline Fork of simile-widgets/timeline
Size: 19.8 MB - Last synced: 3 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0
kuhumcst/repetitiveness-checker
Finds repeated sequences of two or more tokens
Language: C++ - Size: 41 KB - Last synced: 3 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
kuhumcst/rtfreader
Text segmenter and tokeniser for Danish, English and other languages. Reads an RTF or flat text file and outputs the text, one line per sentence & optionally tokenized.
Language: C++ - Size: 375 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 6 - Forks: 4
kuhumcst/switchboard-tool-registry Fork of clarin-eric/switchboard-tool-registry
The Switchboard Tool Registry
Language: Python - Size: 2.51 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
kuhumcst/sofie-vega-timeline
Language: JavaScript - Size: 9.57 MB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 1
kuhumcst/taggerXML
Modernized version of Eric Brill's Part Of Speech tagger.
Language: C++ - Size: 126 KB - Last synced: 3 months ago - Pushed: 6 months ago - Stars: 17 - Forks: 5
kuhumcst/META-SHARE Fork of metashare/META-SHARE
Public repository of the META-SHARE software
Language: Python - Size: 164 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
kuhumcst/opennlpPOSTagger
Webservice that wraps around the OpenNLP POS tagger
Language: Java - Size: 14.4 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
kuhumcst/ParlaMint Fork of clarin-eric/ParlaMint
ParlaMint: Comparable Parliamentary Corpora
Size: 2.03 GB - Last synced: 3 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
kuhumcst/public-license-selector Fork of ufal/public-license-selector
Tool that will help you select the right open license for your data or software
Language: CoffeeScript - Size: 600 KB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
kuhumcst/mate-POStagger
Webservice that wraps around the mate POS tagger
Language: Java - Size: 1.68 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
kuhumcst/parsesgml
Parse sgml, html and xml in a forgiving way.
Language: C++ - Size: 164 KB - Last synced: 3 months ago - Pushed: almost 10 years ago - Stars: 0 - Forks: 0
kuhumcst/parliament
Language: Clojure - Size: 6.84 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
kuhumcst/mate-parser
Web service that wraps around Bernd Bohnet's graph based parser
Language: Java - Size: 1.66 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
kuhumcst/letterfunc
Functions for upper/lower casing, for testing whether a character is a letter and for conversion between Unicode encodings UTF-8 and UTF-16
Language: C - Size: 633 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1
kuhumcst/makeUTF8
Converts UTF-16 (BE/LE), UTF-32 (BE/LE), ISO-8859-N to UTF-8. Removes BOM and surrogate pairs from UTF-8, converting a codepoint between U-D800 and U-DBFF followed by a codepoint between U-DC00 and U-DFFF to one valid codepoint > U-FFFF.
Language: C++ - Size: 43.9 KB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 1
kuhumcst/lexicalsample_wsd
Language: Jupyter Notebook - Size: 130 KB - Last synced: 3 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
kuhumcst/head_movement_detection
Jupyter notebooks and training data containing manual head movement annotations, speech data and velocity, acceleration and jerk data.
Language: Jupyter Notebook - Size: 13.3 MB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 1
kuhumcst/hashmap
Simple implementation of a hash map using separate chaining. The table allocates more buckets if the load factor is more than 100% and frees buckets if the loadfactor falls below 20%.
Language: C++ - Size: 11.7 KB - Last synced: 3 months ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0
kuhumcst/jerk
Analyses the movement of two points in x-y plane, in casu nose tips data from OpenPoseDemo.exe, and computes velocity, acceleration and jerk of the points.
Language: C++ - Size: 20.5 KB - Last synced: 3 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
kuhumcst/LemmaX
Lemmatiser with an extra. Predict lemmas as well as classes (e.g. Parts of Speech), based on the morphology of the input word.
Size: 19.5 KB - Last synced: 3 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0
kuhumcst/kuhumcst.github.io
Frontend demos.
Language: HTML - Size: 3.13 MB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
kuhumcst/fcs-korp-endpoint Fork of clarin-eric/fcs-korp-endpoint
The implementation of fcs-korp-endpoint running on Alf.
Language: Java - Size: 189 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
kuhumcst/clarin-dspace Fork of ufal/clarin-dspace
Digital repository for the CLARIN-DK data centre
Language: Java - Size: 140 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
kuhumcst/all2lower
Converts input text (UTF-8 encoded) to lowercase. Usage: all2lower <input> <output>
Language: Makefile - Size: 16.6 KB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
kuhumcst/pedestal-sp
Turn a Pedestal web service into a SAML Service Provider.
Language: Clojure - Size: 18.6 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 6 - Forks: 0
kuhumcst/affixtrain
Using supervised learning, create a set of affix rules for use by the CSTlemma lemmatiser.
Language: C++ - Size: 1.11 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 4 - Forks: 0
kuhumcst/rescope
Turn documents into UI components.
Language: Clojure - Size: 121 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 8 - Forks: 0
kuhumcst/cstlemma
Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.
Language: C++ - Size: 689 KB - Last synced: 25 days ago - Pushed: 6 months ago - Stars: 33 - Forks: 6
kuhumcst/regionh
Scrape JSON data from a list of Region Hovestaden URLs (with permission).
Language: Clojure - Size: 1000 Bytes - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
kuhumcst/xml-hiccup
Convert XML into Hiccup in Clojure and ClojureScript.
Language: Clojure - Size: 18.6 KB - Last synced: 3 months ago - Pushed: 6 months ago - Stars: 19 - Forks: 1
kuhumcst/hiccup-tools
Language: Clojure - Size: 3.91 KB - Last synced: 3 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
kuhumcst/dk5
Fetch the DK5 dataset and store it as EDN.
Language: Clojure - Size: 1.95 KB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
kuhumcst/clarin-tei Fork of kuhumcst/glossematics
Language: Clojure - Size: 11.1 MB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
kuhumcst/tei-clarin
Language: Clojure - Size: 2.8 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
kuhumcst/rum Fork of tonsky/rum
Simple, decomplected, isomorphic HTML UI library for Clojure and ClojureScript
Size: 3.04 MB - Last synced: 3 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
kuhumcst/tf-idf
A reasonably performant TF-IDF implementation.
Language: Clojure - Size: 9.77 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 12 - Forks: 1
kuhumcst/Anvil-Facetracker
OpenCV-based Plugin for the Anvil annotation software that tracks faces and creates annotations when velocity or acceleration thresholds are transgressed.
Language: Java - Size: 88 MB - Last synced: 3 months ago - Pushed: almost 8 years ago - Stars: 5 - Forks: 3
kuhumcst/texton-Java
Web-based workflow management system that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from the list of candidates. Implemented in Bracmat (~75%) and Java (~25%).
Language: Java - Size: 13.6 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 2 - Forks: 2
kuhumcst/glossematics
The life of Louis Hjelmslev.
Language: Clojure - Size: 9.64 MB - Last synced: 3 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 1
kuhumcst/finetune_bert_sense_select
Language: Python - Size: 60.5 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
kuhumcst/pycor
Language: Python - Size: 1.45 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
kuhumcst/aristotle Fork of arachne-framework/aristotle
RDF, SPARQL and OWL for Clojure
Language: Clojure - Size: 149 KB - Last synced: 3 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0
kuhumcst/cuphic
Transform or scrape Hiccup with a declarative DSL.
Language: Clojure - Size: 142 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0
kuhumcst/Danish-Similarity-Dataset
Gold standard resource for evaluation of Danish word embedding models.
Size: 12.7 KB - Last synced: 25 days ago - Pushed: about 4 years ago - Stars: 8 - Forks: 0
kuhumcst/wordties Fork of andersjo/andreord-public
WordTies presents a search interface and visualisation of the conceptual network of a WordNet, to browse its relational and multilingual links to other wordnets. Originally developed for visualising DanNet, the Danish wordnet.
Language: Ruby - Size: 72.4 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 3 - Forks: 1