An open API service providing repository metadata for many open source software ecosystems.

GitHub / qurator-spk 13 Repositories

Curation Technologies

qurator-spk/sbb_images

Image Annotation Tool and Image Search

Language: JavaScript - Size: 37.4 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 14 - Forks: 3

qurator-spk/eynollah

Document Layout Analysis

Language: Python - Size: 6.01 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 372 - Forks: 31

qurator-spk/OCR_textline_editor

Language: Python - Size: 2.81 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

qurator-spk/dinglehopper

An OCR evaluation tool

Language: Python - Size: 3.66 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 65 - Forks: 16

qurator-spk/page2tsv

PAGE-XML to TSV

Language: Python - Size: 342 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 7

qurator-spk/sbb_ner

Named Entity Recognition

Language: Python - Size: 323 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 2

qurator-spk/publications

Qurator-SPK team publications

Size: 72.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

qurator-spk/sbb_utils

shared functionality

Language: Python - Size: 1.32 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

qurator-spk/sbb_binarization

Document Image Binarization

Language: Python - Size: 159 KB - Last synced at: 26 days ago - Pushed at: 7 months ago - Stars: 78 - Forks: 17

qurator-spk/sbb_pixelwise_segmentation

Pixelwise segmentation for document images

Language: Python - Size: 177 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 13 - Forks: 10

qurator-spk/setuptools_ocrd

Manage your package version through ocrd-tool.json

Language: Python - Size: 34.2 KB - Last synced at: 25 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

qurator-spk/mods4pandas

Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis

Language: Python - Size: 363 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

qurator-spk/sbb_textline_detection

Detect textlines in document images

Language: Python - Size: 190 KB - Last synced at: 10 months ago - Pushed at: 12 months ago - Stars: 84 - Forks: 17

qurator-spk/neat

Named entity annotation tool

Language: JavaScript - Size: 5.43 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 27 - Forks: 5

qurator-spk/ocrd_repair_inconsistencies 📦

Automatically re-order lines, words and glyphs to become textually consistent with their parents.

Language: Python - Size: 44.9 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 3

qurator-spk/sbb_ned

Named Entity Linking and Disambiguation

Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 2

qurator-spk/train-calamari-gt4histocr

Train a GT4HistOCR Calamari model

Language: Shell - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

qurator-spk/PyTorch-YOLOv3 Fork of eriklindernoren/PyTorch-YOLOv3

Minimal PyTorch implementation of YOLOv3

Language: Python - Size: 15.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

qurator-spk/sbb_ocr_postcorrection

Two-Step Approach to OCR Post-Correction

Language: Jupyter Notebook - Size: 515 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 3

qurator-spk/sbb_web-integration

Visualization of NER+EL+Topic Modelling + Image-Search

Language: JavaScript - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/sbb_topic-modelling

Topic Modelling

Language: Python - Size: 1.31 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/sbb_knowledge-base

Wikidata + Wikipedia Knowledge-Base Extraction for EL-purposes

Language: Python - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/sbb_page_extractor 📦

Language: Python - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

qurator-spk/page-to-alto Fork of kba/page-to-alto

Language: Python - Size: 448 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrd_trocr

OCR-D processor for TrOCR

Language: Python - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

qurator-spk/download-gitter.im-chat Fork of jbarth-ubhd/download-gitter.im-chat 📦

tiny tool to download gitter.im chat

Language: Perl - Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

qurator-spk/abbyy-to-alto Fork of Mewel/abbyy-to-alto

Converts FineReader abbyy.xml to alto.xml.

Size: 395 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/2021-04-match-ocr-text-vs-gt-text

Match OCR page text to GT page text

Language: Jupyter Notebook - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrd-galley

A Dockerized test environment for OCR-D processors 🚢

Language: Shell - Size: 19.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

qurator-spk/sbb_tools

Digitalized Collections of the Berlin State Library: ALTO-XML Processing Tools / batch NER + EL / BERT-pre-training

Language: Python - Size: 1.31 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

qurator-spk/sbb_predict_tool

Language: Python - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

qurator-spk/sbb_column_classifier

Get the number of columns for a document image

Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

qurator-spk/ocrd_calamari Fork of OCR-D/ocrd_calamari

Recognize text using Calamari OCR and the OCR-D framework

Language: Python - Size: 125 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrd_butler Fork of StaatsbibliothekBerlin/ocrd_butler

A butler is a domestic worker in a large household. The butler, as the senior servant, has the highest servant status. He can also sometimes function as a chauffeur.

Size: 115 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrodeg Fork of NVlabs/ocrodeg

document image degradation

Language: Jupyter Notebook - Size: 15.3 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

qurator-spk/ocr-fileformat Fork of UB-Mannheim/ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Size: 764 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

qurator-spk/page2img Fork of vahidrezanezhad/page-xml-to-image

Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

qurator-spk/core Fork of OCR-D/core

Collection of OCR-related python tools and wrappers from @OCR-D

Size: 25.2 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0