GitHub / qurator-spk 13 Repositories
Curation Technologies
qurator-spk/sbb_images
Image Annotation Tool and Image Search
Language: JavaScript - Size: 37.4 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 14 - Forks: 3

qurator-spk/eynollah
Document Layout Analysis
Language: Python - Size: 6.01 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 372 - Forks: 31

qurator-spk/OCR_textline_editor
Language: Python - Size: 2.81 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

qurator-spk/dinglehopper
An OCR evaluation tool
Language: Python - Size: 3.66 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 65 - Forks: 16

qurator-spk/page2tsv
PAGE-XML to TSV
Language: Python - Size: 342 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 7

qurator-spk/sbb_ner
Named Entity Recognition
Language: Python - Size: 323 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 2

qurator-spk/publications
Qurator-SPK team publications
Size: 72.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

qurator-spk/sbb_utils
shared functionality
Language: Python - Size: 1.32 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

qurator-spk/sbb_binarization
Document Image Binarization
Language: Python - Size: 159 KB - Last synced at: 26 days ago - Pushed at: 7 months ago - Stars: 78 - Forks: 17

qurator-spk/sbb_pixelwise_segmentation
Pixelwise segmentation for document images
Language: Python - Size: 177 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 13 - Forks: 10

qurator-spk/setuptools_ocrd
Manage your package version through ocrd-tool.json
Language: Python - Size: 34.2 KB - Last synced at: 25 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

qurator-spk/mods4pandas
Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis
Language: Python - Size: 363 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

qurator-spk/sbb_textline_detection
Detect textlines in document images
Language: Python - Size: 190 KB - Last synced at: 10 months ago - Pushed at: 12 months ago - Stars: 84 - Forks: 17

qurator-spk/neat
Named entity annotation tool
Language: JavaScript - Size: 5.43 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 27 - Forks: 5

qurator-spk/ocrd_repair_inconsistencies 📦
Automatically re-order lines, words and glyphs to become textually consistent with their parents.
Language: Python - Size: 44.9 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 3

qurator-spk/sbb_ned
Named Entity Linking and Disambiguation
Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 2

qurator-spk/train-calamari-gt4histocr
Train a GT4HistOCR Calamari model
Language: Shell - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

qurator-spk/PyTorch-YOLOv3 Fork of eriklindernoren/PyTorch-YOLOv3
Minimal PyTorch implementation of YOLOv3
Language: Python - Size: 15.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

qurator-spk/sbb_ocr_postcorrection
Two-Step Approach to OCR Post-Correction
Language: Jupyter Notebook - Size: 515 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 3

qurator-spk/sbb_web-integration
Visualization of NER+EL+Topic Modelling + Image-Search
Language: JavaScript - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/sbb_topic-modelling
Topic Modelling
Language: Python - Size: 1.31 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/sbb_knowledge-base
Wikidata + Wikipedia Knowledge-Base Extraction for EL-purposes
Language: Python - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/sbb_page_extractor 📦
Language: Python - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

qurator-spk/page-to-alto Fork of kba/page-to-alto
Language: Python - Size: 448 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrd_trocr
OCR-D processor for TrOCR
Language: Python - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

qurator-spk/download-gitter.im-chat Fork of jbarth-ubhd/download-gitter.im-chat 📦
tiny tool to download gitter.im chat
Language: Perl - Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

qurator-spk/abbyy-to-alto Fork of Mewel/abbyy-to-alto
Converts FineReader abbyy.xml to alto.xml.
Size: 395 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

qurator-spk/2021-04-match-ocr-text-vs-gt-text
Match OCR page text to GT page text
Language: Jupyter Notebook - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrd-galley
A Dockerized test environment for OCR-D processors 🚢
Language: Shell - Size: 19.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

qurator-spk/sbb_tools
Digitalized Collections of the Berlin State Library: ALTO-XML Processing Tools / batch NER + EL / BERT-pre-training
Language: Python - Size: 1.31 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

qurator-spk/sbb_predict_tool
Language: Python - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

qurator-spk/sbb_column_classifier
Get the number of columns for a document image
Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

qurator-spk/ocrd_calamari Fork of OCR-D/ocrd_calamari
Recognize text using Calamari OCR and the OCR-D framework
Language: Python - Size: 125 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrd_butler Fork of StaatsbibliothekBerlin/ocrd_butler
A butler is a domestic worker in a large household. The butler, as the senior servant, has the highest servant status. He can also sometimes function as a chauffeur.
Size: 115 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

qurator-spk/ocrodeg Fork of NVlabs/ocrodeg
document image degradation
Language: Jupyter Notebook - Size: 15.3 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

qurator-spk/ocr-fileformat Fork of UB-Mannheim/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Size: 764 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

qurator-spk/page2img Fork of vahidrezanezhad/page-xml-to-image
Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

qurator-spk/core Fork of OCR-D/core
Collection of OCR-related python tools and wrappers from @OCR-D
Size: 25.2 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
