An open API service providing repository metadata for many open source software ecosystems.

GitHub / nachoDRT 1 Repository

nachoDRT/MERIT-Dataset

The MERIT Dataset is a fully synthetic, labeled dataset created for training and benchmarking LLMs on Visually Rich Document Understanding tasks. It is also designed to help detect biases and improve interpretability in LLMs, an area we are actively working on. This repository is actively maintained, and new features are continuously being added.

Language: Python - Size: 603 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 10 - Forks: 1

nachoDRT/MERIT-Students

Repository for assembling the MERIT-Students dataset, available on Hugging Face 🤗. It merges data from the MERIT Dataset and images from FairFace.

Language: Python - Size: 1.15 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

nachoDRT/MERIT-Secret

A place to store MERIT-Secret with accessible URLs

Size: 223 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

nachoDRT/Checking-Face

Language: Python - Size: 9.77 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

nachoDRT/MERIT-Subsets

A repository to simplify access to the MERIT Dataset image URLs. This repository contains only subsets; to access the complete dataset, please visit de-Rodrigo/merit on Hugging Face.

Language: Python - Size: 3.43 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

nachoDRT/dummy

Language: Python - Size: 1000 Bytes - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

nachoDRT/CVI-ICAI

The Computer Vision I Lab Repo

Language: TeX - Size: 141 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 9 - Forks: 8

nachoDRT/transformers Fork of huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Size: 221 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nachoDRT/nachoDRT

Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nachoDRT/sdg-industrial-part-pipeline

Synthetic Dataset Generation Pipeline for Industrial Parts

Language: Python - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

nachoDRT/none-sculpture-project

Connect a set of Raspberry Pi(s) to the cloud. The collection of boards should be able to download content (video) from the cloud. The app owner is responsible for updating content in the cloud.

Language: CSS - Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

nachoDRT/records-inference

Language: Python - Size: 45.9 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nachoDRT/github-slideshow

A robot-powered training repository 🤖

Language: Ruby - Size: 3.36 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nachoDRT/comillas

A repo to connect a frontend, a backend, and a deep learning module. The goal is to extract valuable information from documents that are not digital-native and provide an interface for end users.

Size: 11.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

nachoDRT/records-dataset

A pipeline to create labelled datasets. Datasets include '.pdf' and '.png' documents with visual information and '.json' files that store the labels.

Language: Python - Size: 157 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nachoDRT/trunsd

Transcript of Records in Noisy Scanned Documents

Size: 22.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0