An open API service providing repository metadata for many open source software ecosystems.

Topic: "ingest"

sammcj/ingest

Parse files (e.g. code repos) and websites to clipboard or a file for ingestions by AI / LLMs

Language: Go - Size: 2.69 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 286 - Forks: 18

opensemanticsearch/open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database

Language: Python - Size: 615 KB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 268 - Forks: 72

newrelic/nri-flex

An application-agnostic, all-in-one New Relic integration integration

Language: Go - Size: 9.07 MB - Last synced at: about 18 hours ago - Pushed at: about 20 hours ago - Stars: 116 - Forks: 126

alongL/srs_ingest_helper

a json controlled web server to help srs doing ingest.

Language: C++ - Size: 291 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 11

codingchili/ethereum-ingest

JavaFX and commandline application to import events from the Ethereum blockchain into ElasticSearch, MongoDB, Hazelcast, CQEngine and SQLite.

Language: Java - Size: 2.68 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 35 - Forks: 9

speculare-cloud/speculare-server

Receive, store info coming from the client into the database and handle alerts to report incidents based on criteria

Language: Rust - Size: 634 KB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 31 - Forks: 1

alephdata/alephclient

API client for Aleph, supports bulk entity and document upload.

Language: Python - Size: 250 KB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 28 - Forks: 14

rse/vdo-ninja-trampoline

VDO.Ninja Trampoline

Language: HTML - Size: 790 KB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 27 - Forks: 12

E-ARK-Software/earkweb

E-ARK Web is a software for the creation and management of archival information packages, and it supports full-text search for individual files contained in them.

Language: JavaScript - Size: 64.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 22 - Forks: 6

CloudFormations/CF.Cumulus

A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away the complexity, giving you the power to build, scale, and manage your dataflows with ease, accelerating data delivery.

Language: TSQL - Size: 10.7 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 20 - Forks: 13

artefactual-labs/enduro

A tool to support ingest and automation in digital preservation workflows

Language: Go - Size: 3.94 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 8

luludotdev/frameshift 📦

Ingest and Viewing stack for deploying Mixer's FTL streaming protocol.

Language: TypeScript - Size: 427 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 2

prasanthj/culvert

Hive streaming ingest test application

Language: Java - Size: 51.8 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 2

jmxt3/Git-Scape-Web

Understand Any Repo in seconds. Instantly generate AI-ready code digests, visualize repository structures, and chat with your codebase using Git Scape AI.

Language: TypeScript - Size: 445 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 8 - Forks: 1

nagare-media/ingest

nagare media ingest implements various HTTP based media ingest protocols

Language: Go - Size: 297 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 7 - Forks: 0

artefactual-sdps/enduro

A tool to support ingest and automation in digital preservation workflows

Language: Go - Size: 15.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 6 - Forks: 3

EFS-OpenSource/superb-data-kraken-ingest

Language: Python - Size: 83 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

lloydmeta/jhhi

Java Heap Histogram Ingest, written in Rust. Sends jmap heap histograms to Elasticsearch.

Language: Rust - Size: 1.28 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 1

kids-first/kf-lib-data-ingest

🏭 Kids First Data Ingest Library

Language: Python - Size: 11.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Themimitoof/cambak

📸 A simple and easy to use tool for derushing digital cameras

Language: Go - Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 5 - Forks: 1

Regionaal-Archief-Zuid-Utrecht/razulibs

Data transformation tools for the creation of RDF for ingest of digital archives.

Language: Python - Size: 244 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4 - Forks: 2

YathishGP003/Cyber-Quest-v1

A gamified cybersecurity learning platform with 10 progressive difficulty levels, each teaching a specific security concept.

Language: TypeScript - Size: 2.32 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

tspannhw/GetWebCamera

Apache NiFi 1.23 Custom Processor for WebCams

Language: Java - Size: 2.57 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 4

ovidiugiorgi/csv2opensearch

Import CSV files into OpenSearch

Language: Go - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

tmcgrath/cassandra-ingest

DataStax or Cassandra Ingest from Relational Databases with StreamSets

Language: PLSQL - Size: 12.3 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 13

ShreyPurohit/gitingest-vsextension

Seamlessly analyze your repositories inside VS Code!

Language: TypeScript - Size: 12.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 1

Lolle2000la/MangaIngestWithUpscaling

Manage ingesting mangas, placing them in the right places and merging multiple sources together, while also enabling automatic upscaling of said mangas.

Language: C# - Size: 1.96 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

rotkonetworks/githem

Githem repos for your LLMs

Language: Rust - Size: 621 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

jmxt3/Git-Scape-API

Official API of the project Git Space AI. Understand Any Repo in seconds

Language: Python - Size: 138 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

AilieOHagan/primm Fork of mohagan9/primm

Pre-Ingest Metadata Modifier

Language: Kotlin - Size: 70.3 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

LeidenUniversityLibrary/islandora_prepare_ingest

Islandora Prepare Ingest is a module that helps you build workflows for preparing data for ingest into Islandora.

Language: PHP - Size: 621 KB - Last synced at: 10 months ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

tspannhw/ClouderaNow2020

Twitter Ingest to S3, ORC, Slack, Hive Streaming

Size: 22.5 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

tspannhw/clouddatawarehouse

Populating Cloud Data Warehouses with Apache NiFi

Size: 14.4 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

jrcichra/ingestd

HTTP server that easily ingests data into a database

Language: Go - Size: 348 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

xerosic/gnam

🍽️ gnam - HTTP request ingester and visualizer

Language: Go - Size: 30.3 KB - Last synced at: 4 days ago - Pushed at: 23 days ago - Stars: 1 - Forks: 0

stevenlafl/GPTLoader

Ingest an entire codebase into ChatGPT to ask questions about it

Language: TypeScript - Size: 188 KB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

discoverygarden/dgi_migrate

A module to facilitate bulk ingest and migrations into Islandora using the migrate module.

Language: PHP - Size: 823 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 3

josev2046/Watch-Folder-Automation

A simple watch folder prototype for OVPs/OTTs/MAMs/DAMs.

Size: 49.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

sxudan/lets-watch

This is a watch party video streaming app

Language: C++ - Size: 12.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sxudan/live_file_publisher

This is a flutter package to publish file to RTSP or RTMP server.

Language: Dart - Size: 28.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

shitizenlism/goForwarder

用ffmpeg采样File/rtsp/rtmp流,推送到RTMP Server,断开自动重拉,防假死。Use ffmpeg to ingest File/rtsp/rtmp streams, push to RTMP Server, auo re-pull and prevent fake-dead.

Language: Go - Size: 1.55 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

tspannhw/kudrone-nifi

Notes, scripts, images, Apache NiFi templates, processors

Size: 7.97 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

discoverygarden/islandora_ordered_zip_newspaper_batch

Newspaper batch ingest that uses the ordering of a ZIP to dictate sequence.

Language: PHP - Size: 21.5 KB - Last synced at: 5 months ago - Pushed at: about 9 years ago - Stars: 1 - Forks: 1

discoverygarden/book_batch_rename_script

Renames a folder full of TIFF/XML/MRC files and moves them into the correct folders to prepare for ingest using the Islandora Book Batch module

Language: Shell - Size: 158 KB - Last synced at: 3 months ago - Pushed at: over 11 years ago - Stars: 1 - Forks: 3

suwa-sh/local-RAG-backend

This is the backend for a RAG system that runs on Docker Compose. It registers documents in a wide range of file formats, which can be searched using the MCP server.

Language: Python - Size: 313 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

bigbossbrother/do-ingest-api-aggregate-r2

Aggregate 100.000s of files in a Durable Object, streaming out to R2, circumventing limitations

Language: TypeScript - Size: 37.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

codewmanas/Ascenza

An AI Career Coach

Language: JavaScript - Size: 310 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

erickrr-bd/WSbeat

Easy data ingestion into ElasticSearch using Python and websockets.

Language: Python - Size: 38.1 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lu-pl/rdfingest

A simple tool for ingesting RDF data into a triplestore.

Language: Python - Size: 814 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

dd-Splunk/splunk-datalake

How to combine smart store and ingest action for datalake use case

Language: Python - Size: 360 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lizclipse/grohuh

Service definition of grott & custom ingest/dashboard

Language: Rust - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

teknoir/node-red-teknoir-ingest

A set of nodes that lets you ingest data from all your devices.

Language: JavaScript - Size: 55.7 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sxudan/react-native-live-file-publisher

This is a react native package to publish file to RTSP or RTMP server.

Language: TypeScript - Size: 4.16 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

data-modelisation/stock-market-ingesting

Market Data Ingestion and BigQuery Integration project

Language: Python - Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

jnylen/vind 📦

Vind is a importer for EPG Data from various sources

Language: Elixir - Size: 707 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

znatty22/d3b-ingest-packages

Sandbox for setting up new ingest workflow - ingest data portal per PR

Language: Python - Size: 2.18 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

riffl/riffl

Riffl - generic streaming data ingestion framework

Language: Java - Size: 177 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

jcoon97/pushshift-ingest-delay

Simple webapp that will show the ingest delay for PushShift Reddit data

Language: TypeScript - Size: 14.6 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

discoverygarden/islandora_paged_content_pdf_batch

Language: PHP - Size: 49.8 KB - Last synced at: 5 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 8

mikejoh/artifactory-elasticsearch-ingest-pipelines

Ingest pipelines to parse Artifactory logs sent to Elasticsearch using Filebeat

Size: 1.95 KB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Staatsarchiv-Basel-Stadt/PREMIS-Profile-scopeIngest

Language: XSLT - Size: 29.3 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

discoverygarden/limerick_ingest

Language: PHP - Size: 205 KB - Last synced at: 3 months ago - Pushed at: almost 10 years ago - Stars: 0 - Forks: 0

discoverygarden/uofm_newspaper_batch

Batch ingest for IArchive format, as provided from UoM.

Language: XSLT - Size: 208 KB - Last synced at: 2 months ago - Pushed at: over 11 years ago - Stars: 0 - Forks: 0

discoverygarden/islandora_importer Fork of Islandora/islandora_importer

A pluggable batch ingester module for Islandora.

Language: XSLT - Size: 773 KB - Last synced at: over 1 year ago - Pushed at: over 11 years ago - Stars: 0 - Forks: 0

moses/moses_tools/docu

Documentation and demo notebooks MOSES tools

Last synced at: about 1 year ago - Stars: 0 - Forks: 0