Topic: "ingest"
sammcj/ingest
Parse files (e.g. code repos) and websites to clipboard or a file for ingestion by AI / LLMs
Language: Go - Size: 2.69 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 286 - Forks: 18

opensemanticsearch/open-semantic-etl
Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elasticsearch index & linked data graph database
Language: Python - Size: 615 KB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 268 - Forks: 72

newrelic/nri-flex
An application-agnostic, all-in-one New Relic integration
Language: Go - Size: 9.07 MB - Last synced at: about 18 hours ago - Pushed at: about 20 hours ago - Stars: 116 - Forks: 126

alongL/srs_ingest_helper
A JSON-controlled web server that helps SRS perform ingest.
Language: C++ - Size: 291 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 11

codingchili/ethereum-ingest
JavaFX and command-line application to import events from the Ethereum blockchain into ElasticSearch, MongoDB, Hazelcast, CQEngine and SQLite.
Language: Java - Size: 2.68 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 35 - Forks: 9

speculare-cloud/speculare-server
Receives data coming from the client, stores it in the database, and handles alerts to report incidents based on criteria
Language: Rust - Size: 634 KB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 31 - Forks: 1

alephdata/alephclient
API client for Aleph, supports bulk entity and document upload.
Language: Python - Size: 250 KB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 28 - Forks: 14

rse/vdo-ninja-trampoline
VDO.Ninja Trampoline
Language: HTML - Size: 790 KB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 27 - Forks: 12

E-ARK-Software/earkweb
E-ARK Web is software for the creation and management of archival information packages, and it supports full-text search for individual files contained in them.
Language: JavaScript - Size: 64.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 22 - Forks: 6

CloudFormations/CF.Cumulus
A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away the complexity, giving you the power to build, scale, and manage your dataflows with ease, accelerating data delivery.
Language: TSQL - Size: 10.7 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 20 - Forks: 13

artefactual-labs/enduro
A tool to support ingest and automation in digital preservation workflows
Language: Go - Size: 3.94 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 8

luludotdev/frameshift 📦
Ingest and Viewing stack for deploying Mixer's FTL streaming protocol.
Language: TypeScript - Size: 427 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 2

prasanthj/culvert
Hive streaming ingest test application
Language: Java - Size: 51.8 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 2

jmxt3/Git-Scape-Web
Understand Any Repo in seconds. Instantly generate AI-ready code digests, visualize repository structures, and chat with your codebase using Git Scape AI.
Language: TypeScript - Size: 445 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 8 - Forks: 1

nagare-media/ingest
nagare media ingest implements various HTTP based media ingest protocols
Language: Go - Size: 297 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 7 - Forks: 0

artefactual-sdps/enduro
A tool to support ingest and automation in digital preservation workflows
Language: Go - Size: 15.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 6 - Forks: 3

EFS-OpenSource/superb-data-kraken-ingest
Language: Python - Size: 83 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

lloydmeta/jhhi
Java Heap Histogram Ingest, written in Rust. Sends jmap heap histograms to Elasticsearch.
Language: Rust - Size: 1.28 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 1

kids-first/kf-lib-data-ingest
🏭 Kids First Data Ingest Library
Language: Python - Size: 11.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Themimitoof/cambak
📸 A simple and easy to use tool for derushing digital cameras
Language: Go - Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 5 - Forks: 1

Regionaal-Archief-Zuid-Utrecht/razulibs
Data transformation tools for the creation of RDF for ingest of digital archives.
Language: Python - Size: 244 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4 - Forks: 2

YathishGP003/Cyber-Quest-v1
A gamified cybersecurity learning platform with 10 progressive difficulty levels, each teaching a specific security concept.
Language: TypeScript - Size: 2.32 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

tspannhw/GetWebCamera
Apache NiFi 1.23 Custom Processor for WebCams
Language: Java - Size: 2.57 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 4

ovidiugiorgi/csv2opensearch
Import CSV files into OpenSearch
Language: Go - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

tmcgrath/cassandra-ingest
DataStax or Cassandra Ingest from Relational Databases with StreamSets
Language: PLSQL - Size: 12.3 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 13

ShreyPurohit/gitingest-vsextension
Seamlessly analyze your repositories inside VS Code!
Language: TypeScript - Size: 12.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 1

Lolle2000la/MangaIngestWithUpscaling
Manage ingesting mangas, placing them in the right places and merging multiple sources together, while also enabling automatic upscaling of said mangas.
Language: C# - Size: 1.96 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

rotkonetworks/githem
Githem repos for your LLMs
Language: Rust - Size: 621 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

jmxt3/Git-Scape-API
Official API of the Git Scape AI project. Understand Any Repo in seconds
Language: Python - Size: 138 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

AilieOHagan/primm Fork of mohagan9/primm
Pre-Ingest Metadata Modifier
Language: Kotlin - Size: 70.3 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

LeidenUniversityLibrary/islandora_prepare_ingest
Islandora Prepare Ingest is a module that helps you build workflows for preparing data for ingest into Islandora.
Language: PHP - Size: 621 KB - Last synced at: 10 months ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

tspannhw/ClouderaNow2020
Twitter Ingest to S3, ORC, Slack, Hive Streaming
Size: 22.5 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

tspannhw/clouddatawarehouse
Populating Cloud Data Warehouses with Apache NiFi
Size: 14.4 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

jrcichra/ingestd
HTTP server that easily ingests data into a database
Language: Go - Size: 348 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

xerosic/gnam
🍽️ gnam - HTTP request ingester and visualizer
Language: Go - Size: 30.3 KB - Last synced at: 4 days ago - Pushed at: 23 days ago - Stars: 1 - Forks: 0

stevenlafl/GPTLoader
Ingest an entire codebase into ChatGPT to ask questions about it
Language: TypeScript - Size: 188 KB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

discoverygarden/dgi_migrate
A module to facilitate bulk ingest and migrations into Islandora using the migrate module.
Language: PHP - Size: 823 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 3

josev2046/Watch-Folder-Automation
A simple watch folder prototype for OVPs/OTTs/MAMs/DAMs.
Size: 49.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

sxudan/lets-watch
This is a watch party video streaming app
Language: C++ - Size: 12.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sxudan/live_file_publisher
A Flutter package for publishing files to an RTSP or RTMP server.
Language: Dart - Size: 28.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

shitizenlism/goForwarder
Uses ffmpeg to ingest file/RTSP/RTMP streams and push them to an RTMP server, automatically re-pulling after a disconnect to prevent hangs.
Language: Go - Size: 1.55 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

tspannhw/kudrone-nifi
Notes, scripts, images, Apache NiFi templates, processors
Size: 7.97 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

discoverygarden/islandora_ordered_zip_newspaper_batch
Newspaper batch ingest that uses the ordering of a ZIP to dictate sequence.
Language: PHP - Size: 21.5 KB - Last synced at: 5 months ago - Pushed at: about 9 years ago - Stars: 1 - Forks: 1

discoverygarden/book_batch_rename_script
Renames a folder full of TIFF/XML/MRC files and moves them into the correct folders to prepare for ingest using the Islandora Book Batch module
Language: Shell - Size: 158 KB - Last synced at: 3 months ago - Pushed at: over 11 years ago - Stars: 1 - Forks: 3

suwa-sh/local-RAG-backend
This is the backend for a RAG system that runs on Docker Compose. It registers documents in a wide range of file formats, which can be searched using the MCP server.
Language: Python - Size: 313 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

bigbossbrother/do-ingest-api-aggregate-r2
Aggregate 100,000s of files in a Durable Object, streaming out to R2, circumventing limitations
Language: TypeScript - Size: 37.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

codewmanas/Ascenza
An AI Career Coach
Language: JavaScript - Size: 310 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

erickrr-bd/WSbeat
Easy data ingestion into ElasticSearch using Python and websockets.
Language: Python - Size: 38.1 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lu-pl/rdfingest
A simple tool for ingesting RDF data into a triplestore.
Language: Python - Size: 814 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

dd-Splunk/splunk-datalake
How to combine SmartStore and ingest actions for a data lake use case
Language: Python - Size: 360 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lizclipse/grohuh
Service definition of grott & custom ingest/dashboard
Language: Rust - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

teknoir/node-red-teknoir-ingest
A set of nodes that lets you ingest data from all your devices.
Language: JavaScript - Size: 55.7 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sxudan/react-native-live-file-publisher
A React Native package for publishing files to an RTSP or RTMP server.
Language: TypeScript - Size: 4.16 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

data-modelisation/stock-market-ingesting
Market Data Ingestion and BigQuery Integration project
Language: Python - Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

jnylen/vind 📦
Vind is an importer for EPG data from various sources
Language: Elixir - Size: 707 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

znatty22/d3b-ingest-packages
Sandbox for setting up new ingest workflow - ingest data portal per PR
Language: Python - Size: 2.18 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

riffl/riffl
Riffl - generic streaming data ingestion framework
Language: Java - Size: 177 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

jcoon97/pushshift-ingest-delay
Simple webapp that will show the ingest delay for PushShift Reddit data
Language: TypeScript - Size: 14.6 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

discoverygarden/islandora_paged_content_pdf_batch
Language: PHP - Size: 49.8 KB - Last synced at: 5 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 8

mikejoh/artifactory-elasticsearch-ingest-pipelines
Ingest pipelines to parse Artifactory logs sent to Elasticsearch using Filebeat
Size: 1.95 KB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Staatsarchiv-Basel-Stadt/PREMIS-Profile-scopeIngest
Language: XSLT - Size: 29.3 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

discoverygarden/limerick_ingest
Language: PHP - Size: 205 KB - Last synced at: 3 months ago - Pushed at: almost 10 years ago - Stars: 0 - Forks: 0

discoverygarden/uofm_newspaper_batch
Batch ingest for IArchive format, as provided from UoM.
Language: XSLT - Size: 208 KB - Last synced at: 2 months ago - Pushed at: over 11 years ago - Stars: 0 - Forks: 0

discoverygarden/islandora_importer Fork of Islandora/islandora_importer
A pluggable batch ingester module for Islandora.
Language: XSLT - Size: 773 KB - Last synced at: over 1 year ago - Pushed at: over 11 years ago - Stars: 0 - Forks: 0

moses/moses_tools/docu
Documentation and demo notebooks for MOSES tools
Last synced at: about 1 year ago - Stars: 0 - Forks: 0