An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-catalog"

datahub-project/datahub

The Metadata Platform for your Data and AI Stack

Language: Java - Size: 392 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 10,646 - Forks: 3,120

open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

Language: TypeScript - Size: 1.84 GB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 6,762 - Forks: 1,234

amundsen-io/amundsen

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Language: Python - Size: 38.6 MB - Last synced at: 12 days ago - Pushed at: 14 days ago - Stars: 4,574 - Forks: 970

apache/gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Language: Java - Size: 48.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,519 - Forks: 468

opendatadiscovery/odd-platform

First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.

Language: Java - Size: 27.9 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 1,317 - Forks: 123

intake/intake

Intake is a lightweight package for finding, investigating, loading and disseminating data.

Language: Python - Size: 14.6 MB - Last synced at: 5 days ago - Pushed at: 20 days ago - Stars: 1,048 - Forks: 148

opendatadiscovery/awesome-data-catalogs

📙 Awesome Data Catalogs and Observability Platforms.

Size: 119 KB - Last synced at: 25 days ago - Pushed at: about 2 months ago - Stars: 844 - Forks: 62

rsyi/whale

🐳 The stupidly simple CLI workspace for your data warehouse.

Language: Python - Size: 11.6 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 728 - Forks: 38

gabledata/recap

Work with your web service, database, and streaming schemas in a single format.

Language: Python - Size: 1.43 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 343 - Forks: 26

tokern/piicatcher

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

Language: Python - Size: 1.38 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 311 - Forks: 99

raystack/meteor

Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog.

Language: Go - Size: 14.5 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 207 - Forks: 42

intake/intake-esm

An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.

Language: Python - Size: 11.8 MB - Last synced at: 5 days ago - Pushed at: 17 days ago - Stars: 150 - Forks: 48

GoogleCloudPlatform/bigquery-data-lineage 📦

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Language: Java - Size: 356 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 143 - Forks: 41

getmetamapper/metamapper

Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.

Language: Python - Size: 33 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 79 - Forks: 6

aws-samples/aws-dbs-refarch-datalake

Reference Architectures for Datalakes on AWS

Language: HTML - Size: 4.52 MB - Last synced at: 26 days ago - Pushed at: about 5 years ago - Stars: 79 - Forks: 31

GoogleCloudPlatform/datacatalog-connectors-rdbms 📦

Sample code with integration between Data Catalog and RDBMS data sources.

Language: Python - Size: 532 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 72 - Forks: 50

google/grizzly

End-to-end DataOps platform deployed by Terraform.

Language: Python - Size: 113 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 66 - Forks: 10

Tinkoff/data-detective 📦

Data catalog for everything in your company

Language: Python - Size: 8.99 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 45 - Forks: 13

opendatadiscovery/odd-collector 📦

Open-source metadata collector based on ODD Specification

Language: Python - Size: 1.96 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 13

Bayer-Group/COLID-Documentation

The documentation repository is part of the Corporate Linked Data Catalog - short: COLID - application.

Language: HTML - Size: 3.93 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 6

commondataio/dataportals-registry

Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard

Language: Python - Size: 103 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 41 - Forks: 6

ihsn/nada

National Data Archive (NADA) is an open source data cataloging system that serves as a portal for researchers to browse, search, compare, apply for access, and download relevant census or survey information. It was originally developed to support the establishment of national survey data archives.

Language: PHP - Size: 72 MB - Last synced at: 25 days ago - Pushed at: 26 days ago - Stars: 40 - Forks: 12

getstrm/pace

Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain 'ol Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.

Language: Kotlin - Size: 13 MB - Last synced at: about 12 hours ago - Pushed at: about 24 hours ago - Stars: 36 - Forks: 1

awesome-mlops/awesome-data-management

A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀

Size: 4.88 KB - Last synced at: 27 days ago - Pushed at: about 3 years ago - Stars: 32 - Forks: 3

GoogleCloudPlatform/datacatalog-connectors-bi 📦

Sample code with integration between Data Catalog and BI data sources.

Language: Python - Size: 573 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 32 - Forks: 15

rejot-dev/rejot

Supercharged Replication for Developers

Language: TypeScript - Size: 6.08 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 31 - Forks: 1

tosh2230/stairlight

A data lineage tool detects table dependencies from rendered SQL statements.

Language: Python - Size: 2.36 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 29 - Forks: 0

montara-io/dbt-command-center

Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

Language: TypeScript - Size: 3.55 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 28 - Forks: 0

carte-data/carte

A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.

Language: Python - Size: 268 KB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 27 - Forks: 0

SciCatProject/frontend

SciCat Project Official Frontend

Language: TypeScript - Size: 46.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 25 - Forks: 33

odpi/egeria-docs

Documentation repository for the Egeria project.

Language: HTML - Size: 403 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 23 - Forks: 31

unytics/catalog_builder

Data Catalogs Made Easy

Language: Python - Size: 2.64 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 2

datopian/portal.js.bak

🌀 The JS data presentation framework. For a single dataset to a full catalog.

Language: JavaScript - Size: 2.34 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 2

dbt-content/google-datacatalog-dbt-tag

Update a Google Data Catalog tag with dbt Cloud run metadata

Language: Python - Size: 454 KB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 2

apecs-org/Polar-EO-Database

Polar Earth Observation Database of satellite sensors

Language: Python - Size: 131 KB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 5

aaronspring/remote_climate_data

a collection of remote climate data accessed via intake cached to disk

Language: Jupyter Notebook - Size: 2.36 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 19 - Forks: 2

tum-gis/sddi-ckan-k8s

Helm chart for Smart District Data Infrastructure enabled CKAN

Language: Smarty - Size: 771 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 4

darenasc/aeda

Build a data catalog by running a single line of code

Language: Python - Size: 3.24 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 17 - Forks: 0

related-sciences/articat

articat: data artifact catalog

Language: Python - Size: 124 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 17 - Forks: 2

Bayer-Group/COLID-Setup

The setup repository is part of the Corporate Linked Data Catalog - short: COLID - application. It helps setting up a local environment based on Docker Compose.

Language: Shell - Size: 29.4 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 17 - Forks: 6

FINRAOS/herd-mdl

Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.

Language: Java - Size: 3.33 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 15 - Forks: 14

open-metadata/openmetadata-site

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

Language: TypeScript - Size: 54.6 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 14 - Forks: 11

ulbmuenster/dataasee

DatAasee - A Metadata-Lake for Libraries

Language: Makefile - Size: 3.06 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 14 - Forks: 2

CafIncubator/Midden

A research metadata catalog and metadata editor that integrates into common workflows used in academic research.

Language: C# - Size: 147 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 16

GuinsooLab/darkseal

A Single place to Discover, Collaborate, and Get your data right

Language: TypeScript - Size: 272 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 14 - Forks: 6

NCAR/esm-collection-spec 📦

Earth System Model Collection specification

Size: 74.2 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 7

GoogleCloudPlatform/datacatalog-tag-history 📦

Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quality and user behaviour. This solution creates Data Catalog Tags history in BigQuery since Data Catalog keeps only the latest version of metadata for fast searchability.

Language: Java - Size: 152 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 5

nasa/data-nasa-gov-frontpage

a frontpage for data.nasa.gov

Language: SCSS - Size: 4.71 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 12 - Forks: 10

CDCgov/cdh-lava-react

CDC Data Hub Lifecycle, Analysis & Visualization Accelerator (LAVA) REACT Components based on machine readable requirements.

Language: CSS - Size: 14.4 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 11 - Forks: 4

LiamBindle/bashdatacatalog

A simple portable file cataloging tool for bash

Language: Shell - Size: 84 KB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 1

apache/airavata-data-catalog

Apache Airavata Data Catalog

Language: Java - Size: 166 KB - Last synced at: about 1 hour ago - Pushed at: 3 months ago - Stars: 10 - Forks: 6

slaclab/datacat

A system for managing files and file replicas across many diverse sites

Language: Java - Size: 2.21 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 3

MattTriano/analytics_data_where_house

An analytics engineering sandbox focusing on real estates prices in Cook County, IL

Language: Python - Size: 17.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 9 - Forks: 0

opendatadiscovery/odd-collectors

Language: Python - Size: 2.02 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 9 - Forks: 11

Bayer-Group/COLID-Data-Marketplace-Frontend

The Data Marketplace frontend repository is part of the Corporate Linked Data Catalog - short: COLID - application. Users can search for registered resources in COLID. It provides a search bar, aggregation filters and search result displaying including term highlighting.

Language: TypeScript - Size: 5.78 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 9 - Forks: 4

catalyst-cooperative/pudl-catalog 📦

An Intake catalog for distributing open energy system data liberated by Catalyst Cooperative.

Language: Python - Size: 602 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 2

Bayer-Group/COLID-AppData-Service

The appdata service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It maintains the user data and application settings.

Language: C# - Size: 380 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 1

zillow/intake-nested-yaml-catalog

Supports a single YAML file hierarchical catalog to organize datasets and avoid a data swamp.

Language: Python - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 2

blw-ofag-ufag/data-catalog

A MVP data catalog for DigiAgriFoodCH

Language: TypeScript - Size: 44.7 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 6 - Forks: 1

Bayer-Group/COLID-Editor-Frontend

The editor frontend repository is part of the Corporate Linked Data Catalog - short: COLID - application. It offers user an metadata based user interface to register resources in COLID.

Language: TypeScript - Size: 2.15 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 1

Bayer-Group/COLID-Indexing-Crawler-Service

The Indexing Crawler Service (ICS) repository is part of the Corporate Linked Data Catalog - short: COLID - application. It is responsible to extract data from a RDF storage system, transform and enrich the data and finally to send it via a message queue to the DMP Webservice for indexing.

Language: C# - Size: 137 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 0

Bayer-Group/COLID-Search-Service

The search service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It makes the data findable and provides indexing and search functionalities based on Elasticsearch.

Language: C# - Size: 246 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 1

asc-csa/ckanext-asc-csa

📈 Extension CKAN pour le portail de données et information ouvertes de l'ASC | 📈CKAN Extension for the CSA open data and information portal

Language: JavaScript - Size: 10.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 2

ryanrozich/snowflake-dbml-generator

Automatically generate DBML files from Snowflake databases for quickly reverse engineer interactive ER diagrams and documentation from your Snowflake DB. Ideal for data engineers and analysts, it supports custom primary key configurations and relationship inference.

Language: Python - Size: 1.41 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

Bayer-Group/COLID-Registration-Service

The registration service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It is the central microservice to register resources in the triplestore.

Language: C# - Size: 1.7 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

Bayer-Group/COLID-Scheduler-Service

The scheduler service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It sets up recurring jobs for user notifications and analytics.

Language: C# - Size: 114 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

Bayer-Group/COLID-Reporting-Service

The reporting service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It offers an API for statistics of registered resources.

Language: C# - Size: 146 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

sahays/serverless-analytics

AWS Serverless Analytics using Amazon S3, Athena, Glue, and QuickSight

Language: Python - Size: 16.1 MB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 5

opendatadiscovery/odd-collector-gcp 📦

Open-source GCP metadata collector based on ODD Specification

Language: Python - Size: 188 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

Snow-Fox-Data/dss-thread

Dataiku Thread™ Data Catalog Plugin by Snow Fox Data

Language: CSS - Size: 217 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 2

SciCatProject/localdeploy

SciCat Data Catalog Kubernetes Deployment

Language: Shell - Size: 278 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

victorcouste/google-data-catalog-dataprep

Create or update Google Cloud Data Catalog tags with Cloud Dataprep metadata and column profile

Language: Python - Size: 1.7 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

data2health/repository-and-index-software

This is a list of repositories, repository frameworks, and data catalogs. It focuses on technical architecture, how metadata is handled and what standards are used, and what next-generation repository features (if any) are implemented.

Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

stelar-eu/data-api

REST API for managing and searching resources in the Knowledge Lake Management System

Language: JavaScript - Size: 3.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

open-metadata/openmetadata-sdk

OpenMetadata client SDK. Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

Language: Go - Size: 109 KB - Last synced at: 6 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 2

jorge-martinez-gil/dataq

Framework to Automatically Determine the Quality of Open Data Catalogs

Language: Python - Size: 145 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

DieGit0/windfarm

Data Engineer project using Python and some AWS data services

Language: Jupyter Notebook - Size: 3.91 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

compilerla/intake-html-table 📦

Intake plugins for HTML tables.

Language: Python - Size: 85.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

macroecology-society/data-catalog

MAS group data catalog

Language: HTML - Size: 37.6 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

aminekaabachi/lexy

📙 Lexy enables you to easily build and share data dictionaries to explain and document your data terminology using code.

Language: Python - Size: 55.7 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

victorcouste/google-datacatalog-dbt-tag

Update a Google Data Catalog tag with dbt Cloud run metadata

Language: Python - Size: 401 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

jornh/amundsen Fork of amundsen-io/amundsen

Just a personal fork

Size: 3.24 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

victorcouste/dataprep-datacatalog-explorer

Web application to explore BigQuery tables tagged in Google Cloud Data Catalog with Cloud Dataprep tags

Language: HTML - Size: 324 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

CSIRO-enviro-informatics/HBee

An Approximate Deep Spatial Catalog and Search

Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

lpraat/inbq

inbq extracts schema-aware, column-level lineage from multi-statement BigQuery queries.

Language: Rust - Size: 437 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

hackolade/glue

Hackolade(https://hackolade.com) plugin for AWS Glue Data Catalog

Language: JavaScript - Size: 22.9 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 8

krista-t/data-catalog

Data Catalog documentation, from terms and explanation to strategic roadmap for implementation. Published to Github pages from my Obsidian via Quartz plugin.

Language: TypeScript - Size: 10.7 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

ev2900/DataZone_Demo

Prebuilt demo of Amazon DataZone using fake data for Pharmaceutical drug discovery

Language: Python - Size: 425 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

alankrantas/loc-documentation

End-user documentation for FST Network's Logic Operating Centre (LOC), a serverless SaaS data product platform

Language: MDX - Size: 17.6 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

SciCatProject/scicatproject.github.io

SciCat Data Catalogue Documentation Website

Language: HTML - Size: 44.8 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 4

CIAT-DAPA/agrilac_catalogue

CMS portal focused in sharing agroclimatic information

Language: HTML - Size: 5.52 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

jonahkeegan/data-dictionary-agency

A specialized data analysis tool designed to automatically scan GitHub repositories for structured data files, extract their schemas, map relationships between tables, and generate comprehensive documentation with interactive visualizations.

Language: Python - Size: 631 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

SwissOpenEM/swissopenem.github.io

Project website

Language: HTML - Size: 28.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 2

OrangeBoatPencil/OpenMetadata Fork of open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

Size: 1.73 GB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

garethcmurphy/SciCat-HDF5-Import-Tool

# SciCat HDF5 Import Tool 🐍📊 This repository provides a tool for importing **HDF5 raw data** into the **SciCat data catalog** used at the **European Spallation Source (ESS)**. It also supports metadata harvesting to ensure comprehensive data cataloging. --- ## Features ✨ - **HDF5 Import**: Automatically imports raw data files into SciCat.

Language: Python - Size: 50.8 KB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

garethcmurphy/Photon-and-Neutron-Scientific-Dataset-Search

# Photon and Neutron Scientific Dataset Search 🌌🔬 This repository provides a **LoopBack-based FAIR Data API** for searching photon and neutron scientific datasets from the **PaNOSC (Photon and Neutron Open Science Cloud)**. It connects researchers to data from 15 PaNOSC institutes, enabling seamless discovery and access. --- ## Features ✨

Language: JavaScript - Size: 58.6 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

garethcmurphy/Inquire-Docker-Setup

# Inquire Docker Setup 🐳✨ This repository provides a **Docker setup** for deploying the **Inquire Data Catalogue App**, featuring a **React frontend** and a **FastAPI backend**. The setup includes all necessary Python package dependencies, such as `pydantic` and others, for seamless deployment. --- ## Features ✨ - **React Frontend**: Inter

Language: Dockerfile - Size: 27.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lasyakonduru/superstore-sales-data-analysis

Analysis of sales performance and operational efficiency in a superstore using AWS Athena and QuickSight

Size: 2.25 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ruisdael-observatory/Ruisdael-Data-Catalog

repository to aid the creation of Ruisdael Data Catalog - documentation, resarch and scripts

Language: Shell - Size: 546 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

DieGit0/data_realtime_-_batch_analytics

Data Streaming and Batch processing using AWS Services

Language: Python - Size: 13.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0