Topic: "data-catalog"
datahub-project/datahub
The Metadata Platform for your Data and AI Stack
Language: Java - Size: 392 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 10,646 - Forks: 3,120

open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Language: TypeScript - Size: 1.84 GB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 6,762 - Forks: 1,234

amundsen-io/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Language: Python - Size: 38.6 MB - Last synced at: 12 days ago - Pushed at: 14 days ago - Stars: 4,574 - Forks: 970

apache/gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Language: Java - Size: 48.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,519 - Forks: 468

opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Language: Java - Size: 27.9 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 1,317 - Forks: 123

intake/intake
Intake is a lightweight package for finding, investigating, loading and disseminating data.
Language: Python - Size: 14.6 MB - Last synced at: 5 days ago - Pushed at: 20 days ago - Stars: 1,048 - Forks: 148

opendatadiscovery/awesome-data-catalogs
📙 Awesome Data Catalogs and Observability Platforms.
Size: 119 KB - Last synced at: 25 days ago - Pushed at: about 2 months ago - Stars: 844 - Forks: 62

rsyi/whale
🐳 The stupidly simple CLI workspace for your data warehouse.
Language: Python - Size: 11.6 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 728 - Forks: 38

gabledata/recap
Work with your web service, database, and streaming schemas in a single format.
Language: Python - Size: 1.43 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 343 - Forks: 26

tokern/piicatcher
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
Language: Python - Size: 1.38 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 311 - Forks: 99

raystack/meteor
Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog.
Language: Go - Size: 14.5 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 207 - Forks: 42

intake/intake-esm
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
Language: Python - Size: 11.8 MB - Last synced at: 5 days ago - Pushed at: 17 days ago - Stars: 150 - Forks: 48

GoogleCloudPlatform/bigquery-data-lineage 📦
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Language: Java - Size: 356 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 143 - Forks: 41

getmetamapper/metamapper
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Language: Python - Size: 33 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 79 - Forks: 6

aws-samples/aws-dbs-refarch-datalake
Reference Architectures for Datalakes on AWS
Language: HTML - Size: 4.52 MB - Last synced at: 26 days ago - Pushed at: about 5 years ago - Stars: 79 - Forks: 31

GoogleCloudPlatform/datacatalog-connectors-rdbms 📦
Sample code with integration between Data Catalog and RDBMS data sources.
Language: Python - Size: 532 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 72 - Forks: 50

google/grizzly
End-to-end DataOps platform deployed by Terraform.
Language: Python - Size: 113 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 66 - Forks: 10

Tinkoff/data-detective 📦
Data catalog for everything in your company
Language: Python - Size: 8.99 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 45 - Forks: 13

opendatadiscovery/odd-collector 📦
Open-source metadata collector based on ODD Specification
Language: Python - Size: 1.96 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 13

Bayer-Group/COLID-Documentation
The documentation repository is part of the Corporate Linked Data Catalog - short: COLID - application.
Language: HTML - Size: 3.93 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 6

commondataio/dataportals-registry
Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard
Language: Python - Size: 103 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 41 - Forks: 6

ihsn/nada
National Data Archive (NADA) is an open source data cataloging system that serves as a portal for researchers to browse, search, compare, apply for access, and download relevant census or survey information. It was originally developed to support the establishment of national survey data archives.
Language: PHP - Size: 72 MB - Last synced at: 25 days ago - Pushed at: 26 days ago - Stars: 40 - Forks: 12

getstrm/pace
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programmatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain 'ol Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.
Language: Kotlin - Size: 13 MB - Last synced at: about 12 hours ago - Pushed at: about 24 hours ago - Stars: 36 - Forks: 1

awesome-mlops/awesome-data-management
A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀
Size: 4.88 KB - Last synced at: 27 days ago - Pushed at: about 3 years ago - Stars: 32 - Forks: 3

GoogleCloudPlatform/datacatalog-connectors-bi 📦
Sample code with integration between Data Catalog and BI data sources.
Language: Python - Size: 573 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 32 - Forks: 15

rejot-dev/rejot
Supercharged Replication for Developers
Language: TypeScript - Size: 6.08 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 31 - Forks: 1

tosh2230/stairlight
A data lineage tool that detects table dependencies from rendered SQL statements.
Language: Python - Size: 2.36 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 29 - Forks: 0

montara-io/dbt-command-center
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
Language: TypeScript - Size: 3.55 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 28 - Forks: 0

carte-data/carte
A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.
Language: Python - Size: 268 KB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 27 - Forks: 0

SciCatProject/frontend
SciCat Project Official Frontend
Language: TypeScript - Size: 46.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 25 - Forks: 33

odpi/egeria-docs
Documentation repository for the Egeria project.
Language: HTML - Size: 403 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 23 - Forks: 31

unytics/catalog_builder
Data Catalogs Made Easy
Language: Python - Size: 2.64 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 2

datopian/portal.js.bak
🌀 The JS data presentation framework. For a single dataset to a full catalog.
Language: JavaScript - Size: 2.34 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 2

dbt-content/google-datacatalog-dbt-tag
Update a Google Data Catalog tag with dbt Cloud run metadata
Language: Python - Size: 454 KB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 2

apecs-org/Polar-EO-Database
Polar Earth Observation Database of satellite sensors
Language: Python - Size: 131 KB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 5

aaronspring/remote_climate_data
a collection of remote climate data accessed via intake cached to disk
Language: Jupyter Notebook - Size: 2.36 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 19 - Forks: 2

tum-gis/sddi-ckan-k8s
Helm chart for Smart District Data Infrastructure enabled CKAN
Language: Smarty - Size: 771 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 17 - Forks: 4

darenasc/aeda
Build a data catalog by running a single line of code
Language: Python - Size: 3.24 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 17 - Forks: 0

related-sciences/articat
articat: data artifact catalog
Language: Python - Size: 124 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 17 - Forks: 2

Bayer-Group/COLID-Setup
The setup repository is part of the Corporate Linked Data Catalog - short: COLID - application. It helps set up a local environment based on Docker Compose.
Language: Shell - Size: 29.4 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 17 - Forks: 6

FINRAOS/herd-mdl
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.
Language: Java - Size: 3.33 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 15 - Forks: 14

open-metadata/openmetadata-site
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Language: TypeScript - Size: 54.6 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 14 - Forks: 11

ulbmuenster/dataasee
DatAasee - A Metadata-Lake for Libraries
Language: Makefile - Size: 3.06 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 14 - Forks: 2

CafIncubator/Midden
A research metadata catalog and metadata editor that integrates into common workflows used in academic research.
Language: C# - Size: 147 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 16

GuinsooLab/darkseal
A Single place to Discover, Collaborate, and Get your data right
Language: TypeScript - Size: 272 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 14 - Forks: 6

NCAR/esm-collection-spec 📦
Earth System Model Collection specification
Size: 74.2 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 7

GoogleCloudPlatform/datacatalog-tag-history 📦
Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quality and user behaviour. This solution creates Data Catalog Tags history in BigQuery since Data Catalog keeps only the latest version of metadata for fast searchability.
Language: Java - Size: 152 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 5

nasa/data-nasa-gov-frontpage
a frontpage for data.nasa.gov
Language: SCSS - Size: 4.71 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 12 - Forks: 10

CDCgov/cdh-lava-react
CDC Data Hub Lifecycle, Analysis & Visualization Accelerator (LAVA) REACT Components based on machine readable requirements.
Language: CSS - Size: 14.4 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 11 - Forks: 4

LiamBindle/bashdatacatalog
A simple portable file cataloging tool for bash
Language: Shell - Size: 84 KB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 1

apache/airavata-data-catalog
Apache Airavata Data Catalog
Language: Java - Size: 166 KB - Last synced at: about 1 hour ago - Pushed at: 3 months ago - Stars: 10 - Forks: 6

slaclab/datacat
A system for managing files and file replicas across many diverse sites
Language: Java - Size: 2.21 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 3

MattTriano/analytics_data_where_house
An analytics engineering sandbox focusing on real estate prices in Cook County, IL
Language: Python - Size: 17.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 9 - Forks: 0

opendatadiscovery/odd-collectors
Language: Python - Size: 2.02 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 9 - Forks: 11

Bayer-Group/COLID-Data-Marketplace-Frontend
The Data Marketplace frontend repository is part of the Corporate Linked Data Catalog - short: COLID - application. Users can search for registered resources in COLID. It provides a search bar, aggregation filters and search result displaying including term highlighting.
Language: TypeScript - Size: 5.78 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 9 - Forks: 4

catalyst-cooperative/pudl-catalog 📦
An Intake catalog for distributing open energy system data liberated by Catalyst Cooperative.
Language: Python - Size: 602 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 2

Bayer-Group/COLID-AppData-Service
The appdata service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It maintains the user data and application settings.
Language: C# - Size: 380 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 1

zillow/intake-nested-yaml-catalog
Supports a single YAML file hierarchical catalog to organize datasets and avoid a data swamp.
Language: Python - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 2

blw-ofag-ufag/data-catalog
An MVP data catalog for DigiAgriFoodCH
Language: TypeScript - Size: 44.7 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 6 - Forks: 1

Bayer-Group/COLID-Editor-Frontend
The editor frontend repository is part of the Corporate Linked Data Catalog - short: COLID - application. It offers users a metadata-based user interface to register resources in COLID.
Language: TypeScript - Size: 2.15 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 1

Bayer-Group/COLID-Indexing-Crawler-Service
The Indexing Crawler Service (ICS) repository is part of the Corporate Linked Data Catalog - short: COLID - application. It is responsible for extracting data from an RDF storage system, transforming and enriching the data, and finally sending it via a message queue to the DMP Webservice for indexing.
Language: C# - Size: 137 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 0

Bayer-Group/COLID-Search-Service
The search service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It makes the data findable and provides indexing and search functionalities based on Elasticsearch.
Language: C# - Size: 246 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 1

asc-csa/ckanext-asc-csa
📈 CKAN Extension for the CSA open data and information portal
Language: JavaScript - Size: 10.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 2

ryanrozich/snowflake-dbml-generator
Automatically generate DBML files from Snowflake databases to quickly reverse-engineer interactive ER diagrams and documentation from your Snowflake DB. Ideal for data engineers and analysts, it supports custom primary key configurations and relationship inference.
Language: Python - Size: 1.41 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

Bayer-Group/COLID-Registration-Service
The registration service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It is the central microservice to register resources in the triplestore.
Language: C# - Size: 1.7 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

Bayer-Group/COLID-Scheduler-Service
The scheduler service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It sets up recurring jobs for user notifications and analytics.
Language: C# - Size: 114 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

Bayer-Group/COLID-Reporting-Service
The reporting service repository is part of the Corporate Linked Data Catalog - short: COLID - application. It offers an API for statistics of registered resources.
Language: C# - Size: 146 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

sahays/serverless-analytics
AWS Serverless Analytics using Amazon S3, Athena, Glue, and QuickSight
Language: Python - Size: 16.1 MB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 5

opendatadiscovery/odd-collector-gcp 📦
Open-source GCP metadata collector based on ODD Specification
Language: Python - Size: 188 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

Snow-Fox-Data/dss-thread
Dataiku Thread™ Data Catalog Plugin by Snow Fox Data
Language: CSS - Size: 217 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 2

SciCatProject/localdeploy
SciCat Data Catalog Kubernetes Deployment
Language: Shell - Size: 278 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

victorcouste/google-data-catalog-dataprep
Create or update Google Cloud Data Catalog tags with Cloud Dataprep metadata and column profile
Language: Python - Size: 1.7 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

data2health/repository-and-index-software
This is a list of repositories, repository frameworks, and data catalogs. It focuses on technical architecture, how metadata is handled and what standards are used, and what next-generation repository features (if any) are implemented.
Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

stelar-eu/data-api
REST API for managing and searching resources in the Knowledge Lake Management System
Language: JavaScript - Size: 3.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

open-metadata/openmetadata-sdk
OpenMetadata client SDK. Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Language: Go - Size: 109 KB - Last synced at: 6 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 2

jorge-martinez-gil/dataq
Framework to Automatically Determine the Quality of Open Data Catalogs
Language: Python - Size: 145 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

DieGit0/windfarm
Data Engineer project using Python and some AWS data services
Language: Jupyter Notebook - Size: 3.91 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

compilerla/intake-html-table 📦
Intake plugins for HTML tables.
Language: Python - Size: 85.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

macroecology-society/data-catalog
MAS group data catalog
Language: HTML - Size: 37.6 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

aminekaabachi/lexy
📙 Lexy enables you to easily build and share data dictionaries to explain and document your data terminology using code.
Language: Python - Size: 55.7 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

victorcouste/google-datacatalog-dbt-tag
Update a Google Data Catalog tag with dbt Cloud run metadata
Language: Python - Size: 401 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

jornh/amundsen Fork of amundsen-io/amundsen
Just a personal fork
Size: 3.24 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

victorcouste/dataprep-datacatalog-explorer
Web application to explore BigQuery tables tagged in Google Cloud Data Catalog with Cloud Dataprep tags
Language: HTML - Size: 324 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

CSIRO-enviro-informatics/HBee
An Approximate Deep Spatial Catalog and Search
Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

lpraat/inbq
inbq extracts schema-aware, column-level lineage from multi-statement BigQuery queries.
Language: Rust - Size: 437 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

hackolade/glue
Hackolade (https://hackolade.com) plugin for AWS Glue Data Catalog
Language: JavaScript - Size: 22.9 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 8

krista-t/data-catalog
Data Catalog documentation, from terms and explanation to strategic roadmap for implementation. Published to Github pages from my Obsidian via Quartz plugin.
Language: TypeScript - Size: 10.7 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

ev2900/DataZone_Demo
Prebuilt demo of Amazon DataZone using fake data for Pharmaceutical drug discovery
Language: Python - Size: 425 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

alankrantas/loc-documentation
End-user documentation for FST Network's Logic Operating Centre (LOC), a serverless SaaS data product platform
Language: MDX - Size: 17.6 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

SciCatProject/scicatproject.github.io
SciCat Data Catalogue Documentation Website
Language: HTML - Size: 44.8 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 4

CIAT-DAPA/agrilac_catalogue
CMS portal focused on sharing agroclimatic information
Language: HTML - Size: 5.52 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

jonahkeegan/data-dictionary-agency
A specialized data analysis tool designed to automatically scan GitHub repositories for structured data files, extract their schemas, map relationships between tables, and generate comprehensive documentation with interactive visualizations.
Language: Python - Size: 631 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

SwissOpenEM/swissopenem.github.io
Project website
Language: HTML - Size: 28.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 2

OrangeBoatPencil/OpenMetadata Fork of open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Size: 1.73 GB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

garethcmurphy/SciCat-HDF5-Import-Tool
SciCat HDF5 Import Tool 🐍📊 This repository provides a tool for importing HDF5 raw data into the SciCat data catalog used at the European Spallation Source (ESS). It also supports metadata harvesting to ensure comprehensive data cataloging. Features: HDF5 Import, which automatically imports raw data files into SciCat.
Language: Python - Size: 50.8 KB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

garethcmurphy/Photon-and-Neutron-Scientific-Dataset-Search
Photon and Neutron Scientific Dataset Search 🌌🔬 This repository provides a LoopBack-based FAIR Data API for searching photon and neutron scientific datasets from the PaNOSC (Photon and Neutron Open Science Cloud). It connects researchers to data from 15 PaNOSC institutes, enabling seamless discovery and access.
Language: JavaScript - Size: 58.6 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

garethcmurphy/Inquire-Docker-Setup
Inquire Docker Setup 🐳✨ This repository provides a Docker setup for deploying the Inquire Data Catalogue App, featuring a React frontend and a FastAPI backend. The setup includes all necessary Python package dependencies, such as `pydantic` and others, for seamless deployment.
Language: Dockerfile - Size: 27.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lasyakonduru/superstore-sales-data-analysis
Analysis of sales performance and operational efficiency in a superstore using AWS Athena and QuickSight
Size: 2.25 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ruisdael-observatory/Ruisdael-Data-Catalog
Repository to aid the creation of the Ruisdael Data Catalog - documentation, research and scripts
Language: Shell - Size: 546 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

DieGit0/data_realtime_-_batch_analytics
Data Streaming and Batch processing using AWS Services
Language: Python - Size: 13.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0
