Topic: "data-integration"
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language: Python - Size: 382 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 40,370 - Forks: 15,104

airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Language: Python - Size: 677 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 18,320 - Forks: 4,554

Avaiga/taipy
Turns Data and AI algorithms into production-ready web applications in no time.
Language: Python - Size: 151 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 18,107 - Forks: 1,886

dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
Language: Python - Size: 1.26 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 13,302 - Forks: 1,702

apache/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Language: Java - Size: 43.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8,548 - Forks: 1,991

mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Language: Python - Size: 234 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 8,358 - Forks: 847

cloudquery/cloudquery
The developer first cloud governance platform
Language: Go - Size: 173 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 6,108 - Forks: 527

apache/flink-cdc
Flink CDC is a streaming data integration tool
Language: Java - Size: 41.1 MB - Last synced at: 1 day ago - Pushed at: 7 days ago - Stars: 6,081 - Forks: 2,012

apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
Language: Java - Size: 1.81 GB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 5,816 - Forks: 2,406

infinyon/fluvio
🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.
Language: Rust - Size: 34.4 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 4,937 - Forks: 517

jitsucom/jitsu
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Language: TypeScript - Size: 42.1 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 4,305 - Forks: 311

rudderlabs/rudder-server
Privacy and Security focused Segment-alternative, in Golang and React
Language: Go - Size: 309 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 4,200 - Forks: 337

DTStack/chunjun
A data integration framework
Language: Java - Size: 126 MB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 4,052 - Forks: 1,695

seandavi/awesome-single-cell
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Size: 1.43 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 3,388 - Forks: 1,021

bruin-data/ingestr
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Language: Python - Size: 168 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,966 - Forks: 83

apache/incubator-devlake
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
Language: Go - Size: 38.5 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2,747 - Forks: 582

mara/mara-pipelines
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Language: Python - Size: 3.29 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 2,080 - Forks: 100

bytedance/bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
Language: Java - Size: 26.4 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 1,666 - Forks: 333

apache/hop
Hop Orchestration Platform
Language: Java - Size: 198 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 1,151 - Forks: 376

kuwala-io/kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times
Language: JavaScript - Size: 7.79 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 792 - Forks: 54

apache/seatunnel-web
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Language: Java - Size: 17.4 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 687 - Forks: 307

artie-labs/transfer
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.
Language: Go - Size: 4.02 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 654 - Forks: 33

immunogenomics/harmony
Fast, sensitive and accurate integration of single-cell data with Harmony
Language: R - Size: 52.9 MB - Last synced at: about 21 hours ago - Pushed at: 7 months ago - Stars: 576 - Forks: 102

leesf/hudi-resources
汇总Apache Hudi相关资料
Size: 23.7 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 552 - Forks: 160

saeyslab/nichenetr
NicheNet: predict active ligand-target links between interacting cells
Language: R - Size: 152 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 540 - Forks: 125

ConduitIO/conduit
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
Language: Go - Size: 13.1 MB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 523 - Forks: 55

theislab/scarches
Reference mapping for single-cell genomics
Language: Jupyter Notebook - Size: 825 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 365 - Forks: 59

gabledata/recap
Work with your web service, database, and streaming schemas in a single format.
Language: Python - Size: 1.43 MB - Last synced at: about 21 hours ago - Pushed at: about 21 hours ago - Stars: 343 - Forks: 26

CategoricalData/CQL
Categorical Query Language IDE
Language: Java - Size: 145 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 299 - Forks: 22

cuebook/cuelake
Use SQL to build ELT pipelines on a data lakehouse.
Language: JavaScript - Size: 28 MB - Last synced at: 18 days ago - Pushed at: about 3 years ago - Stars: 288 - Forks: 28

hetio/hetionet
Hetionet: an integrative network of disease
Language: HTML - Size: 380 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 285 - Forks: 69

pracdata/awesome-open-source-data-engineering
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Size: 219 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 274 - Forks: 29

CommonCoreOntology/CommonCoreOntologies
The Common Core Ontology Repository holds the current released version of the Common Core Ontology suite.
Language: Makefile - Size: 16.7 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 234 - Forks: 61

dataplane-app/dataplane
Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.
Language: JavaScript - Size: 281 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 226 - Forks: 33

slowkow/harmonypy
🎼 Integrate multiple high-dimensional datasets with fuzzy k-means and locally linear adjustments.
Language: Python - Size: 2.77 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 218 - Forks: 24

morph-kgc/morph-kgc
Powerful RDF Knowledge Graph Generation with RML Mappings
Language: Python - Size: 32.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 215 - Forks: 41

opensanctions/nomenklatura
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
Language: Python - Size: 5.98 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 210 - Forks: 38

mara/mara-example-project-2
An example mini data warehouse for python project stats, template for new projects
Language: Python - Size: 24 MB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 178 - Forks: 39

ceumicrodata/mETL
mito ETL tool
Language: Python - Size: 7.43 MB - Last synced at: 30 days ago - Pushed at: about 4 years ago - Stars: 163 - Forks: 41

atrocore/atrocore
AtroCore is an open-source Data Platform, Data Management and Master Data Management (MDM) software, which can be used to quickly create any business application.
Language: JavaScript - Size: 107 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 157 - Forks: 42

mims-harvard/scikit-fusion
scikit-fusion: Data fusion via collective latent factor models
Language: Python - Size: 9.28 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 148 - Forks: 44

google/megalista 📦
First Party data integration solution built for marketing teams to enable audience and conversion onboarding into Google Marketing products (Google Ads, Campaign Manager, Google Analytics).
Language: Python - Size: 1.34 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 137 - Forks: 55

genular/pandora
PANDORA :computer:
Language: Vue - Size: 16.4 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 135 - Forks: 21

SDM-TIB/SDM-RDFizer
An Efficient RML-Compliant Engine for Knowledge Graph Construction
Language: Python - Size: 21.2 MB - Last synced at: 12 days ago - Pushed at: 16 days ago - Stars: 119 - Forks: 25

starlake-ai/starlake
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
Language: Scala - Size: 170 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 117 - Forks: 22

Teichlab/cellhint
A tool for semi-automatic cell type harmonization and integration
Language: Python - Size: 6.78 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 106 - Forks: 14

olehmberg/winter
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
Language: Java - Size: 18.6 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 105 - Forks: 32

thedataengineeringbook/thedataengineeringbook
The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย
Language: JavaScript - Size: 1.54 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 103 - Forks: 43

delftdata/valentine
A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
Language: Python - Size: 112 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 88 - Forks: 26

runprism/prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
Language: Python - Size: 2.42 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 85 - Forks: 2

SysBioChalmers/GECKO
Toolbox for including enzyme constraints on a genome-scale model.
Language: MATLAB - Size: 107 MB - Last synced at: 14 days ago - Pushed at: 28 days ago - Stars: 71 - Forks: 52

paloaltodatabases/sequor
Sequor is a SQL-centric platform for building API integrations without lock-in and black boxes. Fuses API execution with SQL logic to provide an open, flexible platform for all your data and app integrations.
Language: Python - Size: 171 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 64 - Forks: 1

saezlab/cosmosR
COSMOS (Causal Oriented Search of Multi-Omic Space) is a method that integrates phosphoproteomics, transcriptomics, and metabolomics data sets.
Language: R - Size: 53.2 MB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 61 - Forks: 16

munchy-bytes/SchemaMapper
A .NET class library that allows you to import data from different sources into a unified destination
Language: C# - Size: 5.9 MB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 60 - Forks: 16

jupyter-naas/drivers
Low-code Python library enabling access to APIs, tools, data sources in seconds.
Language: Python - Size: 1.53 MB - Last synced at: 21 days ago - Pushed at: 11 months ago - Stars: 59 - Forks: 13

linkml/linkml-model
Link Modeling Language (LinkML) model
Language: Python - Size: 13.1 MB - Last synced at: 13 days ago - Pushed at: 16 days ago - Stars: 53 - Forks: 20

siyul-park/uniflow
A high-performance, extremely flexible, and easily extensible universal workflow engine.
Language: Go - Size: 3.09 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 52 - Forks: 5

CogStack/CogStack-NiFi
Building data processing pipelines for documents processing with NLP using Apache NiFi and related services
Language: Python - Size: 97 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 51 - Forks: 19

DP6/marketing-data-sync Fork of google/megalista
First Party data integration solution built for marketing teams to enable audience and conversion onboarding into Google Marketing products and Facebook Ads.
Language: Python - Size: 959 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 49 - Forks: 6

datasphere-oss/datasphere-integration
an data-centric integration platform
Language: Java - Size: 20.7 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 48 - Forks: 17

umer7/Data-Warehouse-Concepts-Design-and-Data-Integration
Repo for Data Warehouse Concepts, Design, and Data Integration by University of Colorado System (coursera)(Notes,Assignments, quiz and research papers)
Size: 35 MB - Last synced at: 6 months ago - Pushed at: about 7 years ago - Stars: 45 - Forks: 32

neuroforgede/nfcompose
Build REST APIs/Integrations in minutes instead of hours - NF Compose is a (data) integration platform that allows developers to define REST APIs in seconds instead of hours. Generated REST APIs are backed by postgres and support automatic consumer webhook notifications on data changes out of the box.
Language: Python - Size: 2.57 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 39 - Forks: 3

Azure/data-product-batch
Template to deploy a Data Product for Batch data processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
Language: Bicep - Size: 11.3 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 38 - Forks: 22

mara/mara-etl-tools
Utilities for creating ETL pipelines with mara
Language: PLpgSQL - Size: 54.7 KB - Last synced at: 27 days ago - Pushed at: about 3 years ago - Stars: 36 - Forks: 4

Azure/data-product-streaming
Template to deploy a Data Product for data stream processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
Language: Bicep - Size: 12.1 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 35 - Forks: 12

AltschulerWu-Lab/MUSE
MUSE is a deep learning approach characterizing tissue composition through combined analysis of morphologies and transcriptional states for spatially resolved transcriptomics data.
Language: Jupyter Notebook - Size: 153 MB - Last synced at: 12 days ago - Pushed at: about 3 years ago - Stars: 35 - Forks: 8

linkedin/data-integration-library
The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.
Language: Java - Size: 1.53 MB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 32 - Forks: 15

selbouhaddani/OmicsPLS
R package for High dimensional data analysis and integration with O2PLS!
Language: HTML - Size: 31.4 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 32 - Forks: 8

DerwenAI/ERKG
Demonstrate integration of Senzing and Neo4j to construct an Entity Resolved Knowledge Graph
Size: 13.9 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 32 - Forks: 6

JonnyTran/OpenOmics
A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.
Language: Python - Size: 68.5 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 32 - Forks: 11

oeg-upm/mapeathor
Translator of spreadsheet mappings into R2RML, RML or YARRRML
Language: Python - Size: 58.8 MB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 32 - Forks: 10

dhimmel/integrate
Scripts and resources to create Hetionet v1.0, a heterogeneous network for drug repurposing
Language: Jupyter Notebook - Size: 565 MB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 32 - Forks: 17

zazuko/barnard59
An intuitive and flexible RDF pipeline solution designed to simplify and automate ETL processes for efficient data management.
Language: JavaScript - Size: 3.66 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 31 - Forks: 2

DTUComputeStatisticsAndDataAnalysis/MBPLS
(Multiblock) Partial Least Squares Regression for Python
Language: Python - Size: 16.6 MB - Last synced at: 1 day ago - Pushed at: over 5 years ago - Stars: 31 - Forks: 7

YangLabHKUST/Portal
Adversarial domain translation networks for integrating large-scale atlas-level single-cell datasets
Language: Python - Size: 119 KB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 30 - Forks: 6

thymeflow/thymeflow
Installer for Thymeflow, a personal knowledge management system.
Size: 20.5 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 30 - Forks: 5

raamana/pyradigm
Research data management in biomedical and machine learning applications
Language: Python - Size: 7.25 MB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 29 - Forks: 12

cthoyt/doctoral-thesis
📖 Generation and Applications of Knowledge Graphs in Systems and Networks Biology
Language: TeX - Size: 68.6 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 29 - Forks: 2

dosorio/rPanglaoDB
An R package to download and merge labeled single-cell RNA-seq data from the PanglaoDB database into a Seurat object.
Language: HTML - Size: 2.24 MB - Last synced at: about 4 hours ago - Pushed at: about 2 years ago - Stars: 27 - Forks: 3

ginkgobioworks/geckopy
Enzyme-constrained genome-scale models in python
Language: Python - Size: 4.84 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 26 - Forks: 7

cloudquery/plugin-sdk
CloudQuery Go SDK for source and destination plugins
Language: Go - Size: 18.2 MB - Last synced at: about 23 hours ago - Pushed at: 1 day ago - Stars: 24 - Forks: 25

glasgowcompbio/pyMultiOmics
Python toolbox for multi-omics data mapping and analysis
Language: Jupyter Notebook - Size: 45.9 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 5

davidfoerster/schema-matching
Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can contact me for questions and I may even add docs, if I sense enough interest.
Language: Python - Size: 271 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 24 - Forks: 8

JinmiaoChenLab/FastIntegration
FastIntegrate integrates thousands of scRNA-seq datasets and outputs batch-corrected values for downstream analysis
Language: R - Size: 2.37 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 23 - Forks: 4

shuxiaoc/mario-py
MARIO: single-cell proteomic data matching and integration using both shared and distinct features
Language: Jupyter Notebook - Size: 660 MB - Last synced at: 20 days ago - Pushed at: almost 2 years ago - Stars: 23 - Forks: 2

abcsys/libem
Compound AI toolchain for fast and accurate entity matching, powered by LLMs.
Language: Python - Size: 3.54 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 4

yezhengSTAT/ADTnorm
ADTnorm normalizes the cell surface protein measurement of CITE-seq data, facilitating across batches and across studies data integration.
Language: R - Size: 48.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 22 - Forks: 5

bio2bel/bio2bel
A Python framework for integrating biological databases and structured data sources in Biological Expression Language (BEL)
Language: Python - Size: 417 KB - Last synced at: 27 days ago - Pushed at: over 3 years ago - Stars: 21 - Forks: 5

NPLinker/nplinker
A python framework for microbial natural products data mining by integrating genomics and metabolomics data
Language: Python - Size: 116 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 20 - Forks: 13

JohnnyBravo75/DataBridge.NET
Configurable data bridge for permanent ETL jobs
Language: C# - Size: 11.1 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 10

Amine-Smahi/R-Learning-Journey
Some of the projects i made when starting to learn R for Data Science at the university
Language: R - Size: 63.5 KB - Last synced at: 2 months ago - Pushed at: almost 6 years ago - Stars: 20 - Forks: 0

CloudFormations/CF.Cumulus
A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away the complexity, giving you the power to build, scale, and manage your dataflows with ease, accelerating data delivery.
Language: TSQL - Size: 10.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 19 - Forks: 12

caokai1073/Pamona
The software of Pamona, a partial manifold alignment algorithm.
Language: Jupyter Notebook - Size: 41.4 MB - Last synced at: 19 days ago - Pushed at: about 4 years ago - Stars: 19 - Forks: 3

oeg-upm/gtfs-bench
GTFS-Madrid-Bench: A Benchmark for Knowledge Graph Construction Engines
Language: Python - Size: 197 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 18 - Forks: 13

scify/jedai-ui
UI for JedAI Toolkit
Language: Java - Size: 1.09 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 5

BioDWH2/BioDWH2
BioDWH2 is an easy-to-use, automated, graph-based data warehouse and mapping tool for bioinformatics and medical informatics.
Language: Java - Size: 6.9 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 16 - Forks: 14

cutterkom/remove-na-lgbtiq-queer-knowledge-graph
A knowledge graph on queer history
Language: R - Size: 9.45 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 16 - Forks: 1

alexkychen/assignPOP
Population Assignment using Genetic, Non-genetic or Integrated Data in a Machine-learning Framework. Methods in Ecology and Evolution. 2018;9:439–446.
Language: R - Size: 8.81 MB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 4

NYXFLOWER/GripNet
GripNet: Graph Information Propagation on Supergraph for Heterogeneous Graphs (PatternRecognit, 2023)
Language: Python - Size: 88 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 16 - Forks: 2

lisad/phaser
The missing layer for complex data batch integration pipelines
Language: Python - Size: 552 KB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 14 - Forks: 1
