GitHub topics: data-harmonization
SCAI-BIO/kitsune
Kitsune is a next-generation data steward and harmonization tool.
Language: TypeScript - Size: 10.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 1

harmonydata/harmonyapi
This is the source code for the Harmony project REST API
Language: Python - Size: 99.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 9

harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and questionnaire items. Open source.
Language: Jupyter Notebook - Size: 23.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 26 - Forks: 39

VIDA-NYU/bdi-kit
A Python toolkit for biomedical data integration and harmonization
Language: Python - Size: 45.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 10 - Forks: 3

SCAI-BIO/datastew
Python library for intelligent data stewardship using Large Language Model (LLM) embeddings
Language: Python - Size: 1.66 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 5 - Forks: 0

NPLinker/nplinker
A python framework for microbial natural products data mining by integrating genomics and metabolomics data
Language: Python - Size: 116 MB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 18 - Forks: 13

harmonydata/harmonydata.github.io
Blog for NLP data harmonisation project Harmony, open source solution using Python for psychologists
Language: HTML - Size: 207 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 1

VIDA-NYU/bdi-viz
Language: Python - Size: 733 KB - Last synced at: 6 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0

datasnack/datahub
Self-hostable, open-source engine for reproducible data harmonization, dataset building & exploration
Language: Python - Size: 11.7 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 2

cidgoh/pathogen-genomics-package
This is the DataHarmonizer spreadsheet web application bundled with pathogen genomics data entry and validation templates
Language: HTML - Size: 27 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 4

finjahasi/clinical-text-mining_R_SCRIPT
A lightweight R script for text mining and harmonizing medical phenotype data. Cleans, standardizes, and maps diagnoses to ICD-10 codes, with clinical annotations for enhanced data usability.
Language: R - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SCAI-BIO/tsnepad
AD & PD cohort variable distributions
Language: HTML - Size: 5.63 MB - Last synced at: 25 days ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

jcaperella29/clinical-text-mining_R_SCRIPT
A lightweight R script for text mining and harmonizing medical phenotype data. Cleans, standardizes, and maps diagnoses to ICD-10 codes, with clinical annotations for enhanced data usability.
Language: R - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

pha4ge/hAMRonization
Parse multiple Antimicrobial Resistance Analysis Reports into a common data structure
Language: Python - Size: 5.49 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 145 - Forks: 30

harmonydata/harmony_examples
Example Jupyter notebook and R scripts using Harmony in real research problems
Language: HTML - Size: 764 KB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 2

menicgiulia/CPIExtract
Language: Jupyter Notebook - Size: 27.3 MB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 1

CoAxLab/pycombat
Python implementation of Combat for data harmonisation, allowing also to remove unwanted effects
Language: Jupyter Notebook - Size: 44.7 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 6

ncsuSEAL/McGregor-et-al-2024
Code and sample data for McGregor et al., 2024
Language: R - Size: 16 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

SCAI-BIO/ad-mapper
Language: Python - Size: 11.4 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

lodi-m/piccard
Visualizing demographic evolution using geographically inconsistent census data
Language: Python - Size: 4.68 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

syedmfuad/geospatial_misc
Miscellaneous codes for harmonizing agricultural output and other agri-related data raster files and shapefiles. Extracts from raster files the grid-cell data by shapefile boundary.
Language: R - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

maelstrom-research/ipaq
International Physical Activity Questionnaire (IPAQ) variables
Language: R - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 2

NajiaAhmadi/VisualisationWithPython
Graphics for the article "Methods used in the development of Common Data Models for health data – a Scoping Review"
Language: Jupyter Notebook - Size: 3.06 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dfornika/amrhike
Proof-of-concept for storing and querying harmonized AMR Genomic Analysis Results in datahike
Language: Clojure - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0
