Topic: "hpc"
JuliaLang/julia
The Julia Programming Language
Language: Julia - Size: 314 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 46,816 - Forks: 5,580

hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python - Size: 62.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 40,788 - Forks: 4,495

spack/spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
Language: Python - Size: 215 MB - Last synced at: about 19 hours ago - Pushed at: about 19 hours ago - Stars: 4,624 - Forks: 2,384

volcano-sh/volcano
A Cloud Native Batch System (Project under CNCF)
Language: Go - Size: 83.4 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 4,587 - Forks: 1,050

arrayfire/arrayfire
ArrayFire: a general purpose GPU library.
Language: C++ - Size: 17.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4,559 - Forks: 534

ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
Language: C++ - Size: 21.1 MB - Last synced at: 8 days ago - Pushed at: 18 days ago - Stars: 4,375 - Forks: 375

nextflow-io/nextflow
A DSL for data-driven computational pipelines
Language: Groovy - Size: 58 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,909 - Forks: 680

apptainer/singularity
Singularity has been renamed to Apptainer as part of us moving the project to the Linux Foundation. This repo has been persisted as a snapshot right before the changes.
Language: Go - Size: 34.8 MB - Last synced at: 9 minutes ago - Pushed at: over 2 years ago - Stars: 2,563 - Forks: 427

diku-dk/futhark
💥💻💥 A data-parallel functional programming language
Language: Haskell - Size: 49.4 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 2,498 - Forks: 174

open-mpi/ompi
Open MPI main development repository
Language: C - Size: 174 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 2,324 - Forks: 893

flame/blis
BLAS-like Library Instantiation Software Framework
Language: C - Size: 47.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2,288 - Forks: 366

zml/zml
Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild
Language: Zig - Size: 2.11 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 2,197 - Forks: 78

ChunelFeng/CGraph
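A commonly used C++/Python DAG framework: a general-purpose, cross-platform parallel computing framework based on flow graphs, with no third-party dependencies, listed in awesome-cpp. Stars, forks, and discussion are welcome.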
Language: C++ - Size: 5.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,959 - Forks: 349

mfem/mfem
Lightweight, general, scalable C++ library for finite element methods
Language: C++ - Size: 236 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,865 - Forks: 523

chapel-lang/chapel
a Productive Parallel Programming Language
Language: Chapel - Size: 1000 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,862 - Forks: 426

DTolm/VkFFT
Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library
Language: C++ - Size: 38 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,612 - Forks: 104

NVIDIA/cccl
CUDA Core Compute Libraries
Language: C++ - Size: 79.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,602 - Forks: 208

boostorg/compute
A C++ GPU Computing Library for OpenCL
Language: C++ - Size: 8.32 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 1,598 - Forks: 336

AdaptiveCpp/AdaptiveCpp
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
Language: C++ - Size: 14 MB - Last synced at: 9 days ago - Pushed at: 11 days ago - Stars: 1,585 - Forks: 195

indigo-dc/udocker
A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.
Language: Python - Size: 6.4 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 1,455 - Forks: 138

su2code/SU2
SU2: An Open-Source Suite for Multiphysics Simulation and Design
Language: C++ - Size: 899 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,450 - Forks: 856

NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Language: C++ - Size: 39.1 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 1,311 - Forks: 96

openucx/ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Language: C - Size: 74.2 MB - Last synced at: about 16 hours ago - Pushed at: about 17 hours ago - Stars: 1,305 - Forks: 454

apptainer/apptainer
Apptainer: Application containers for Linux
Language: Go - Size: 39.8 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1,282 - Forks: 148

trilinos/Trilinos
Primary repository for the Trilinos Project
Language: C++ - Size: 803 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,271 - Forks: 582

jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Language: C++ - Size: 56.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,174 - Forks: 60

kubernetes-retired/kube-batch 📦
A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
Language: Go - Size: 44.1 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 1,069 - Forks: 265

gunrock/gunrock
Programmable CUDA/C++ GPU Graph Analytics
Language: C++ - Size: 74.6 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 1,017 - Forks: 205

broadinstitute/cromwell
Scientific workflow engine designed for simplicity & scalability. Trivially transition from one-off use cases to massive-scale production environments
Language: Scala - Size: 47.3 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1,011 - Forks: 361

Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language: Cuda - Size: 1.25 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 997 - Forks: 152

ashvardanian/less_slow.cpp
Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO
Language: C++ - Size: 2.72 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 989 - Forks: 43

futureverse/future
🚀 R package: future: Unified Parallel and Distributed Processing in R for Everyone
Language: R - Size: 12.3 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 975 - Forks: 88

RaftLib/RaftLib
The RaftLib C++ library, streaming/dataflow concurrency via C++ iostream-like operators
Language: C++ - Size: 23.6 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 950 - Forks: 125

cp2k/cp2k
Quantum chemistry and solid state physics software package
Language: Fortran - Size: 260 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 925 - Forks: 408

openhpc/ohpc
OpenHPC Integration, Packaging, and Test Repo
Language: C - Size: 32.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 904 - Forks: 196

sylabs/singularity
SingularityCE is the Community Edition of Singularity, an open source container platform designed to be simple, fast, and secure.
Language: Go - Size: 45.1 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 834 - Forks: 103

trevor-vincent/awesome-high-performance-computing
A curated list of awesome high performance computing resources
Size: 2.94 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 834 - Forks: 86

arrayfire/arrayfire-rust
Rust wrapper for ArrayFire
Language: Rust - Size: 18.4 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 826 - Forks: 59

AgnostiqHQ/covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
Language: Python - Size: 370 MB - Last synced at: 7 days ago - Pushed at: 20 days ago - Stars: 820 - Forks: 103

chrisvoncsefalvay/learn-julia-the-hard-way
Learn Julia the hard way!
Language: Makefile - Size: 558 KB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 779 - Forks: 151

envmodules/modules
Environment Modules: provides dynamic modification of a user's environment
Language: Tcl - Size: 22 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 763 - Forks: 113

romeric/Fastor
A lightweight high performance tensor algebra framework for modern C++
Language: C++ - Size: 3.2 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 751 - Forks: 69

sslotin/amh-code
Complete implementations from "Algorithms for Modern Hardware"
Language: Jupyter Notebook - Size: 8.86 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 744 - Forks: 47

hypre-space/hypre
Parallel solvers for sparse linear systems featuring multigrid methods.
Language: C - Size: 216 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 740 - Forks: 205

openpbs/openpbs
An HPC workload manager and job scheduler for desktops, clusters, and clouds.
Language: C - Size: 39.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 726 - Forks: 341

nndeploy/nndeploy
nndeploy is an end-to-end model inference and deployment framework. It aims to provide users with a powerful, easy-to-use, high-performance model inference and deployment experience that is compatible with mainstream frameworks.
Language: C++ - Size: 30.3 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 722 - Forks: 106

uxlfoundation/oneMath
oneAPI Math Library (oneMath)
Language: C++ - Size: 11.6 MB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 666 - Forks: 165

zanellia/prometeo
An experimental Python-to-C transpiler and domain specific language for embedded high-performance computing
Language: Python - Size: 1.93 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 632 - Forks: 33

open-mpi/hwloc
Hardware locality (hwloc)
Language: C - Size: 38.6 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 611 - Forks: 178

pmodels/mpich
Official MPICH Repository
Language: C - Size: 87.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 598 - Forks: 293

devitocodes/devito
DSL and compiler framework for automated finite-differences and stencil computation
Language: Python - Size: 237 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 595 - Forks: 234

LLNL/sundials
Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.
Language: C - Size: 244 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 562 - Forks: 144

lablup/backend.ai
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.
Language: Python - Size: 603 MB - Last synced at: about 6 hours ago - Pushed at: about 6 hours ago - Stars: 555 - Forks: 161

warewulf/warewulf
Warewulf is a stateless and diskless container operating system provisioning system for large clusters of bare metal and/or virtual systems.
Language: Go - Size: 25.4 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 540 - Forks: 100

openhackathons-org/gpubootcamp
This repository consists of GPU bootcamp material for HPC and AI
Language: Jupyter Notebook - Size: 261 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 532 - Forks: 256

TACC/Lmod
Lmod: An Environment Module System based on Lua, Reads TCL Modules, Supports a Software Hierarchy
Language: Lua - Size: 36.1 MB - Last synced at: 10 days ago - Pushed at: 27 days ago - Stars: 527 - Forks: 131

PySlurm/pyslurm
Python Interface to Slurm
Language: Cython - Size: 5.39 MB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 510 - Forks: 120

haptork/easyLambda
distributed dataflows with functional list operations for data processing with C++14
Language: C++ - Size: 1.96 MB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 497 - Forks: 43

easybuilders/easybuild
EasyBuild - building software with ease
Language: Shell - Size: 585 MB - Last synced at: 9 days ago - Pushed at: 13 days ago - Stars: 483 - Forks: 145

NVIDIA/hpc-container-maker
HPC Container Maker
Language: Python - Size: 2.06 MB - Last synced at: 9 days ago - Pushed at: 26 days ago - Stars: 475 - Forks: 96

visit-dav/visit
VisIt - Visualization and Data Analysis for Mesh-based Scientific Data
Language: C - Size: 371 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 472 - Forks: 122

3dem/relion
Image-processing software for cryo-electron microscopy
Language: C++ - Size: 58.2 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 465 - Forks: 205

ginkgo-project/ginkgo
Numerical linear algebra software package
Language: C++ - Size: 155 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 453 - Forks: 95

luispedro/jug
Parallel programming with Python
Language: Python - Size: 2.25 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 452 - Forks: 63

oracle/coherence
Oracle Coherence Community Edition
Language: Java - Size: 549 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 443 - Forks: 72

ParRes/Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
Language: C - Size: 23.6 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 427 - Forks: 109

arrayfire/arrayfire-python
Python bindings for ArrayFire: A general purpose GPU library.
Language: Python - Size: 1.59 MB - Last synced at: 19 days ago - Pushed at: almost 2 years ago - Stars: 419 - Forks: 64

libocca/occa
Portable and vendor neutral framework for parallel programming on heterogeneous platforms.
Language: C++ - Size: 17.7 MB - Last synced at: 17 days ago - Pushed at: 27 days ago - Stars: 416 - Forks: 87

blitzpp/blitz
Blitz++ Multi-Dimensional Array Library for C++
Language: C++ - Size: 19.6 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 413 - Forks: 82

easybuilders/easybuild-easyconfigs
A collection of easyconfig files that describe which software to build using which build options with EasyBuild.
Language: Python - Size: 101 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 407 - Forks: 726

It4innovations/hyperqueue
Scheduler for sub-node tasks for HPC systems with batch scheduling
Language: Rust - Size: 22.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 406 - Forks: 30

rackslab/Slurm-web
Open source web interface for Slurm HPC & AI clusters
Language: Python - Size: 15.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 399 - Forks: 102

gem/oq-engine
OpenQuake Engine: a software for Seismic Hazard and Risk Analysis
Language: Python - Size: 1000 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 396 - Forks: 286

tumaer/JAXFLUIDS
Differentiable Fluid Dynamics Package
Language: Python - Size: 12.8 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 391 - Forks: 69

JuliaParallel/MPI.jl
MPI wrappers for Julia
Language: Julia - Size: 4.44 MB - Last synced at: 9 days ago - Pushed at: 21 days ago - Stars: 391 - Forks: 124

xcat2/xcat-core
Code repo for xCAT core packages
Language: Perl - Size: 199 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 380 - Forks: 173

Nek5000/Nek5000
our classic
Language: Fortran - Size: 21.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 377 - Forks: 241

alpaka-group/alpaka
Abstraction Library for Parallel Kernel Acceleration 🦙
Language: C++ - Size: 17.9 MB - Last synced at: 7 days ago - Pushed at: 25 days ago - Stars: 372 - Forks: 76

LLNL/Caliper
Caliper is an instrumentation and performance profiling library
Language: C++ - Size: 8.94 MB - Last synced at: about 11 hours ago - Pushed at: about 11 hours ago - Stars: 371 - Forks: 69

pyccel/pyccel
Python extension language using accelerators
Language: Python - Size: 20 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 362 - Forks: 60

NERSC/shifter
Shifter - Linux Containers for HPC
Language: C - Size: 3.58 MB - Last synced at: about 22 hours ago - Pushed at: about 1 year ago - Stars: 361 - Forks: 65

giovtorres/slurm-docker-cluster
A Slurm cluster using docker-compose
Language: Dockerfile - Size: 40 KB - Last synced at: 14 days ago - Pushed at: 7 months ago - Stars: 360 - Forks: 197

rapier1/hpn-ssh (fork of openssh/openssh-portable)
HPN-SSH based on OpenSSH
Language: C - Size: 92.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 357 - Forks: 47

LLNL/Umpire
An application-focused API for memory management on NUMA & GPU architectures
Language: C++ - Size: 16 MB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 350 - Forks: 53

pipefunc/pipefunc
Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪
Language: Python - Size: 1.86 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 347 - Forks: 14

cnuernber/dtype-next
A Clojure library designed to aid in the implementation of high performance algorithms and systems.
Language: Clojure - Size: 4.06 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 344 - Forks: 21

elasticluster/elasticluster
Create clusters of VMs on the cloud and configure them with Ansible.
Language: Python - Size: 5.7 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 335 - Forks: 150

agenium-scale/nsimd
Agenium Scale vectorization library for CPUs and GPUs
Language: C - Size: 6.92 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 333 - Forks: 30

UoB-HPC/BabelStream
STREAM, for lots of devices written in many programming models
Language: C++ - Size: 2.36 MB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 332 - Forks: 117

QMCPACK/qmcpack
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance portable GPU support
Language: C++ - Size: 395 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 329 - Forks: 147

OSC/ondemand
Supercomputing. Seamlessly. Open, Interactive HPC Via the Web
Language: JavaScript - Size: 32.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 329 - Forks: 126

RRZE-HPC/OSACA
Open Source Architecture Code Analyzer
Language: Jupyter Notebook - Size: 8.12 MB - Last synced at: 7 days ago - Pushed at: 21 days ago - Stars: 319 - Forks: 21

Nek5000/nekRS
our next generation fast and scalable CFD code
Language: C++ - Size: 91.7 MB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 315 - Forks: 82

definelicht/hlslib
A collection of extensions for Vitis and Intel FPGA OpenCL to improve developer quality of life.
Language: C++ - Size: 577 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 311 - Forks: 58

SeisSol/SeisSol
A scientific software for the numerical simulation of seismic wave phenomena and earthquake dynamics
Language: C++ - Size: 308 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 294 - Forks: 116

charmplusplus/charm4py
Parallel Programming with Python and Charm++
Language: Python - Size: 2.42 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 294 - Forks: 22

jeffhammond/HPCInfo
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
Language: C - Size: 6.71 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 290 - Forks: 60

deephyper/deephyper
DeepHyper: Scalable Asynchronous Neural Architecture and Hyperparameter Search for Deep Neural Networks
Language: Python - Size: 44.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 287 - Forks: 61

ornladios/ADIOS2
Next generation of ADIOS developed in the Exascale Computing Program
Language: C++ - Size: 32.2 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 284 - Forks: 129

Azure/batch-shipyard 📦
Simplify HPC and Batch workloads on Azure
Language: Python - Size: 6.15 MB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 277 - Forks: 121
