GitHub topics: heterogeneous-computing
array2d/deepx
Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support
Language: C++ - Size: 1.82 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 42 - Forks: 4

pulp-platform/carfield
A mixed-criticality platform built around Cheshire, with a number of safety/security and predictability features. Ready-to-use FPGA flow on multiple boards is available.
Language: Tcl - Size: 2.21 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 96 - Forks: 18

arc-research-lab/CHARM
CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture
Language: C++ - Size: 169 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 140 - Forks: 22

ThomasAtlantis/DistPipe
A distributed framework to implement device-cloud collaborative workflow
Language: Python - Size: 5.86 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

fangq/mcxcl
Monte Carlo eXtreme for OpenCL (MCXCL)
Language: C - Size: 6.66 MB - Last synced at: 5 days ago - Pushed at: 18 days ago - Stars: 43 - Forks: 29

Vincent-Therrien/gpu-arena
Compare and test GPU programming frameworks
Language: C++ - Size: 3.52 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 109 - Forks: 8

taskflow/awesome-parallel-computing
A curated list of awesome parallel computing resources
Size: 3.41 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 722 - Forks: 68

cjmcv/ai-infra-notes
Reading notes on the open source code of AI infrastructure (sglang, llm, cutlass, hpc, etc.)
Size: 777 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

arc-research-lab/AIM
AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP (full paper accepted to ICCAD 2023)
Language: C++ - Size: 427 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 4

IntelPython/DPEP
Data Parallel Extensions for Python*
Language: Jupyter Notebook - Size: 8.36 MB - Last synced at: 22 days ago - Pushed at: about 2 months ago - Stars: 35 - Forks: 8

pulp-platform/hero
Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and an application-class host CPU, including full-stack software and hardware.
Language: SystemVerilog - Size: 61.8 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 101 - Forks: 25

QuantumBFS/YaoCompiler.jl
The Yao compiler project
Language: Julia - Size: 1.68 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 21 - Forks: 7

Heteroflow/Heteroflow
Concurrent CPU-GPU Programming using Task Models
Language: C++ - Size: 1.58 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 101 - Forks: 13

JuliaGPU/DaggerGPU.jl
GPU integrations for Dagger.jl
Language: Julia - Size: 61.5 KB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 52 - Forks: 11

APPFL/FedCompass
[ICLR 2024] FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices Using a Computing Power-Aware Scheduler
Language: Python - Size: 411 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 4

LLNL/thicket
Language: JavaScript - Size: 86.6 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 9

pulp-platform/astral Fork of pulp-platform/carfield
A space computing platform built around Cheshire, with a configurable number of safety, security, reliability and predictability features and a ready-to-use FPGA flow on multiple boards.
Language: Tcl - Size: 93.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 4

PlatformAwareProgramming/PlatformAware.jl
Platform-aware programming in Julia
Language: Julia - Size: 1.56 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 13 - Forks: 1

XFluids/XFluids
A unified cross-architecture heterogeneous CFD solver
Language: C++ - Size: 6.03 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 23 - Forks: 7

tugrul512bit/libGPGPU
Multi-GPU & CPU OpenCL kernel executor with load-balancing as if there is one big GPU.
Language: C++ - Size: 2.09 MB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 9 - Forks: 2

bert13069598/LoadBalancing
Efficient Resource Sharing across Heterogeneous Computing
Language: C++ - Size: 69.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

a-sidorova/gpu_opencl_cource
Course Programming on new Architecture-1 (GPU), autumn 2021
Language: C++ - Size: 42 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ParCoreLab/BeyondMoore
BeyondMoore has an ambitious goal to develop a software framework that performs static and dynamic optimizations, issues accelerator-initiated data transfers, and reasons about parallel execution strategies that exploit both processor and memory heterogeneity.
Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

itzmeanjan/merklize-sha
SYCL accelerated Binary Merklization using SHA1, SHA2 & SHA3
Language: C++ - Size: 243 KB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 0

eismont21/AlexLens Fork of FriedemannClaus/AlexLens
AlexLens is a comprehensive Image Classification and Transfer Learning application, specifically designed for heterogeneous computing platforms. It features a custom-built AlexNet Neural Network for in-depth analysis and learning.
Language: C++ - Size: 14.9 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

EduardChou/Accelerated-Dynamic-State-Estimation-Toolbox
Power System Dynamic State Estimation Based on Heterogeneous Computing Acceleration
Language: Python - Size: 7.57 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

xqoasis/GPU-Profiling-with-N-body-problem
GPU profiling through CUDA with N-body problem
Language: Cuda - Size: 446 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ut-parla/Parla.py
A Python based programming system for heterogeneous computing
Language: Python - Size: 31 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 9

mmattioli/OpenCL-Adventures
Learning how to design heterogeneous compute applications using OpenCL with an emphasis on GPU acceleration
Language: C++ - Size: 43.9 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

Choimoe/cylinder-flow-simulation
Heterogeneous Computation for 2D Cylinder Flow Simulation
Language: C++ - Size: 81.1 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

amamory-ampere/openmp-offloading
Repository for testing OpenMP offloading to an NVIDIA GPU
Language: C - Size: 8.4 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

cjmcv/ecas
ECAS is a library for edge AI computing acceleration.
Language: C++ - Size: 598 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

zhangyachen/ComputerArchitectureAndCppBooks
📚 A collection of computer architecture and C++ books (continuously updated)
Size: 36.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 33 - Forks: 7

lukastruemper/patterntree 📦
PatternTree is a high-performance optimization framework for heterogeneous computing architectures
Language: C++ - Size: 85 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

olutosinbanjo/direction_field
Plotting a Direction Field with Python (Intel® DevMesh Project)
Language: Python - Size: 6.19 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Tolisz/blurhash
blurhash algorithm implemented on GPU (OpenCL)
Language: C - Size: 2.42 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

UCLA-SEAL/HeteroGen
HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair (ASPLOS 2022)
Language: Python - Size: 31 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

makotonakai/heqsim
Heterogeneous quantum computing simulator
Language: Python - Size: 463 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

g4m3r0/MagmaDNN-Benchmarsuite
MagmaDNN benchmark suite for heterogeneous architectures
Language: C++ - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

amamory/power-optim
MiniZinc model of power-aware task placement onto a heterogeneous platform (big.LITTLE, GPU, FPGA)
Language: Python - Size: 446 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

UCLA-SEAL/HeteroFuzz
Fuzz Testing to Detect Platform Dependent Divergence for Heterogeneous Applications (FSE 2021)
Language: C - Size: 1.31 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

whitelok/gpu-computation-gems-codes
Practical example code for GPU Computing Gems Jade Edition (2012)
Language: Cuda - Size: 155 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rohankumar42/delta
A fluid framework for heterogeneous function execution
Language: Python - Size: 152 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

milladgit/gecko
Gecko - A programming model for heterogeneous environments
Language: C++ - Size: 355 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

milladgit/deepcopy-benchmark
A set of microbenchmarks for deep copy in directive-based programming models
Language: C++ - Size: 170 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0
