An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: heterogeneous-computing

array2d/deepx

Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support

Language: C++ - Size: 1.82 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 42 - Forks: 4

pulp-platform/carfield

A mixed-criticality platform built around Cheshire, with a number of safety/security and predictability features. Ready-to-use FPGA flow on multiple boards is available.

Language: Tcl - Size: 2.21 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 96 - Forks: 18

arc-research-lab/CHARM

CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture

Language: C++ - Size: 169 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 140 - Forks: 22

ThomasAtlantis/DistPipe

A distributed framework to implement device-cloud collaborative workflow

Language: Python - Size: 5.86 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

fangq/mcxcl

Monte Carlo eXtreme for OpenCL (MCXCL)

Language: C - Size: 6.66 MB - Last synced at: 5 days ago - Pushed at: 18 days ago - Stars: 43 - Forks: 29

Vincent-Therrien/gpu-arena

Compare and test GPU programming frameworks

Language: C++ - Size: 3.52 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 109 - Forks: 8

taskflow/awesome-parallel-computing

A curated list of awesome parallel computing resources

Size: 3.41 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 722 - Forks: 68

cjmcv/ai-infra-notes

Reading notes on the open source code of AI infrastructure (sglang, llm, cutlass, hpc, etc.)

Size: 777 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

arc-research-lab/AIM

AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP (Full Paper accepted to ICCAD2023)!

Language: C++ - Size: 427 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 4

IntelPython/DPEP

Data Parallel Extensions for Python*

Language: Jupyter Notebook - Size: 8.36 MB - Last synced at: 22 days ago - Pushed at: about 2 months ago - Stars: 35 - Forks: 8

pulp-platform/hero

Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and an application-class host CPU, including full-stack software and hardware.

Language: SystemVerilog - Size: 61.8 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 101 - Forks: 25

QuantumBFS/YaoCompiler.jl

The Yao compiler project

Language: Julia - Size: 1.68 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 21 - Forks: 7

Heteroflow/Heteroflow

Concurrent CPU-GPU Programming using Task Models

Language: C++ - Size: 1.58 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 101 - Forks: 13

JuliaGPU/DaggerGPU.jl

GPU integrations for Dagger.jl

Language: Julia - Size: 61.5 KB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 52 - Forks: 11

APPFL/FedCompass

[ICLR 2024] FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices Using a Computing Power-Aware Scheduler

Language: Python - Size: 411 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 4

LLNL/thicket

Language: JavaScript - Size: 86.6 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 9

pulp-platform/astral Fork of pulp-platform/carfield

A space computing platform built around Cheshire, with a configurable number of safety, security, reliability and predictability features with a ready-to-use FPGA flow on multiple boards.

Language: Tcl - Size: 93.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 4

PlatformAwareProgramming/PlatformAware.jl

Platform-aware programming in Julia

Language: Julia - Size: 1.56 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 13 - Forks: 1

XFluids/XFluids

a unified cross-architecture heterogeneous CFD solver

Language: C++ - Size: 6.03 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 23 - Forks: 7

tugrul512bit/libGPGPU

Multi-GPU & CPU OpenCL kernel executor with load-balancing as if there is one big GPU.

Language: C++ - Size: 2.09 MB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 9 - Forks: 2

bert13069598/LoadBalancing

Efficient Resource Sharing across Heterogeneous Computing

Language: C++ - Size: 69.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

a-sidorova/gpu_opencl_cource

Course Programming on new Architecture-1 (GPU), autumn 2021

Language: C++ - Size: 42 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ParCoreLab/BeyondMoore

BeyondMoore has an ambitious goal to develop a software framework that performs static and dynamic optimizations, issues accelerator-initiated data transfers, and reasons about parallel execution strategies that exploit both processor and memory heterogeneity.

Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

itzmeanjan/merklize-sha

SYCL accelerated Binary Merklization using SHA1, SHA2 & SHA3

Language: C++ - Size: 243 KB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 0

eismont21/AlexLens Fork of FriedemannClaus/AlexLens

AlexLens is a comprehensive Image Classification and Transfer Learning application, specifically designed for heterogeneous computing platforms. It features a custom-built AlexNet Neural Network for in-depth analysis and learning.

Language: C++ - Size: 14.9 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

EduardChou/Accelerated-Dynamic-State-Estimation-Toolbox

Power System Dynamic State Estimation Based on Heterogeneous Computing Acceleration

Language: Python - Size: 7.57 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

xqoasis/GPU-Profiling-with-N-body-problem

GPU profiling through CUDA with N-body problem

Language: Cuda - Size: 446 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ut-parla/Parla.py

A Python based programming system for heterogeneous computing

Language: Python - Size: 31 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 9

mmattioli/OpenCL-Adventures

Learning how to design heterogeneous compute applications using OpenCL with an emphasis on GPU acceleration

Language: C++ - Size: 43.9 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

Choimoe/cylinder-flow-simulation

Heterogeneous Computation for 2D Cylinder Flow Simulation

Language: C++ - Size: 81.1 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

amamory-ampere/openmp-offloading

repository to test the OpenMP offloading capabilities to an NVIDIA GPU

Language: C - Size: 8.4 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

cjmcv/ecas

ECAS is a library for edge AI computing acceleration.

Language: C++ - Size: 598 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

zhangyachen/ComputerArchitectureAndCppBooks

📚 计算机体系结构与C++书籍收集(持续更新)

Size: 36.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 33 - Forks: 7

lukastruemper/patterntree 📦

PatternTree is a high-performance optimization framework for heterogeneous computing architectures

Language: C++ - Size: 85 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

olutosinbanjo/direction_field

Plotting a Direction Field with Python (Intel® DevMesh Project)

Language: Python - Size: 6.19 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Tolisz/blurhash

blurhash algorithm implemented on GPU (OpenCL)

Language: C - Size: 2.42 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

UCLA-SEAL/HeteroGen

HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair (ASPLOS 2022)

Language: Python - Size: 31 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

makotonakai/heqsim

Heterogeneous quantum computing simulator

Language: Python - Size: 463 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

g4m3r0/MagmaDNN-Benchmarsuite

MagmaDNN Benchmarksuite for heterogenous architectures

Language: C++ - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

amamory/power-optim

Minizinc model of a power-aware task placement onto a heterogenous platform (big-little, gpu, fpga)

Language: Python - Size: 446 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

UCLA-SEAL/HeteroFuzz

Fuzz Testing to Detect Platform Dependent Divergence for Heterogeneous Application (FSE 2021)

Language: C - Size: 1.31 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

whitelok/gpu-computation-gems-codes

GPU Computing Gems Jade 2012 Edition实用示例代码

Language: Cuda - Size: 155 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rohankumar42/delta

A fluid framework for heterogeneous function execution

Language: Python - Size: 152 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

milladgit/gecko

Gecko - A programming model for heterogeneous environments

Language: C++ - Size: 355 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

milladgit/deepcopy-benchmark

A set of microbenchmarks for deep copy in directive-based programming models

Language: C++ - Size: 170 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0