An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: heterogeneous-computing

cjmcv/ai-infra-notes

Reading notes on the open source code of AI infrastructure (sglang, llm, cutlass, hpc, etc.)

Size: 777 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3 - Forks: 0

arc-research-lab/CHARM

CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture

Language: C++ - Size: 182 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 143 - Forks: 22

pulp-platform/carfield

A mixed-criticality platform built around Cheshire, with a number of safety/security and predictability features. Ready-to-use FPGA flow on multiple boards is available.

Language: Tcl - Size: 3.29 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 103 - Forks: 20

array2d/deepx

Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support

Language: C++ - Size: 1.88 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 46 - Forks: 4

pulp-platform/hero

Heterogeneous Research Platform (HERO) for exploration of heterogeneous computers consisting of programmable many-core accelerators and an application-class host CPU, including full-stack software and hardware.

Language: SystemVerilog - Size: 61.8 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 105 - Forks: 26

pulp-platform/astral Fork of pulp-platform/carfield

A space computing platform built around Cheshire, with a configurable number of safety, security, reliability and predictability features with a ready-to-use FPGA flow on multiple boards.

Language: Tcl - Size: 93.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 9 - Forks: 4

arc-research-lab/AIM

AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP (Full Paper accepted to ICCAD2023)!

Language: C++ - Size: 429 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 22 - Forks: 3

LLNL/thicket

Language: JavaScript - Size: 87.1 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 16 - Forks: 9

itzmeanjan/merklize-sha

SYCL accelerated Binary Merklization using SHA1, SHA2 & SHA3

Language: C++ - Size: 243 KB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

ThomasAtlantis/DistPipe

A distributed framework to implement device-cloud collaborative workflow

Language: Python - Size: 23.4 KB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

fangq/mcxcl

Monte Carlo eXtreme for OpenCL (MCXCL)

Language: C - Size: 6.66 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 43 - Forks: 29

Vincent-Therrien/gpu-arena

Compare and test GPU programming frameworks

Language: C++ - Size: 3.52 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 109 - Forks: 8

taskflow/awesome-parallel-computing

A curated list of awesome parallel computing resources

Size: 3.41 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 722 - Forks: 68

IntelPython/DPEP

Data Parallel Extensions for Python*

Language: Jupyter Notebook - Size: 8.36 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 35 - Forks: 8

QuantumBFS/YaoCompiler.jl

The Yao compiler project

Language: Julia - Size: 1.68 MB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 21 - Forks: 7

Heteroflow/Heteroflow

Concurrent CPU-GPU Programming using Task Models

Language: C++ - Size: 1.58 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 101 - Forks: 13

JuliaGPU/DaggerGPU.jl

GPU integrations for Dagger.jl

Language: Julia - Size: 61.5 KB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 52 - Forks: 11

APPFL/FedCompass

[ICLR 2024] FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices Using a Computing Power-Aware Scheduler

Language: Python - Size: 411 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 4

PlatformAwareProgramming/PlatformAware.jl

Platform-aware programming in Julia

Language: Julia - Size: 1.56 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 13 - Forks: 1

XFluids/XFluids

a unified cross-architecture heterogeneous CFD solver

Language: C++ - Size: 6.03 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 23 - Forks: 7

tugrul512bit/libGPGPU

Multi-GPU & CPU OpenCL kernel executor with load-balancing as if there is one big GPU.

Language: C++ - Size: 2.09 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 2

bert13069598/LoadBalancing

Efficient Resource Sharing across Heterogeneous Computing

Language: C++ - Size: 69.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

a-sidorova/gpu_opencl_cource

Course Programming on new Architecture-1 (GPU), autumn 2021

Language: C++ - Size: 42 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ParCoreLab/BeyondMoore

BeyondMoore has an ambitious goal to develop a software framework that performs static and dynamic optimizations, issues accelerator-initiated data transfers, and reasons about parallel execution strategies that exploit both processor and memory heterogeneity.

Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

eismont21/AlexLens Fork of FriedemannClaus/AlexLens

AlexLens is a comprehensive Image Classification and Transfer Learning application, specifically designed for heterogeneous computing platforms. It features a custom-built AlexNet Neural Network for in-depth analysis and learning.

Language: C++ - Size: 14.9 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

EduardChou/Accelerated-Dynamic-State-Estimation-Toolbox

Power System Dynamic State Estimation Based on Heterogeneous Computing Acceleration

Language: Python - Size: 7.57 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

xqoasis/GPU-Profiling-with-N-body-problem

GPU profiling through CUDA with N-body problem

Language: Cuda - Size: 446 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ut-parla/Parla.py

A Python based programming system for heterogeneous computing

Language: Python - Size: 31 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 9

mmattioli/OpenCL-Adventures

Learning how to design heterogeneous compute applications using OpenCL with an emphasis on GPU acceleration

Language: C++ - Size: 43.9 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

Choimoe/cylinder-flow-simulation

Heterogeneous Computation for 2D Cylinder Flow Simulation

Language: C++ - Size: 81.1 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

amamory-ampere/openmp-offloading

repository to test the OpenMP offloading capabilities to an NVIDIA GPU

Language: C - Size: 8.4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

cjmcv/ecas

ECAS is a library for edge AI computing acceleration.

Language: C++ - Size: 598 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

zhangyachen/ComputerArchitectureAndCppBooks

📚 计算机体系结构与C++书籍收集(持续更新)

Size: 36.7 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 33 - Forks: 7

lukastruemper/patterntree 📦

PatternTree is a high-performance optimization framework for heterogeneous computing architectures

Language: C++ - Size: 85 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

olutosinbanjo/direction_field

Plotting a Direction Field with Python (Intel® DevMesh Project)

Language: Python - Size: 6.19 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Tolisz/blurhash

blurhash algorithm implemented on GPU (OpenCL)

Language: C - Size: 2.42 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

UCLA-SEAL/HeteroGen

HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair (ASPLOS 2022)

Language: Python - Size: 31 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

makotonakai/heqsim

Heterogeneous quantum computing simulator

Language: Python - Size: 463 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

g4m3r0/MagmaDNN-Benchmarsuite

MagmaDNN Benchmarksuite for heterogenous architectures

Language: C++ - Size: 14.6 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

amamory/power-optim

Minizinc model of a power-aware task placement onto a heterogenous platform (big-little, gpu, fpga)

Language: Python - Size: 446 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

UCLA-SEAL/HeteroFuzz

Fuzz Testing to Detect Platform Dependent Divergence for Heterogeneous Application (FSE 2021)

Language: C - Size: 1.31 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

whitelok/gpu-computation-gems-codes

GPU Computing Gems Jade 2012 Edition实用示例代码

Language: Cuda - Size: 155 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rohankumar42/delta

A fluid framework for heterogeneous function execution

Language: Python - Size: 152 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

milladgit/gecko

Gecko - A programming model for heterogeneous environments

Language: C++ - Size: 355 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

milladgit/deepcopy-benchmark

A set of microbenchmarks for deep copy in directive-based programming models

Language: C++ - Size: 170 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Related Keywords
heterogeneous-computing 45 gpu 10 heterogeneous-parallel-programming 8 cuda 6 fpga 6 parallel-computing 5 high-performance-computing 5 gpu-computing 5 opencl 5 parallel-programming 4 python 4 hpc 4 distributed-computing 3 sycl 3 riscv 3 high-level-synthesis 3 distributed-systems 3 c 3 numpy 2 gpu-programming 2 heterogeneous 2 computer-architecture 2 cfd 2 openmp-offloading 2 shared-memory 2 quantum-computing 2 performance 2 scientific-computing 2 gpgpu 2 simd 2 acap 2 design-space-exploration 2 domain-specific-architecture 2 electronic-design-automation 2 versal 2 versalacap 2 cplusplus 2 safety-critical 2 systemverilog 2 numba 2 edge-computing 1 graph 1 cv 1 ai 1 openmp-target 1 openmp 1 omp 1 kws 1 benchmark-suite 1 multi-gpu 1 aritificial-intelligence 1 distributed-processing 1 embedded-system 1 parallel-processing 1 gemm 1 jacobi 1 erc20-horizon 1 moores-law 1 alexnet 1 data-science 1 deep-learning 1 image-classification 1 intel-movidius 1 machine-learning 1 qt5 1 transfer-learning 1 power-systems-analysis 1 state-estimation 1 openmpi 1 profiling 1 gpgpu-computing 1 matrix-calculations 1 magma 1 magmadnn 1 constraint-programming 1 minizinc 1 scheduling-algorithms 1 automated-testing 1 fuzzing 1 nvidia-gpu 1 search-algorithm 1 computing-continuum 1 function-as-a-service 1 serverless 1 hierarchical-models 1 portability 1 benchmarking 1 benchmarks 1 deepcopy 1 directives 1 microbenchmarks 1 programming-model 1 neon 1 vulkan 1 algorithms 1 book 1 chinese 1 cpp 1 cs-books 1 operating-system 1