Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: openacc

predsci/POT3D

POT3D: High Performance Potential Field Solver

Language: Fortran - Size: 24.2 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 40 - Forks: 22

trholding/llama2.c Fork of karpathy/llama2.c

Llama 2 Everywhere (L2E)

Language: C - Size: 2.68 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 1,385 - Forks: 40

MFlowCode/MFC

Exascale multiphase flow simulation

Language: Fortran - Size: 448 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 125 - Forks: 56

TommasoTarchi/Advanced_HPC-Final_assignments

Work in progress...

Language: C - Size: 117 KB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0

nakib/elphbolt

A solver for the coupled and decoupled electron and phonon Boltzmann transport equations.

Language: Fortran - Size: 12.2 MB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 36 - Forks: 23

intel/intel-application-migration-tool-for-openacc-to-openmp

OpenACC* to OpenMP* API assisting migration tool

Language: Python - Size: 164 KB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 30 - Forks: 6

UoB-HPC/BabelStream

STREAM, for lots of devices written in many programming models

Language: C++ - Size: 2.33 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 311 - Forks: 106

Tamerkobba/Parallel_Matrix_Mul

Parallelizing Matrix multiplication using CUDA C and OpenACC

Language: Jupyter Notebook - Size: 51.8 KB - Last synced: 25 days ago - Pushed: 25 days ago - Stars: 0 - Forks: 0

jefflarkin/miniWeather Fork of mrnorman/miniWeather

A parallel programming training mini app simulating weather-like flows

Language: C++ - Size: 8.22 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

ParRes/Kernels

This is a set of simple programs that can be used to explore the features of a parallel platform.

Language: C - Size: 14 MB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 401 - Forks: 106

pyccel/pyccel

Python extension language using accelerators

Language: Python - Size: 17.8 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 330 - Forks: 54

peterdschwartz/SPEL_OpenACC

Python tool designed for E3SM Land Model to create unit-tests and code-insertion of GPU compiler directives.

Language: Python - Size: 2.69 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 1

gilbertobastos/prj_perceptron_multicamadas_OpenACC_NVIDIA

Simple parallel implementation of the Multilayer Perceptron using OpenACC, targeting NVIDIA GPUs

Language: C - Size: 36.1 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

gilbertobastos/prj_perceptron_multicamadas_OpenACC_MULTICORE

Simple parallel implementation of the Multilayer Perceptron using OpenACC, targeting CPUs

Language: C - Size: 34.2 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

dc-fukuoka/gpumm

gpumm - matrix-matrix multiplication using CUDA, cuBLAS, cuBLASXt, and OpenACC.

Language: Cuda - Size: 7.6 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 4 - Forks: 0

dc-fukuoka/gpu_ring

A test of GPUDirect with CUDA/OpenACC.

Language: C - Size: 7.81 KB - Last synced: 2 months ago - Pushed: almost 4 years ago - Stars: 3 - Forks: 0

dc-fukuoka/jacobi

jacobi - a benchmark that solves the 2D Laplace equation with the Jacobi iterative method. A GPU or Xeon Phi can be used.

Language: Fortran - Size: 24.4 KB - Last synced: 2 months ago - Pushed: about 6 years ago - Stars: 7 - Forks: 4

dc-fukuoka/mandelbrot

Mandelbrot set by MPI/OpenMP/OpenACC.

Language: Fortran - Size: 73.2 KB - Last synced: 2 months ago - Pushed: about 6 years ago - Stars: 2 - Forks: 0

dc-fukuoka/can

can - a simple dense matrix-matrix multiplication benchmark with MPI/OpenMP/OpenACC. The MPI version is based on Cannon's algorithm.

Language: Fortran - Size: 27.3 KB - Last synced: 2 months ago - Pushed: over 5 years ago - Stars: 2 - Forks: 2

szaghi/FUNDAL

Fortran UNified Device Acceleration Library

Language: Fortran - Size: 2.87 MB - Last synced: 10 days ago - Pushed: about 2 months ago - Stars: 8 - Forks: 2

eric2003/OneFLOW

Large-Scale Multiphysics Scientific Simulation Environment - OneFLOW CFD

Language: C++ - Size: 110 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 239 - Forks: 79

tan2/geoflac

Code for lithospheric scale geodynamics

Language: Fortran - Size: 1.18 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 5 - Forks: 9

Rodolfo-Gallegos/Brownian-Dynamics-Simulation-OpenACC

This is my thesis work for the Bachelor's degree in Physics.

Language: C++ - Size: 5.54 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

usnistgov/hiperc

High Performance Computing Strategies for Boundary Value Problems

Language: HTML - Size: 63.4 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 37 - Forks: 8

OpenACC/openacc-training-materials

Training materials provided by OpenACC.org.

Language: C - Size: 16.8 MB - Last synced: 2 months ago - Pushed: about 3 years ago - Stars: 73 - Forks: 27

stfc/PSycloneBench

Various benchmarks used to inform PSyclone optimisations

Language: Fortran - Size: 18.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 6 - Forks: 5

openhackathons-org/gpubootcamp

This repository consists of GPU bootcamp material for HPC and AI

Language: Jupyter Notebook - Size: 261 MB - Last synced: 3 months ago - Pushed: 7 months ago - Stars: 491 - Forks: 255

Programacao-Paralela-e-Distribuida/programacao-paralela-e-distribuida.github.io

Website for the book Programação Paralela e Distribuída (Parallel and Distributed Programming)

Language: HTML - Size: 124 KB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

predsci/multigpu-test-code

This code mimics the basic MPI+OpenACC tasks of PSI's MAS Solar MHD code, for use with testing multi-GPU multi-node clusters

Language: Fortran - Size: 36.1 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

alpaka-group/alpaka

Abstraction Library for Parallel Kernel Acceleration

Language: C++ - Size: 1.03 GB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 303 - Forks: 62

KaiErikNiermann/hpc-uzh-notes

These are some notes for the High Performance Computing course taught at UZH

Size: 108 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

claw-project/claw-compiler

CLAW Compiler for Performance Portability

Language: Java - Size: 8.48 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 39 - Forks: 15

OpenACC/openacc-best-practices-guide

The sources for the OpenACC Programming and Best Practices Guide.

Language: TeX - Size: 2.87 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 23 - Forks: 10

ROCmSoftwarePlatform/gpufort

GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify

Language: Fortran - Size: 7.48 MB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 156 - Forks: 13

danxuZhang/HPC

HPC and Parallel Computing Learning Notes and Code

Language: Cuda - Size: 41 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

openhackathons-org/nways_accelerated_programming

N-Ways to GPU Programming Bootcamp

Language: Jupyter Notebook - Size: 18 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 25 - Forks: 16

jeng1220/openacc_fortran_examples

Simple OpenACC Fortran Examples

Language: Fortran - Size: 119 KB - Last synced: 4 months ago - Pushed: almost 3 years ago - Stars: 32 - Forks: 6

GRO4T/PORR_particle_swarm_optimization_in_OpenMP_OpenACC_and_OpenMPI

Implementations of particle swarm optimization and random search algorithms using various parallel programming APIs.

Language: C++ - Size: 4.22 MB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

pkestene/tsp

Traveling salesman problem solved with different programming models

Language: C++ - Size: 56.6 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 3 - Forks: 0

yasahi-hpc/P3-miniapps

Kinetic plasma simulation code parallelized with C++ parallel algorithms

Language: C++ - Size: 4.91 MB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 4 - Forks: 0

Hopobcn/FWI

RTM

Language: C - Size: 754 KB - Last synced: 7 months ago - Pushed: about 6 years ago - Stars: 33 - Forks: 14

jnbntz/gpu-edu-workshops

Code examples for CUDA and OpenACC

Language: Cuda - Size: 9.04 MB - Last synced: 7 months ago - Pushed: about 7 years ago - Stars: 35 - Forks: 18

openhackathons-org/HPC_Profiler

Profiling with NVIDIA Nsight Tools Bootcamp

Language: C++ - Size: 23.5 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 1

marcjoos-cea/dumses-hybrid

CFD/MHD code for astrophysics

Language: Fortran - Size: 24.2 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 3 - Forks: 0

paveon/PCG-NBody-OpenACC

[VUT FIT] OpenACC N-Body simulation project for the PCG course

Language: C++ - Size: 2.35 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 1

hahnjo/CGxx

Object-Oriented Implementation of the Conjugate Gradients Method

Language: C - Size: 204 KB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 1

Aesfrost/ZPIC_OmpSs2

Parallel 2D EM-PIC kinetic plasma simulator based on the ZPIC suite

Language: C - Size: 943 KB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1

mnicely/computeWorks_examples

Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLAS, and CUDA

Language: C++ - Size: 834 KB - Last synced: 8 months ago - Pushed: almost 2 years ago - Stars: 6 - Forks: 1

olcf-tutorials/openmp_offloading

OpenMP programming tips for GPU offloading

Language: C++ - Size: 46.9 KB - Last synced: 6 months ago - Pushed: over 4 years ago - Stars: 5 - Forks: 2

OpenACC/openacc-interoperability-examples Fork of jefflarkin/openacc-interoperability

Interoperability examples for OpenACC.

Language: C - Size: 199 KB - Last synced: 4 months ago - Pushed: about 9 years ago - Stars: 5 - Forks: 5

kuldeep-tolia/OpenACC_FORTRAN_Codes

OpenACC GPU parallelization for various numerical methods and miscellaneous problems using FORTRAN

Language: Fortran - Size: 136 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

kuldeep-tolia/OpenACC_C_Codes

OpenACC GPU parallelization for various numerical methods and miscellaneous problems using C

Language: C - Size: 42 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

ENCCS/openacc

OpenACC

Language: C - Size: 3.24 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 2

phbastosa/seismic_tomography_3D

Master degree project using object-oriented programming

Language: C++ - Size: 18.8 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

WalterNadalin/ParallelJacobi

Numerical solution of the Laplace equation implementing the Jacobi method

Language: C - Size: 44.1 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

DonAurelio/coder

The base for a web-based parallel programming environment built on a microservice approach

Language: JavaScript - Size: 7.41 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1

ShadyBoukhary/GPU-research-FFT-OpenACC-CUDA

Case studies constitute a modern, interdisciplinary, and valuable teaching practice that plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two currently most popular many-core programming models, CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n^2) to O(n log2 n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining which many-core model will provide the highest benefit in performance.

Language: Cuda - Size: 9.12 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 7 - Forks: 2

jefflarkin/openacc-interoperability

Interoperability examples for OpenACC.

Language: C - Size: 43 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 41 - Forks: 24

goatandsheep/mandelboxes 📦

mandelboxes for a course, SFWR ENG 4F03

Language: C++ - Size: 3.86 MB - Last synced: 18 days ago - Pushed: about 8 years ago - Stars: 1 - Forks: 0

MFlowCode/MicroFC

A micro MFC and CFD mini-app

Language: Fortran - Size: 35.3 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1

OpenACCUserGroup/openacc-users-group

Language: C - Size: 6.69 MB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 71 - Forks: 25

sotanmochi/HPC-Samples

Code Samples for CUDA, OpenACC and OpenMP

Language: C++ - Size: 16.6 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

yasahi-hpc/vlp4d_mpi

MPI+Kokkos/OpenACC/OpenMP4.5/stdpar implementation of vlp4d

Language: C++ - Size: 374 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

muriloboratto/benchmark-mode-optimization-GPU

Benchmark Matrix Multiply on GPU Environment.

Language: Shell - Size: 49.8 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 0

piyueh/PETSC-OpenACC Fork of olcf/PETSC-OpenACC

A mirror of https://github.com/olcf/PETSC-OpenACC -- An example of accelerating PETSc with OpenACC

Language: Shell - Size: 286 KB - Last synced: 4 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0

jefflarkin/acc-events

Language: Fortran - Size: 5.86 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

eafit-apolo/2DPartInt

Soil particles contact simulation

Language: C - Size: 1.17 MB - Last synced: 4 months ago - Pushed: almost 3 years ago - Stars: 6 - Forks: 0

PawseySC/sc20-gpu-offloading

Materials for "Differences between OpenACC and OpenMP offloading models" tutorial.

Language: C - Size: 650 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 4 - Forks: 4

larsgeb/fd-wave-modelling-gpu

Forward 2D elastic wave equation modelling using either OpenMP or OpenACC. Compiles with the PGI compiler.

Language: C++ - Size: 28.3 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 0

gabrielchristo/prog-dist

Parallel and Distributed Programming

Language: C - Size: 7.93 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

MaxStrange/pyACC

OpenACC for Python

Language: Python - Size: 184 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 18 - Forks: 1

ajarmusch/Testsuite

OpenACC Validation and Verification Testsuite repository

Language: JavaScript - Size: 1.53 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

estradjm/Parallel-Gaussian-Blurring

LANL Parallel Computing Summer Research Institute 2017 GPU Exercise - C implementation of Gaussian Blurring of .ppm format image

Language: C - Size: 483 KB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 1

xstupi00/N-Body-OpenACC

Parallel Computations on GPU - Project - N-Body-OpenACC

Language: Python - Size: 76 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

diamantopoulos/diamantopoulos.github.io

Dionysios Diamantopoulos Web Edition

Language: JavaScript - Size: 10.5 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

capellil/IHPCSS_Programming_challenge_2019

The repository containing everything you need to compete in the IHPCSS 2019 programming challenge.

Language: Fortran - Size: 1.14 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 9 - Forks: 11

nachovizzo/saxpy_openacc_cpp

My way of thinking about OpenACC, C++, and Parallel computing in general

Language: C++ - Size: 16.6 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

pyccel/lampy

Extension of Pyccel for functional programming

Language: Python - Size: 1.52 MB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

PawseySC/2D-Laplace-Offload

The main purpose of this tutorial is to present similarities and differences between device offload mechanisms available in OpenACC and OpenMP standards.

Language: C - Size: 25.4 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 2

milladgit/rodinia Fork of qbunia/rodinia

Rodinia 2.1 benchmark modified to run with OpenACC 2.7 and PGI 18.4

Language: C - Size: 344 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

truong-qe/hello

Some Fortran codes to practice programming in Fortran.

Language: Fortran - Size: 48.8 KB - Last synced: 4 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

Raienryu97/parallelizationstudy

A performance study of various parallelisation tools on a few benchmarks

Language: C++ - Size: 51 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 1 - Forks: 0

spino327/NAS_SHOC_OpenACC_2.5

Code repository for paper "Exploring translation of OpenMP to OpenACC 2.5: Lessons Learned"

Language: Python - Size: 39.1 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 2 - Forks: 2

RodrigoOt/OpenaccBuildScript

Just another build script for GCC and nvptx

Language: Shell - Size: 39.1 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 0

RodrigoOt/nvptx-tools Fork of MentorEmbedded/nvptx-tools

nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.

Language: C - Size: 874 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 0

AndiH/jarvice-gtc19-power-image

Docker Image for JARVICE

Language: Dockerfile - Size: 807 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

Trick-17/backends

Interchangeable backends in C++, OpenMP, CUDA, OpenCL, OpenACC

Language: C++ - Size: 80.1 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0