Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: nvcc

codingonion/cuda-beginner-course-python-version

bilibili视频【CUDA 12.1 并行编程入门(Python语言版)】配套代码

Language: Python - Size: 3.91 KB - Last synced: 1 day ago - Pushed: 3 months ago - Stars: 5 - Forks: 0

codingonion/cuda-beginner-course-cpp-version

bilibili视频【CUDA 12.1 并行编程入门(C++语言版)】配套代码

Language: Cuda - Size: 15.6 KB - Last synced: 1 day ago - Pushed: 3 months ago - Stars: 16 - Forks: 2

mattbdean/novaXfer

Lightning fast NVCC course equivalencies

Language: TypeScript - Size: 1.2 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 0 - Forks: 0

Hobbbbes/MandelBrot-Cuda-olc-PixelGameEngine

Language: Cuda - Size: 327 KB - Last synced: about 1 month ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

codingonion/cuda-beginner-course-rust-version

bilibili视频【CUDA 12.1 并行编程入门(Rust语言版)】配套代码

Language: Rust - Size: 3.91 KB - Last synced: 1 day ago - Pushed: 3 months ago - Stars: 4 - Forks: 0

termoshtt/link_cuda_kernel

HowTo: Compile CUDA with nvcc, and link to Rust

Language: Cuda - Size: 4.88 KB - Last synced: about 1 month ago - Pushed: almost 6 years ago - Stars: 38 - Forks: 7

TravisWThompson1/Makefile_Example_CUDA_CPP_To_Executable

Example Makefile for CUDA and C++ source files in a standard project layout.

Language: Cuda - Size: 3.91 KB - Last synced: 3 months ago - Pushed: over 6 years ago - Stars: 39 - Forks: 17

Rodolfo-Gallegos/Brownian-Dynamics-Simulation-OpenACC

This is my thesis work for the Bachelor's degree in Physics. / Este es mi trabajo de titulación para la Licenciatura en Física.

Language: C++ - Size: 5.54 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

iamsubhranil/Renderer

A barebones 3D renderer in C++ and Python

Language: C++ - Size: 418 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

TheUnsolvedDev/CUDA_NN_FS

creating a personal neural network library in cuda by as a learning stage side by side project

Language: Cuda - Size: 21.8 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 3 - Forks: 0

jbcbezerra/gradle-nvcc

Gradle plugin for integrating Cuda's nvcc tool

Language: Kotlin - Size: 75.2 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

gvvsnrnaveen/cuda

this repository contains the various programs that can written using CUDA Toolkit.

Language: C - Size: 1.31 MB - Last synced: 5 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

kangyolo/get-started-jetson-nano

Guidance for Nvidia Jetson Nano

Size: 816 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

phrb/nvidia-workshop-autotuning

Resources for autotuning CUDA compiler parameters

Language: Julia - Size: 1.31 MB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 3 - Forks: 1

Kvatsx/GPU-Computing-Assignments

Language: C - Size: 46.7 MB - Last synced: 11 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

ashutoshIITK/install_cuda_cudnn_ubuntu_20

Tutorial to install NVIDIA Drivers, CUDA 11.4 and cuDNN for deep learning programming on Ubuntu 20.04.

Size: 2.2 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 23 - Forks: 7

alpha74/CUDA_basics

NVCC CUDA programs for begineers.

Language: Cuda - Size: 71.3 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

ShadyBoukhary/GPU-research-FFT-OpenACC-CUDA

Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two current most popular many-core programming models CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n2) to O(nLog2n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining what many-core model will provide the highest benefit in performance.

Language: Cuda - Size: 9.12 MB - Last synced: over 1 year ago - Pushed: almost 6 years ago - Stars: 7 - Forks: 2

phrb/gpu-autotuning

Autotuning NVCC Compiler Parameters, published @ CCPE Journal

Language: C - Size: 471 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 9 - Forks: 2

mkf450/nvcc4jupyter Fork of depctg/nvcc4jupyter

A plugin for Jupyter Notebook to run CUDA C/C++ code

Size: 8.79 KB - Last synced: 6 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

pockemon/Computer-Architecture-Lab

Solutions to assignment given in the class of CO316

Language: C++ - Size: 4.37 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

jinparksj/deeplearning_cpp_libraries

Personal libraries for deep learning with C++

Language: C - Size: 2.26 MB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 0

alpha74/HungarianAlgoCUDA

Hungarian Algorithm for Linear Assignment Problem implemented using CUDA.

Language: Cuda - Size: 10.7 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

manikandan-ravikiran/Leetcode_June_Challenge

Problems of June day to day challenge in Leetcode

Language: Python - Size: 13.7 KB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

underscoreanuj/bitonic-sort-visualization

A python script which helps visualize the sorting routine of bitonic sort (executed in parallel using nvcc).

Language: Jupyter Notebook - Size: 340 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 1

gilbertobastos/prj_perceptron_multicamadas_CUDA

Implementação simples do Perceptron Multicamadas em CUDA.

Language: C - Size: 39.1 KB - Last synced: 3 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0