GitHub topics: gpu-acceleration
tensorflow/tfjs-core 📦
WebGL-accelerated ML // linear algebra // automatic differentiation for JavaScript.
Language: TypeScript - Size: 362 MB - Last synced at: about 7 hours ago - Pushed at: almost 6 years ago - Stars: 8,477 - Forks: 944

SamuelSchmidgall/SurgicalGym
High-performance GPU-based simulation platform for reinforcement learning with surgical robot learning
Language: Python - Size: 47.9 MB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 74 - Forks: 6

Krasnomakov/EventDrivenArchitecture
Prototypes of Event-Driven Architecture with Computer Vision, games, aniamtion and LLM models
Language: Python - Size: 110 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

TurakhiaLab/TWILIGHT
High throughput tool for tall and wide multiple sequence alignment.
Language: C++ - Size: 42.9 MB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 3 - Forks: 1

tensorflow/tfjs
A WebGL accelerated JavaScript library for training and deploying ML models.
Language: TypeScript - Size: 166 MB - Last synced at: about 7 hours ago - Pushed at: 14 days ago - Stars: 18,853 - Forks: 1,973

uncomplicate/clojurecuda
Clojure library for CUDA development
Language: Clojure - Size: 511 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 186 - Forks: 10

kunitoki/yup
YUP is an open-source library dedicated to empowering developers with advanced tools for cross-platform application development.
Language: C++ - Size: 25.2 MB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 83 - Forks: 10

AI4Finance-Foundation/RLSolver
Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.
Language: Python - Size: 60.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 151 - Forks: 35

EMI-Group/evox
Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.
Language: Python - Size: 42.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 654 - Forks: 95

weatherisgood2/ngpt
A lightweight Python CLI and library for interacting with OpenAI-compatible APIs, supporting both official and self-hosted LLM endpoints.
Language: Python - Size: 133 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

MrGKanev/TensorFlow-GPU-Docker-Setup
A Docker environment for TensorFlow GPU development with optimized configurations for WSL2, troubleshooting guides, and common error fixes
Language: Python - Size: 43.9 KB - Last synced at: about 9 hours ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Caltech-Biophotonics-Lab/fpm-96eyes-reconstruction Fork of antonysigma/fpm-96eyes-reconstruction
GPU-accelerated Fourier Ptychography for brightfield-only images
Language: C++ - Size: 73.2 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

matteospanio/torchfx
A GPU accelerated and torch based audio DSP library
Language: Python - Size: 5.31 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 84 - Forks: 4

IntelPython/dpnp
Data Parallel Extension for NumPy
Language: Python - Size: 748 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 109 - Forks: 22

mikeroyal/GPU-Guide
Graphics Processing Unit (GPU) Architecture Guide
Language: Shell - Size: 815 KB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 215 - Forks: 18

NVIDIA/cccl
CUDA Core Compute Libraries
Language: C++ - Size: 82.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,690 - Forks: 224

8e8bdba457c18cf692a95fe2ec67000b/VulkanCooperativeMatrixAttention
Vulkan & GLSL implementation of FlashAttention-2
Size: 1.95 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

kiankyars/Ultra-Scale-Playbook-Series
Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Jaysmito101/TerraForge3D
Cross Platform Professional Procedural Terrain Generation & Texturing Tool
Language: C++ - Size: 630 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,066 - Forks: 98

NikolasEnt/ollama-webui-intel
Ollama with intel (i)GPU acceleration in docker and benchmark
Language: Python - Size: 1.55 MB - Last synced at: about 13 hours ago - Pushed at: 4 days ago - Stars: 14 - Forks: 4

JuliaHealth/KomaMRI.jl
Koma is a Pulseq-compatible framework to efficiently simulate Magnetic Resonance Imaging (MRI) acquisitions. The main focus of this package is to simulate general scenarios that could arise in pulse sequence development.
Language: Julia - Size: 575 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 148 - Forks: 23

thomasvrussell/sfft
Saccadic Fast Fourier Transform (SFFT) algorithm for Image subtraction in Fourier space
Language: Jupyter Notebook - Size: 441 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 53 - Forks: 9

TianZerL/Anime4KCPP
A high performance anime upscaler
Language: C++ - Size: 7.67 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,895 - Forks: 146

NexusGPU/tensor-fusion-site
TensorFusion landing page and product docs
Language: Vue - Size: 1.59 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 2

Project-HAMi/HAMi
Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)
Language: Go - Size: 129 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,791 - Forks: 314

ROCm/Tensile
[DEPRECATED] Moved to ROCm/rocm-libraries repo
Language: Python - Size: 95.2 MB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 245 - Forks: 166

ParaGroup/WindFlow
A C++17 Data Stream Processing Parallel Library for Multicores and GPUs
Language: C++ - Size: 48.9 MB - Last synced at: about 5 hours ago - Pushed at: 4 months ago - Stars: 84 - Forks: 19

ProjectPhysX/OpenCL-Wrapper
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Language: C++ - Size: 344 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 406 - Forks: 40

beehive-lab/TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
Language: Java - Size: 152 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 1,261 - Forks: 119

Autodesk/Neon
Multi-GPU Framework for Voxel Grid Computations
Language: C++ - Size: 102 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 57 - Forks: 14

raphamorim/rio
A hardware-accelerated GPU terminal emulator focusing to run in desktops and browsers.
Language: Rust - Size: 276 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5,168 - Forks: 195

AlejandroAmat/3dgs-vulkan-cpp
Cross-platform Vulkan 3D Gaussian Splatting renderer - Windows/Mac/Linux, any GPU, with Python binding support
Language: C++ - Size: 131 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 6 - Forks: 0

dgasmith/opt_einsum
⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.
Language: Python - Size: 4.11 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 918 - Forks: 73

rajat709/Cloud-API-Builder
Dataoorts Cloud Frontend
Language: Python - Size: 20.6 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Shenggan/xfold
Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction
Language: Python - Size: 2.2 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 35 - Forks: 4

NVIDIA/warp
A Python framework for accelerated simulation, data generation and spatial computing.
Language: Python - Size: 48.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 5,201 - Forks: 319

NCAR/micm
A model-independent chemistry module for atmosphere models
Language: C++ - Size: 46.5 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 7

mitmath/JuliaComputation
Repository for Common Ground C25
Language: Julia - Size: 69.7 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 101 - Forks: 14

NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Language: Jupyter Notebook - Size: 91.8 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 3,184 - Forks: 764

NVIDIA/optix-dev
OptiX SDK headers, everything needed to build & run OptiX applications. SDK samples not included.
Language: C++ - Size: 186 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 2

EMI-Group/evomo
EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO)
Language: Python - Size: 995 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 78 - Forks: 8

bgin/Radar-ElectroOptical-Simulation
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
Language: C++ - Size: 31.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 60 - Forks: 20

nakib/elphbolt
A solver for the coupled and decoupled electron and phonon Boltzmann transport equations.
Language: Fortran - Size: 12.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 51 - Forks: 29

yifanzhu-fluid/PCOMCOT2.1
2.1 version of PCOMCOT—An efficient parallel dispersive tsunami model
Language: Fortran - Size: 66.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 8 - Forks: 0

Chaz-Ortiz/EMBA-CS-4613-901
EMBA is a firmware security analysis tool. I created a module for EMBA (p51 mustang binwalk extractor) that checks if the firmware file system being analyzed has a deep max depth that can benefit from an optimized extractor (>50 levels deep), leverages GPUs for processing speed and uses parallelization for file operations.
Language: Shell - Size: 44.9 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

DanteMichaeli/GPU-option-pricing-thesis
(In progress) Bachelor's thesis on GPU acceleration of the CRR binomial tree model and Monte Carlo option pricing
Language: TeX - Size: 1.38 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

RobinKa/tfga
Python package for Geometric / Clifford Algebra with TensorFlow
Language: Jupyter Notebook - Size: 2.31 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 52 - Forks: 7

bgin/Radar_ElectroOptical_Simulation
(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.
Language: Fortran - Size: 51.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 50 - Forks: 15

NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Language: C++ - Size: 136 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 11,706 - Forks: 2,199

BotBlake/jellybench_py
Client for the Jellyfin Hardware Survey Server (https://hwa.jellyfin.org/). Benchmarks hardware performance for simultaneous ffmpeg transcoding, enabling detailed comparisons and insights for optimizing Jellyfin setups.
Language: Python - Size: 393 KB - Last synced at: 5 days ago - Pushed at: 10 days ago - Stars: 21 - Forks: 12

real-space/AngstromCube
A parallel and GPU-accelerated Code for Real-Space All-Electron Linear-Scaling Density Functional Theory
Language: C++ - Size: 33.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 7 - Forks: 2

kklmn/xrt
Package xrt (XRayTracer) is a python software library for ray tracing and wave propagation in x-ray regime. It is primarily meant for modeling synchrotron sources, beamlines and beamline elements.
Language: Python - Size: 472 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 93 - Forks: 31

Charmve/AccANN
🐆 A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration for *AdderNet*
Size: 396 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 1

dominic-chang/Krang.jl
Fast analytic raytracing around Kerr black holes
Language: Julia - Size: 290 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 14 - Forks: 5

GalacticDynamics/galax
Galactic and Gravitational Dynamics in Python (+ GPU and autodiff)
Language: Python - Size: 5.64 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 39 - Forks: 8

z6cc/LocalWebAI
LocalWebAI: Run AI models directly in browsers & Node.js with no backend or API calls. Privacy-first, offline-capable LLM inference engine powered by WebAssembly (WASM)
Language: TypeScript - Size: 29.3 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

Alisah-Ozcan/HEonGPU
HEonGPU is a high-performance library that optimizes Fully Homomorphic Encryption (FHE) on GPUs. Leveraging GPU parallelism, it reduces computational load through concurrent execution. Its multi-stream architecture minimizes data transfer overhead, making it ideal for large-scale encrypted computations with reduced latency.
Language: Cuda - Size: 547 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 71 - Forks: 19

Alisah-Ozcan/GPU-FFT
Welcome to the GPU-FFT-Optimization repository! We present cutting-edge algorithms and implementations for optimizing the Fast Fourier Transform (FFT) on Graphics Processing Units (GPUs).
Language: Cuda - Size: 85.9 KB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 16 - Forks: 2

AsadiAhmad/100_Sports_Image_Classification
A deep learning project for sport image classification using a custom VGG19-based architecture with integrated Grad-CAM heatmap visualization for model interpretability.
Language: Jupyter Notebook - Size: 4.08 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

BlazingDB/blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Language: C++ - Size: 41.4 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 1,979 - Forks: 184

tinh2044/YOLO12-UnderWater
YOLOv12 Underwater Object Detection is an open-source suite for underwater object detection, built on YOLOv12. It offers an end-to-end pipeline with GPU-accelerated training, customizable data augmentations, real-time inference via Gradio, and support for model export (ONNX & PyTorch).
Language: Python - Size: 52.1 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

NVlabs/sionna-rt
Sionna RT: The Ray Tracing Package of Sionna
Language: Jupyter Notebook - Size: 27.8 MB - Last synced at: 9 days ago - Pushed at: 16 days ago - Stars: 42 - Forks: 15

baggepinnen/MonteCarloMeasurements.jl
Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Language: Julia - Size: 4.91 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 275 - Forks: 18

dominic-chang/JacobiElliptic.jl
Elliptic integrals and Jacobi elliptic functions that are GPU friendly and auto differentiable
Language: Julia - Size: 988 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 8 - Forks: 1

LLNL/CARE
CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Language: C++ - Size: 1.47 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 30 - Forks: 4

Dodotree/webgl_packing
The technique for processing binary (black and white) image data using WebGL.
Language: JavaScript - Size: 49.8 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

ivanZanardi/pyharmx
Polyharmonic spline interpolation in PyTorch
Language: Python - Size: 5.21 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

AMReX-Combustion/PeleLMeX
An adaptive mesh hydrodynamics simulation code for low Mach number reacting flows without level sub-cycling.
Language: C++ - Size: 28.3 MB - Last synced at: 5 days ago - Pushed at: 16 days ago - Stars: 39 - Forks: 50

NVlabs/sionna
Sionna: An Open-Source Library for Research on Communication Systems
Language: Jupyter Notebook - Size: 260 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1,026 - Forks: 303

AntoninHorkel/paint
GPU accelerated paint app in Rust
Language: Rust - Size: 57.6 KB - Last synced at: 3 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

qcpydev/qcpy
qc simulator
Language: Python - Size: 54.2 MB - Last synced at: 9 days ago - Pushed at: 16 days ago - Stars: 12 - Forks: 2

anormi001/chatterbox-tts-api
Chatterbox TTS API is a FastAPI-powered REST API designed for text-to-speech applications. It offers seamless integration and efficient performance, making it a great choice for developers looking to enhance their projects. ⭐️👩💻
Language: Python - Size: 253 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

thieu1995/WaveletML
WaveletML: A Scalable and Extensible Wavelet Neural Network Framework
Language: Python - Size: 179 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

ucl-bug/jwave
A JAX-based research framework for differentiable and parallelizable acoustic simulations, on CPU, GPUs and TPUs
Language: Python - Size: 54.8 MB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 170 - Forks: 21

beehive-lab/kfusion-tornadovm
🎥 A Java implementation of Kinect Fusion running on Tornado VM.
Language: Java - Size: 819 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 24 - Forks: 8

DiamondLightSource/httomo
High-throughput tomography pipeline
Language: Python - Size: 284 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 8 - Forks: 4

DiamondLightSource/httomolibgpu
A library of GPU-enabled data processing and reconstruction methods for tomography
Language: Python - Size: 126 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 6

viam-modules/viam-mlmodelservice-triton
MLModelService wrapping Nvidia's Triton Server
Language: C++ - Size: 168 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 5 - Forks: 6

Awesomegamergame/FfmpegConverter
A .NET Framework 4.8, C# Console app which converts any video file dropped onto it or in the same folder as it into a mkv file with the av1 codec using either qsv from intel or nvenc from nvidia on supported cards. This is all possible by passing command arguments to ffmpeg.
Language: C# - Size: 69.3 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

eszdman/PhotonCamera
Android Camera that uses Enhanced image processing
Language: Java - Size: 22.8 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 865 - Forks: 78

NVIDIA-Merlin/HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
Language: C++ - Size: 55.7 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 1,009 - Forks: 205

AudioKit/Waveform
GPU accelerated waveform view
Language: Swift - Size: 4.62 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 227 - Forks: 16

nlesc-dirac/sagecal
SAGECal is a fast, memory efficient and GPU accelerated radio interferometric calibration program. It supports all source models including points, Gaussians and Shapelets. Distributed calibration using MPI and consensus optimization is enabled. Both spectral and spatial priors can be used as constraints. Tools to build/restore sky models are included.
Language: C - Size: 39.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 13 - Forks: 8

SCRobarts/OptiFax
OptiFaχ - Nonlinear Optical Facsimile
Language: MATLAB - Size: 33.4 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 4 - Forks: 1

curtisgray/wingman
Wingman is the fastest and easiest way to run Llama models on your PC or Mac.
Language: TypeScript - Size: 188 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 41 - Forks: 2

MegviiRobot/MegBA
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment
Language: Cuda - Size: 1.3 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 467 - Forks: 61

Maria-Antony/KernelCraft
KernelCraft is a GPU kernel visualizer and profiler built with Triton, PyTorch, and Streamlit.
Language: Python - Size: 21.5 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

ashvardanian/ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
Language: C++ - Size: 17.4 MB - Last synced at: 9 days ago - Pushed at: 23 days ago - Stars: 99 - Forks: 9

MuzzammilShah/Project-based-ML-Roadmap
A 10‑week weekday plan (1.5 h/day) emphasizes hands-on projects with integrated theory – a “code-first, theory-later” approach favored by experienced ML engineers. We focus on building real AI/ML systems and sharing them on GitHub (projects “open doors” and demonstrate skills. Each week has a clear goal and mini-project to practice key tools.
Language: Jupyter Notebook - Size: 354 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

janverschelde/PHCpack
The primary source code repository for PHCpack, a software package to solve polynomial systems with homotopy continuation methods.
Language: Ada - Size: 45.4 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 67 - Forks: 22

TianZerL/pyanime4k
An easy way to use anime4k in python
Language: Python - Size: 76.2 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 116 - Forks: 17

TSavo/chatterbox-tts-api
High-performance TTS API with voice cloning, emotion control, and synchronous MP3 generation. Built with FastAPI and powered by Chatterbox TTS.
Language: Python - Size: 82 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

davidAlgis/InteropUnityCUDA
Demonstrate interoperability between Unity Engine and CUDA
Language: C++ - Size: 4.61 MB - Last synced at: 21 days ago - Pushed at: 22 days ago - Stars: 45 - Forks: 3

otto-link/HighMap
A C++ library to generate two-dimensional terrain heightmaps for software rendering or video games.
Language: C++ - Size: 185 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 35 - Forks: 5

Vuk-Luzanin/Feynman-Kac-Parallelization-Research
Repository focused on research and implementation of parallel algorithms for the Feynman-Kac problem. Covers CPU and GPU parallelization using programming models like OpenMP, Pthreads, CUDA, and others. The goal is to explore performance optimizations and scalability across different hardware.
Language: C - Size: 3.93 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

Clemapfel/crisp
Real-time Interactive Image/Video/Audio Processing Library for Math-Averse People
Language: C++ - Size: 40.8 MB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

iot-salzburg/gpu-jupyter
GPU-Jupyter: Your GPU-accelerated JupyterLab with a rich data science toolstack, TensorFlow and PyTorch for your reproducible deep learning experiments.
Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 743 - Forks: 237

raj200501/NVIDIA-GPU-HPC-Platform
Integrates NVIDIA GPUs for HPC and edge computing. Leveraging CUDA, Jetson, and Triton Inference Server, it offers real-time data processing with Kafka and Spark. Scalable microservices with Spring Boot, Docker, Kubernetes, robust security, and CI/CD with Jenkins make it ideal for advanced computational tasks. @NVIDIA
Language: C++ - Size: 378 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 3 - Forks: 0

themahani/rctorch
rcTorch is a framework developed based on pyTorch for research on Reservoir Computing.
Language: Python - Size: 111 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

marian-nmt/marian-dev
Fast Neural Machine Translation in C++ - development repository
Language: C++ - Size: 18.7 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 273 - Forks: 130
