An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gpu-acceleration

tensorflow/tfjs-core 📦

WebGL-accelerated ML // linear algebra // automatic differentiation for JavaScript.

Language: TypeScript - Size: 362 MB - Last synced at: about 7 hours ago - Pushed at: almost 6 years ago - Stars: 8,477 - Forks: 944

SamuelSchmidgall/SurgicalGym

High-performance GPU-based simulation platform for reinforcement learning with surgical robot learning

Language: Python - Size: 47.9 MB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 74 - Forks: 6

Krasnomakov/EventDrivenArchitecture

Prototypes of Event-Driven Architecture with Computer Vision, games, aniamtion and LLM models

Language: Python - Size: 110 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

TurakhiaLab/TWILIGHT

High throughput tool for tall and wide multiple sequence alignment.

Language: C++ - Size: 42.9 MB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 3 - Forks: 1

tensorflow/tfjs

A WebGL accelerated JavaScript library for training and deploying ML models.

Language: TypeScript - Size: 166 MB - Last synced at: about 7 hours ago - Pushed at: 14 days ago - Stars: 18,853 - Forks: 1,973

uncomplicate/clojurecuda

Clojure library for CUDA development

Language: Clojure - Size: 511 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 186 - Forks: 10

kunitoki/yup

YUP is an open-source library dedicated to empowering developers with advanced tools for cross-platform application development.

Language: C++ - Size: 25.2 MB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 83 - Forks: 10

AI4Finance-Foundation/RLSolver

Solvers for NP-hard and NP-complete problems with an emphasis on high-performance GPU computing.

Language: Python - Size: 60.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 151 - Forks: 35

EMI-Group/evox

Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.

Language: Python - Size: 42.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 654 - Forks: 95

weatherisgood2/ngpt

A lightweight Python CLI and library for interacting with OpenAI-compatible APIs, supporting both official and self-hosted LLM endpoints.

Language: Python - Size: 133 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

MrGKanev/TensorFlow-GPU-Docker-Setup

A Docker environment for TensorFlow GPU development with optimized configurations for WSL2, troubleshooting guides, and common error fixes

Language: Python - Size: 43.9 KB - Last synced at: about 9 hours ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Caltech-Biophotonics-Lab/fpm-96eyes-reconstruction Fork of antonysigma/fpm-96eyes-reconstruction

GPU-accelerated Fourier Ptychography for brightfield-only images

Language: C++ - Size: 73.2 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

matteospanio/torchfx

A GPU accelerated and torch based audio DSP library

Language: Python - Size: 5.31 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 84 - Forks: 4

IntelPython/dpnp

Data Parallel Extension for NumPy

Language: Python - Size: 748 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 109 - Forks: 22

mikeroyal/GPU-Guide

Graphics Processing Unit (GPU) Architecture Guide

Language: Shell - Size: 815 KB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 215 - Forks: 18

NVIDIA/cccl

CUDA Core Compute Libraries

Language: C++ - Size: 82.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,690 - Forks: 224

8e8bdba457c18cf692a95fe2ec67000b/VulkanCooperativeMatrixAttention

Vulkan & GLSL implementation of FlashAttention-2

Size: 1.95 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

kiankyars/Ultra-Scale-Playbook-Series

Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Jaysmito101/TerraForge3D

Cross Platform Professional Procedural Terrain Generation & Texturing Tool

Language: C++ - Size: 630 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,066 - Forks: 98

NikolasEnt/ollama-webui-intel

Ollama with intel (i)GPU acceleration in docker and benchmark

Language: Python - Size: 1.55 MB - Last synced at: about 13 hours ago - Pushed at: 4 days ago - Stars: 14 - Forks: 4

JuliaHealth/KomaMRI.jl

Koma is a Pulseq-compatible framework to efficiently simulate Magnetic Resonance Imaging (MRI) acquisitions. The main focus of this package is to simulate general scenarios that could arise in pulse sequence development.

Language: Julia - Size: 575 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 148 - Forks: 23

thomasvrussell/sfft

Saccadic Fast Fourier Transform (SFFT) algorithm for Image subtraction in Fourier space

Language: Jupyter Notebook - Size: 441 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 53 - Forks: 9

TianZerL/Anime4KCPP

A high performance anime upscaler

Language: C++ - Size: 7.67 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,895 - Forks: 146

NexusGPU/tensor-fusion-site

TensorFusion landing page and product docs

Language: Vue - Size: 1.59 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 2

Project-HAMi/HAMi

Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)

Language: Go - Size: 129 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,791 - Forks: 314

ROCm/Tensile

[DEPRECATED] Moved to ROCm/rocm-libraries repo

Language: Python - Size: 95.2 MB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 245 - Forks: 166

ParaGroup/WindFlow

A C++17 Data Stream Processing Parallel Library for Multicores and GPUs

Language: C++ - Size: 48.9 MB - Last synced at: about 5 hours ago - Pushed at: 4 months ago - Stars: 84 - Forks: 19

ProjectPhysX/OpenCL-Wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

Language: C++ - Size: 344 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 406 - Forks: 40

beehive-lab/TornadoVM

TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

Language: Java - Size: 152 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 1,261 - Forks: 119

Autodesk/Neon

Multi-GPU Framework for Voxel Grid Computations

Language: C++ - Size: 102 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 57 - Forks: 14

raphamorim/rio

A hardware-accelerated GPU terminal emulator focusing to run in desktops and browsers.

Language: Rust - Size: 276 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5,168 - Forks: 195

AlejandroAmat/3dgs-vulkan-cpp

Cross-platform Vulkan 3D Gaussian Splatting renderer - Windows/Mac/Linux, any GPU, with Python binding support

Language: C++ - Size: 131 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 6 - Forks: 0

dgasmith/opt_einsum

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Language: Python - Size: 4.11 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 918 - Forks: 73

rajat709/Cloud-API-Builder

Dataoorts Cloud Frontend

Language: Python - Size: 20.6 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Shenggan/xfold

Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction

Language: Python - Size: 2.2 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 35 - Forks: 4

NVIDIA/warp

A Python framework for accelerated simulation, data generation and spatial computing.

Language: Python - Size: 48.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 5,201 - Forks: 319

NCAR/micm

A model-independent chemistry module for atmosphere models

Language: C++ - Size: 46.5 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 7

mitmath/JuliaComputation

Repository for Common Ground C25

Language: Julia - Size: 69.7 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 101 - Forks: 14

NVIDIA/GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Language: Jupyter Notebook - Size: 91.8 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 3,184 - Forks: 764

NVIDIA/optix-dev

OptiX SDK headers, everything needed to build & run OptiX applications. SDK samples not included.

Language: C++ - Size: 186 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 2

EMI-Group/evomo

EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO)

Language: Python - Size: 995 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 78 - Forks: 8

bgin/Radar-ElectroOptical-Simulation

(REOS) Radar and Electro-Optical Simulation Framework written in C++.

Language: C++ - Size: 31.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 60 - Forks: 20

nakib/elphbolt

A solver for the coupled and decoupled electron and phonon Boltzmann transport equations.

Language: Fortran - Size: 12.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 51 - Forks: 29

yifanzhu-fluid/PCOMCOT2.1

2.1 version of PCOMCOT—An efficient parallel dispersive tsunami model

Language: Fortran - Size: 66.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 8 - Forks: 0

Chaz-Ortiz/EMBA-CS-4613-901

EMBA is a firmware security analysis tool. I created a module for EMBA (p51 mustang binwalk extractor) that checks if the firmware file system being analyzed has a deep max depth that can benefit from an optimized extractor (>50 levels deep), leverages GPUs for processing speed and uses parallelization for file operations.

Language: Shell - Size: 44.9 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

DanteMichaeli/GPU-option-pricing-thesis

(In progress) Bachelor's thesis on GPU acceleration of the CRR binomial tree model and Monte Carlo option pricing

Language: TeX - Size: 1.38 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

RobinKa/tfga

Python package for Geometric / Clifford Algebra with TensorFlow

Language: Jupyter Notebook - Size: 2.31 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 52 - Forks: 7

bgin/Radar_ElectroOptical_Simulation

(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.

Language: Fortran - Size: 51.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 50 - Forks: 15

NVIDIA/TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language: C++ - Size: 136 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 11,706 - Forks: 2,199

BotBlake/jellybench_py

Client for the Jellyfin Hardware Survey Server (https://hwa.jellyfin.org/). Benchmarks hardware performance for simultaneous ffmpeg transcoding, enabling detailed comparisons and insights for optimizing Jellyfin setups.

Language: Python - Size: 393 KB - Last synced at: 5 days ago - Pushed at: 10 days ago - Stars: 21 - Forks: 12

real-space/AngstromCube

A parallel and GPU-accelerated Code for Real-Space All-Electron Linear-Scaling Density Functional Theory

Language: C++ - Size: 33.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 7 - Forks: 2

kklmn/xrt

Package xrt (XRayTracer) is a python software library for ray tracing and wave propagation in x-ray regime. It is primarily meant for modeling synchrotron sources, beamlines and beamline elements.

Language: Python - Size: 472 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 93 - Forks: 31

Charmve/AccANN

🐆 A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration for *AdderNet*

Size: 396 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 1

dominic-chang/Krang.jl

Fast analytic raytracing around Kerr black holes

Language: Julia - Size: 290 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 14 - Forks: 5

GalacticDynamics/galax

Galactic and Gravitational Dynamics in Python (+ GPU and autodiff)

Language: Python - Size: 5.64 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 39 - Forks: 8

z6cc/LocalWebAI

LocalWebAI: Run AI models directly in browsers & Node.js with no backend or API calls. Privacy-first, offline-capable LLM inference engine powered by WebAssembly (WASM)

Language: TypeScript - Size: 29.3 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

Alisah-Ozcan/HEonGPU

HEonGPU is a high-performance library that optimizes Fully Homomorphic Encryption (FHE) on GPUs. Leveraging GPU parallelism, it reduces computational load through concurrent execution. Its multi-stream architecture minimizes data transfer overhead, making it ideal for large-scale encrypted computations with reduced latency.

Language: Cuda - Size: 547 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 71 - Forks: 19

Alisah-Ozcan/GPU-FFT

Welcome to the GPU-FFT-Optimization repository! We present cutting-edge algorithms and implementations for optimizing the Fast Fourier Transform (FFT) on Graphics Processing Units (GPUs).

Language: Cuda - Size: 85.9 KB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 16 - Forks: 2

AsadiAhmad/100_Sports_Image_Classification

A deep learning project for sport image classification using a custom VGG19-based architecture with integrated Grad-CAM heatmap visualization for model interpretability.

Language: Jupyter Notebook - Size: 4.08 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

BlazingDB/blazingsql

BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.

Language: C++ - Size: 41.4 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 1,979 - Forks: 184

tinh2044/YOLO12-UnderWater

YOLOv12 Underwater Object Detection is an open-source suite for underwater object detection, built on YOLOv12. It offers an end-to-end pipeline with GPU-accelerated training, customizable data augmentations, real-time inference via Gradio, and support for model export (ONNX & PyTorch).

Language: Python - Size: 52.1 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

NVlabs/sionna-rt

Sionna RT: The Ray Tracing Package of Sionna

Language: Jupyter Notebook - Size: 27.8 MB - Last synced at: 9 days ago - Pushed at: 16 days ago - Stars: 42 - Forks: 15

baggepinnen/MonteCarloMeasurements.jl

Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.

Language: Julia - Size: 4.91 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 275 - Forks: 18

dominic-chang/JacobiElliptic.jl

Elliptic integrals and Jacobi elliptic functions that are GPU friendly and auto differentiable

Language: Julia - Size: 988 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 8 - Forks: 1

LLNL/CARE

CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.

Language: C++ - Size: 1.47 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 30 - Forks: 4

Dodotree/webgl_packing

The technique for processing binary (black and white) image data using WebGL.

Language: JavaScript - Size: 49.8 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

ivanZanardi/pyharmx

Polyharmonic spline interpolation in PyTorch

Language: Python - Size: 5.21 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

AMReX-Combustion/PeleLMeX

An adaptive mesh hydrodynamics simulation code for low Mach number reacting flows without level sub-cycling.

Language: C++ - Size: 28.3 MB - Last synced at: 5 days ago - Pushed at: 16 days ago - Stars: 39 - Forks: 50

NVlabs/sionna

Sionna: An Open-Source Library for Research on Communication Systems

Language: Jupyter Notebook - Size: 260 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1,026 - Forks: 303

AntoninHorkel/paint

GPU accelerated paint app in Rust

Language: Rust - Size: 57.6 KB - Last synced at: 3 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

qcpydev/qcpy

qc simulator

Language: Python - Size: 54.2 MB - Last synced at: 9 days ago - Pushed at: 16 days ago - Stars: 12 - Forks: 2

anormi001/chatterbox-tts-api

Chatterbox TTS API is a FastAPI-powered REST API designed for text-to-speech applications. It offers seamless integration and efficient performance, making it a great choice for developers looking to enhance their projects. ⭐️👩💻

Language: Python - Size: 253 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

thieu1995/WaveletML

WaveletML: A Scalable and Extensible Wavelet Neural Network Framework

Language: Python - Size: 179 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

ucl-bug/jwave

A JAX-based research framework for differentiable and parallelizable acoustic simulations, on CPU, GPUs and TPUs

Language: Python - Size: 54.8 MB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 170 - Forks: 21

beehive-lab/kfusion-tornadovm

🎥 A Java implementation of Kinect Fusion running on Tornado VM.

Language: Java - Size: 819 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 24 - Forks: 8

DiamondLightSource/httomo

High-throughput tomography pipeline

Language: Python - Size: 284 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 8 - Forks: 4

DiamondLightSource/httomolibgpu

A library of GPU-enabled data processing and reconstruction methods for tomography

Language: Python - Size: 126 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 6

viam-modules/viam-mlmodelservice-triton

MLModelService wrapping Nvidia's Triton Server

Language: C++ - Size: 168 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 5 - Forks: 6

Awesomegamergame/FfmpegConverter

A .NET Framework 4.8, C# Console app which converts any video file dropped onto it or in the same folder as it into a mkv file with the av1 codec using either qsv from intel or nvenc from nvidia on supported cards. This is all possible by passing command arguments to ffmpeg.

Language: C# - Size: 69.3 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

eszdman/PhotonCamera

Android Camera that uses Enhanced image processing

Language: Java - Size: 22.8 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 865 - Forks: 78

NVIDIA-Merlin/HugeCTR

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

Language: C++ - Size: 55.7 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 1,009 - Forks: 205

AudioKit/Waveform

GPU accelerated waveform view

Language: Swift - Size: 4.62 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 227 - Forks: 16

nlesc-dirac/sagecal

SAGECal is a fast, memory efficient and GPU accelerated radio interferometric calibration program. It supports all source models including points, Gaussians and Shapelets. Distributed calibration using MPI and consensus optimization is enabled. Both spectral and spatial priors can be used as constraints. Tools to build/restore sky models are included.

Language: C - Size: 39.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 13 - Forks: 8

SCRobarts/OptiFax

OptiFaχ - Nonlinear Optical Facsimile

Language: MATLAB - Size: 33.4 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 4 - Forks: 1

curtisgray/wingman

Wingman is the fastest and easiest way to run Llama models on your PC or Mac.

Language: TypeScript - Size: 188 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 41 - Forks: 2

MegviiRobot/MegBA

MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment

Language: Cuda - Size: 1.3 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 467 - Forks: 61

Maria-Antony/KernelCraft

KernelCraft is a GPU kernel visualizer and profiler built with Triton, PyTorch, and Streamlit.

Language: Python - Size: 21.5 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

ashvardanian/ParallelReductionsBenchmark

Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!

Language: C++ - Size: 17.4 MB - Last synced at: 9 days ago - Pushed at: 23 days ago - Stars: 99 - Forks: 9

MuzzammilShah/Project-based-ML-Roadmap

A 10‑week weekday plan (1.5 h/day) emphasizes hands-on projects with integrated theory – a “code-first, theory-later” approach favored by experienced ML engineers. We focus on building real AI/ML systems and sharing them on GitHub (projects “open doors” and demonstrate skills. Each week has a clear goal and mini-project to practice key tools.

Language: Jupyter Notebook - Size: 354 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

janverschelde/PHCpack

The primary source code repository for PHCpack, a software package to solve polynomial systems with homotopy continuation methods.

Language: Ada - Size: 45.4 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 67 - Forks: 22

TianZerL/pyanime4k

An easy way to use anime4k in python

Language: Python - Size: 76.2 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 116 - Forks: 17

TSavo/chatterbox-tts-api

High-performance TTS API with voice cloning, emotion control, and synchronous MP3 generation. Built with FastAPI and powered by Chatterbox TTS.

Language: Python - Size: 82 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

davidAlgis/InteropUnityCUDA

Demonstrate interoperability between Unity Engine and CUDA

Language: C++ - Size: 4.61 MB - Last synced at: 21 days ago - Pushed at: 22 days ago - Stars: 45 - Forks: 3

otto-link/HighMap

A C++ library to generate two-dimensional terrain heightmaps for software rendering or video games.

Language: C++ - Size: 185 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 35 - Forks: 5

Vuk-Luzanin/Feynman-Kac-Parallelization-Research

Repository focused on research and implementation of parallel algorithms for the Feynman-Kac problem. Covers CPU and GPU parallelization using programming models like OpenMP, Pthreads, CUDA, and others. The goal is to explore performance optimizations and scalability across different hardware.

Language: C - Size: 3.93 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

Clemapfel/crisp

Real-time Interactive Image/Video/Audio Processing Library for Math-Averse People

Language: C++ - Size: 40.8 MB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

iot-salzburg/gpu-jupyter

GPU-Jupyter: Your GPU-accelerated JupyterLab with a rich data science toolstack, TensorFlow and PyTorch for your reproducible deep learning experiments.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 743 - Forks: 237

raj200501/NVIDIA-GPU-HPC-Platform

Integrates NVIDIA GPUs for HPC and edge computing. Leveraging CUDA, Jetson, and Triton Inference Server, it offers real-time data processing with Kafka and Spark. Scalable microservices with Spring Boot, Docker, Kubernetes, robust security, and CI/CD with Jenkins make it ideal for advanced computational tasks. @NVIDIA

Language: C++ - Size: 378 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 3 - Forks: 0

themahani/rctorch

rcTorch is a framework developed based on pyTorch for research on Reservoir Computing.

Language: Python - Size: 111 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

marian-nmt/marian-dev

Fast Neural Machine Translation in C++ - development repository

Language: C++ - Size: 18.7 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 273 - Forks: 130