Topic: "autotuning"
KernelTuner/kernel_tuner
Kernel Tuner
Language: Python - Size: 40.9 MB - Last synced at: 10 days ago - Pushed at: 15 days ago - Stars: 326 - Forks: 53

GenseeAI/cognify
Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, and DSPy programs for better quality, lower execution latency, and lower execution cost. Also includes a simple agent/workflow framework.
Language: Python - Size: 4.58 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 210 - Forks: 20

microsoft/MLOS
MLOS is a project to enable autotuning for systems.
Language: Python - Size: 324 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 153 - Forks: 71

ederwander/PyAutoTune
Autotune Module for Python "PyAutoTune"
Language: C - Size: 2.85 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 122 - Forks: 18

ChrisCummins/paper-end2end-dl
📝 "End-to-end Deep Learning of Optimization Heuristics" (🥇 PACT'17 Best Paper)
Language: TeX - Size: 177 MB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 73 - Forks: 21

ctuning/ck-autotuning
CK automation actions to let users implement portable, customizable and reusable program workflows for reproducible, collaborative and multi-objective benchmarking, optimization and SW/HW co-design
Language: Python - Size: 22.4 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 57 - Forks: 21

HiPerCoRe/KTT
Kernel Tuning Toolkit
Language: C++ - Size: 220 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 52 - Forks: 10

lac-dcc/jotai-benchmarks
Collection of executable benchmarks
Language: C - Size: 55 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 5

ytopt-team/ytopt
ytopt: machine-learning-based search methods for autotuning
Language: C - Size: 22.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 18

phrb/NODAL.jl
NODAL is an Open Distributed Autotuning Library in Julia
Language: Julia - Size: 2 MB - Last synced at: 1 day ago - Pushed at: over 6 years ago - Stars: 38 - Forks: 5

PAA-NCIC/GSWITCH
A pattern-based algorithmic autotuner for graph processing on GPUs.
Language: Cuda - Size: 1.22 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 9

halo-project/halo
😇 Wholly Adaptive LLVM Optimizer
Language: C - Size: 41.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 0

ChrisCummins/paper-synthesizing-benchmarks
📝 "Synthesizing Benchmarks for Predictive Modeling" (🥇 CGO'17 Best Paper)
Language: Jupyter Notebook - Size: 602 MB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 22 - Forks: 6

NTNU-HPC-Lab/BAT
A GPU benchmark suite for autotuners
Language: Cuda - Size: 74.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 15 - Forks: 3

Nanosim-LIG/boast
BOAST aims at providing a framework to metaprogram, benchmark and validate computing kernels
Language: Ruby - Size: 1.28 MB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 5

CharlieCurry/tvm-learning
TVM learning and research
Language: Python - Size: 203 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 6

phrb/gpu-autotuning
Autotuning NVCC Compiler Parameters, published @ CCPE Journal
Language: C - Size: 471 MB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 2

phrb/legup-tuner
Autotuning High-Level Synthesis for FPGAs, published @ ReConFig '17
Language: PostScript - Size: 27.4 MB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 1

j-r-jones/optsearch
OptSearch -- a portable tuning framework for HPC
Language: C - Size: 2.36 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ederwander/HackPitchTrack
Pitch Track Hack - Pythonic Implementation of the Pitch track used in autotune patent
Language: Python - Size: 813 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

phrb/nvidia-workshop-autotuning
Resources for autotuning CUDA compiler parameters
Language: Julia - Size: 1.31 MB - Last synced at: 27 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

mahdifani14/CHStone-Codes-Annotated-By-Orio
This repository contains CHStone benchmark codes that have been annotated and tuned with the Orio tool to test execution speedup.
Language: Verilog - Size: 111 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

phrb/ccgrid19
Autotuning Source Transformation Tools with Design of Experiments, published @ CCGRID'19
Language: TeX - Size: 12.8 MB - Last synced at: 27 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

specs-feup/LAT-Lara-Autotuning-Tool
💽 C/C++ autotuning tool that follows the concept of ISAT implemented in LARA (AOP + JavaScript)
Size: 1.05 MB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

ChrisCummins/paper-autotuning-opencl-wgsize
"Autotuning OpenCL Workgroup Size for Stencil Patterns" (ADAPT 2016)
Language: TeX - Size: 7.82 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

mahdifani14/AutoTuning
Auto-tuning chain to optimize software execution and compilation time on heterogeneous systems
Size: 550 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

p-anastas/PARALiA-GEMMex
Language: C++ - Size: 1.58 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

ChrisCummins/paper-towards-collaborative-performance-tuning
"Towards Collaborative Performance Tuning of Algorithmic Skeletons" (HLPGPU 2016)
Language: TeX - Size: 2.03 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

bcosenza/patus-aa (fork of matthias-christen/patus)
Patus-AA is an extension of the Patus compiler that adds a new auto-tuning framework based on machine learning and a new backend that generates code for ARM processors with NEON support.
Language: Java - Size: 82.3 MB - Last synced at: 5 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

fvella/AIdaptive
Towards a new generation of adaptive run-time and heuristic selection frameworks
Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

phrb/autotuning-gce-docs
Autotuning with OpenTuner and Cloud Computing
Language: TeX - Size: 19.1 MB - Last synced at: 27 days ago - Pushed at: about 9 years ago - Stars: 1 - Forks: 0

3it-inpaqt/line-classification-slope
Classification task from experimental charge stability diagrams to recognize angles of lines.
Language: Python - Size: 629 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ghbrown/mg_tune
Automatic tuning of multigrid parameters using black box optimization
Language: Python - Size: 95.7 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

phrb/cacti-tuner
Autotuning the HPE CACTI memory modelling tool
Language: C++ - Size: 1.04 MB - Last synced at: 27 days ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0
