An open API service providing repository metadata for many open source software ecosystems.

Topic: "autotuning"

KernelTuner/kernel_tuner

Kernel Tuner

Language: Python - Size: 40.9 MB - Last synced at: 10 days ago - Pushed at: 15 days ago - Stars: 326 - Forks: 53

GenseeAI/cognify

Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, and DSPy programs for better quality, lower execution latency, and lower execution cost. Also includes a simple agent/workflow framework.

Language: Python - Size: 4.58 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 210 - Forks: 20

microsoft/MLOS

MLOS is a project to enable autotuning for systems.

Language: Python - Size: 324 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 153 - Forks: 71

ederwander/PyAutoTune

Autotune Module for Python "PyAutoTune"

Language: C - Size: 2.85 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 122 - Forks: 18

ChrisCummins/paper-end2end-dl

📝 "End-to-end Deep Learning of Optimization Heuristics" (🥇 PACT'17 Best Paper)

Language: TeX - Size: 177 MB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 73 - Forks: 21

ctuning/ck-autotuning

CK automation actions that let users implement portable, customizable, and reusable program workflows for reproducible, collaborative, and multi-objective benchmarking, optimization, and SW/HW co-design.

Language: Python - Size: 22.4 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 57 - Forks: 21

HiPerCoRe/KTT

Kernel Tuning Toolkit

Language: C++ - Size: 220 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 52 - Forks: 10

lac-dcc/jotai-benchmarks

Collection of executable benchmarks

Language: C - Size: 55 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 5

ytopt-team/ytopt

ytopt: machine-learning-based search methods for autotuning

Language: C - Size: 22.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 18

phrb/NODAL.jl

NODAL is an Open Distributed Autotuning Library in Julia

Language: Julia - Size: 2 MB - Last synced at: 1 day ago - Pushed at: over 6 years ago - Stars: 38 - Forks: 5

PAA-NCIC/GSWITCH

A pattern-based algorithmic autotuner for graph processing on GPUs.

Language: Cuda - Size: 1.22 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 9

halo-project/halo

😇 Wholly Adaptive LLVM Optimizer

Language: C - Size: 41.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 0

ChrisCummins/paper-synthesizing-benchmarks

📝 "Synthesizing Benchmarks for Predictive Modeling" (🥇 CGO'17 Best Paper)

Language: Jupyter Notebook - Size: 602 MB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 22 - Forks: 6

NTNU-HPC-Lab/BAT

A GPU benchmark suite for autotuners

Language: Cuda - Size: 74.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 15 - Forks: 3

Nanosim-LIG/boast

BOAST aims at providing a framework to metaprogram, benchmark and validate computing kernels

Language: Ruby - Size: 1.28 MB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 5

CharlieCurry/tvm-learning

TVM learning and research

Language: Python - Size: 203 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 6

phrb/gpu-autotuning

Autotuning NVCC Compiler Parameters, published @ CCPE Journal

Language: C - Size: 471 MB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 2

phrb/legup-tuner

Autotuning High-Level Synthesis for FPGAs, published @ ReConFig '17

Language: PostScript - Size: 27.4 MB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 1

j-r-jones/optsearch

OptSearch -- a portable tuning framework for HPC

Language: C - Size: 2.36 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ederwander/HackPitchTrack

Pitch Track Hack - Pythonic implementation of the pitch track used in the autotune patent

Language: Python - Size: 813 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

phrb/nvidia-workshop-autotuning

Resources for autotuning CUDA compiler parameters

Language: Julia - Size: 1.31 MB - Last synced at: 27 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

mahdifani14/CHStone-Codes-Annotated-By-Orio

This repository contains CHStone benchmark codes that have been annotated and tuned with the Orio tool to test execution speedup.

Language: Verilog - Size: 111 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

phrb/ccgrid19

Autotuning Source Transformation Tools with Design of Experiments, published @ CCGRID'19

Language: TeX - Size: 12.8 MB - Last synced at: 27 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

specs-feup/LAT-Lara-Autotuning-Tool

💽 C/C++ autotuning tool that follows the concept of ISAT implemented in LARA (AOP + JavaScript)

Size: 1.05 MB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

ChrisCummins/paper-autotuning-opencl-wgsize

"Autotuning OpenCL Workgroup Size for Stencil Patterns" (ADAPT 2016)

Language: TeX - Size: 7.82 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

mahdifani14/AutoTuning

Auto-tuning chain to optimize software execution and compilation time on heterogeneous systems

Size: 550 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

p-anastas/PARALiA-GEMMex

Language: C++ - Size: 1.58 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

ChrisCummins/paper-towards-collaborative-performance-tuning

"Towards Collaborative Performance Tuning of Algorithmic Skeletons" (HLPGPU 2016)

Language: TeX - Size: 2.03 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

bcosenza/patus-aa (fork of matthias-christen/patus)

Patus-AA is an extension of the Patus compiler that adds a new auto-tuning framework based on machine learning and a new backend that generates code for ARM processors with NEON support.

Language: Java - Size: 82.3 MB - Last synced at: 5 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

fvella/AIdaptive

Towards a new generation of adaptive run-time and heuristic selection frameworks

Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

phrb/autotuning-gce-docs

Autotuning with OpenTuner and Cloud Computing

Language: TeX - Size: 19.1 MB - Last synced at: 27 days ago - Pushed at: about 9 years ago - Stars: 1 - Forks: 0

3it-inpaqt/line-classification-slope

Classification task from experimental charge stability diagrams to recognize angles of lines.

Language: Python - Size: 629 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ghbrown/mg_tune

Automatic tuning of multigrid parameters using black box optimization

Language: Python - Size: 95.7 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

phrb/cacti-tuner

Autotuning the HPE CACTI memory modelling tool

Language: C++ - Size: 1.04 MB - Last synced at: 27 days ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0