Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: bfloat16
starkat99/half-rs
Half-precision floating point types f16 and bf16 for Rust.
Language: Rust - Size: 560 KB - Last synced: 20 days ago - Pushed: about 2 months ago - Stars: 217 - Forks: 45
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Language: C - Size: 297 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 795 - Forks: 181
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
Language: C++ - Size: 163 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3,442 - Forks: 949
sigurd4/custom_float
Customizable floating point types, with all standard floating point operations implemented from scratch.
Language: Rust - Size: 15.1 MB - Last synced: 24 days ago - Pushed: 3 months ago - Stars: 2 - Forks: 0
aahouzi/llama2-chatbot-cpu
A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTorch with bfloat16.
Language: Python - Size: 30.3 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 6 - Forks: 0
higham/chop
Round matrix elements to lower precision in MATLAB
Language: MATLAB - Size: 52.7 KB - Last synced: 5 months ago - Pushed: almost 2 years ago - Stars: 30 - Forks: 11
DW0RKiN/Floating-point-Library-for-Z80
Floating-Point Arithmetic Library for Z80
Language: Assembly - Size: 8.32 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 19 - Forks: 3
d4l3k/go-bfloat16
Bfloat16 conversion utilities for Go/Golang
Language: Go - Size: 2.93 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 3 - Forks: 1
imciner2/ChopBLAS
Basic linear algebra routines implemented using the chop rounding function
Language: MATLAB - Size: 1.7 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1
puzzlef/pagerank-datatype
Comparison of PageRank algorithm using various datatypes.
Language: C++ - Size: 134 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0
nestordemeure/jochastic
A JAX implementation of stochastic addition.
Language: Python - Size: 21.5 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 10 - Forks: 0
puzzlef/vector-sum
Comparison of vector element sum using various data types.
Language: C++ - Size: 13.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0
afterdusk/flop
IEEE 754-style floating-point converter
Language: TypeScript - Size: 1.31 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 7 - Forks: 0
nestordemeure/stochastorch
A Pytorch implementation of stochastic addition.
Language: Python - Size: 38.1 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 0