Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: bfloat16

starkat99/half-rs

Half-precision floating point types f16 and bf16 for Rust.

Language: Rust - Size: 560 KB - Last synced: 20 days ago - Pushed: about 2 months ago - Stars: 217 - Forks: 45

libxsmm/libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

Language: C - Size: 297 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 795 - Forks: 181

oneapi-src/oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Language: C++ - Size: 163 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3,442 - Forks: 949

sigurd4/custom_float

Customizable floating point types, with all standard floating point operations implemented from scratch.

Language: Rust - Size: 15.1 MB - Last synced: 24 days ago - Pushed: 3 months ago - Stars: 2 - Forks: 0

aahouzi/llama2-chatbot-cpu

A LLaMA2-7b chatbot with memory that runs on CPU, optimized using smooth quantization, 4-bit quantization, or Intel® Extension for PyTorch with bfloat16.

Language: Python - Size: 30.3 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 6 - Forks: 0

higham/chop

Round matrix elements to lower precision in MATLAB

Language: MATLAB - Size: 52.7 KB - Last synced: 5 months ago - Pushed: almost 2 years ago - Stars: 30 - Forks: 11

DW0RKiN/Floating-point-Library-for-Z80

Floating-Point Arithmetic Library for Z80

Language: Assembly - Size: 8.32 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 19 - Forks: 3

d4l3k/go-bfloat16

Bfloat16 conversion utilities for Go/Golang

Language: Go - Size: 2.93 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 3 - Forks: 1

imciner2/ChopBLAS

Basic linear algebra routines implemented using the chop rounding function

Language: MATLAB - Size: 1.7 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1

puzzlef/pagerank-datatype

Comparison of the PageRank algorithm using various data types.

Language: C++ - Size: 134 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

nestordemeure/jochastic

A JAX implementation of stochastic addition.

Language: Python - Size: 21.5 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 10 - Forks: 0

puzzlef/vector-sum

Comparison of vector element sum using various data types.

Language: C++ - Size: 13.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0

afterdusk/flop

IEEE 754-style floating-point converter

Language: TypeScript - Size: 1.31 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 7 - Forks: 0

nestordemeure/stochastorch

A PyTorch implementation of stochastic addition.

Language: Python - Size: 38.1 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 0