Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: half-precision

stdlib-js/constants-float16

Half-precision floating-point mathematical constants.

Language: JavaScript - Size: 533 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float16-cbrt-eps

Cube root of half-precision floating-point epsilon.

Language: JavaScript - Size: 272 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float16-sqrt-eps

Square root of half-precision floating-point epsilon.

Language: JavaScript - Size: 272 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float16-num-bytes

Size (in bytes) of a half-precision floating-point number.

Language: JavaScript - Size: 278 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 2 - Forks: 0

petamoriken/float16

Stage 3 IEEE 754 half-precision floating-point ponyfill

Language: JavaScript - Size: 8.66 MB - Last synced: 20 days ago - Pushed: 21 days ago - Stars: 95 - Forks: 7

Maratyszcza/FP16

Conversion to/from half-precision floating point formats

Language: C++ - Size: 123 KB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 305 - Forks: 84

shibatch/tlfloat

Template library for floating point operations

Language: C++ - Size: 282 KB - Last synced: 21 days ago - Pushed: 21 days ago - Stars: 0 - Forks: 0

SomeRandomiOSDev/Half

Swift Half-Precision Floating Point

Language: Swift - Size: 209 KB - Last synced: 26 days ago - Pushed: 7 months ago - Stars: 12 - Forks: 2

stillwater-sc/universal

Large collection of number systems providing custom arithmetic and mixed-precision algorithms for AI, Machine Learning, Computer Vision, Signal Processing, CAE, EDA, control, optimization, estimation, and approximation.

Language: C++ - Size: 116 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 382 - Forks: 59

x448/float16

float16 provides IEEE 754 half-precision format (binary16) with correct conversions to/from float32

Language: Go - Size: 189 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 61 - Forks: 7

canbula/ieee754

Python module which finds the IEEE-754 representation of a floating point number.

Language: Python - Size: 85.9 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 24 - Forks: 5

yowidin/fast-half-float

Fast Half precision Floating point operations for C++

Language: C++ - Size: 8.79 KB - Last synced: 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

bindog/pytorch-model-parallel

A memory balanced and communication efficient FullyConnected layer with CrossEntropyLoss model parallel implementation in PyTorch

Language: Python - Size: 85 KB - Last synced: 3 months ago - Pushed: about 4 years ago - Stars: 74 - Forks: 20

nitronoid/floatingPoint

Language: C++ - Size: 29.3 KB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0

jamesalbert/halfprec

Half-precision assembly interface for C

Language: Assembly - Size: 9.77 KB - Last synced: about 2 months ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0

hma02/cublasHgemm-P100

Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm

Language: Cuda - Size: 18.6 KB - Last synced: 8 months ago - Pushed: almost 5 years ago - Stars: 35 - Forks: 11

steven-varga/h5cpp

C++17 templates between [stl::vector | armadillo | eigen3 | ublas | blitz++] and HDF5 datasets

Language: C++ - Size: 21.9 MB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 139 - Forks: 32

fengwang/float16_t Fork of acgessler/half_float

CPP20 implementation of a 16-bit floating-point type mimicking most of the IEEE 754 behavior. Single file and header-only.

Language: C++ - Size: 204 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 19 - Forks: 5

higham/chop

Round matrix elements to lower precision in MATLAB

Language: MATLAB - Size: 52.7 KB - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 30 - Forks: 11

DW0RKiN/Floating-point-Library-for-Z80

Floating-Point Arithmetic Library for Z80

Language: Assembly - Size: 8.32 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 19 - Forks: 3

minhhn2910/cuda-half2

Convert CUDA programs from float data type to half or half2 with SIMDization

Language: C++ - Size: 144 MB - Last synced: 10 months ago - Pushed: about 5 years ago - Stars: 19 - Forks: 6

DivergentClouds/subleq-linear

An implimentation of the Subleq OISC using only linear operations on half-precision (16 bit) IEEE-754 floats (and a loop).

Language: Zig - Size: 12.7 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

imciner2/ChopBLAS

Basic linear algebra routines implemented using the chop rounding function

Language: MATLAB - Size: 1.7 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1

georgy7/toyfloat

A library that encodes 3 to 16 bits wide floating-point numbers.

Language: Go - Size: 1.52 MB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

enp1s0/cuMpSGEMM

Fast SGEMM emulation on Tensor Cores

Language: Cuda - Size: 396 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 3 - Forks: 0

artyom-beilis/float16

half float library for C and for z80

Language: C - Size: 22.5 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 24 - Forks: 6

neslib/Neslib.Half

Half-Precision Floating-Point for Delphi

Language: Pascal - Size: 65.4 KB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 9 - Forks: 3

enp1s0/curand_fp16

FP16 pseudo random number generator on GPU

Language: Cuda - Size: 30.3 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

joeltg/fp16

Half-precision 16-bit floating point numbers

Language: JavaScript - Size: 397 KB - Last synced: 9 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

LeDuy-Vu/CS-147-Project

Implement arithmetic operations to handle half-precision numbers in MIPS instructions.

Language: Assembly - Size: 23.4 KB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

dyeo/dym

The DYM Math Library for Graphics and Game Programming

Language: C++ - Size: 440 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 2

jizhuoran/caffe-android-opencl-fp16

Optimised Caffe with OpenCL supporting for less powerful devices such as mobile phones

Language: C++ - Size: 50.3 MB - Last synced: almost 1 year ago - Pushed: over 5 years ago - Stars: 17 - Forks: 3

oleks/binary16

Emulating binary, half-precision IEEE-754 (2008) floats

Language: C - Size: 29.3 KB - Last synced: over 1 year ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0