Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: half-precision
stdlib-js/constants-float16
Half-precision floating-point mathematical constants.
Language: JavaScript - Size: 533 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 2 - Forks: 0
stdlib-js/constants-float16-cbrt-eps
Cube root of half-precision floating-point epsilon.
Language: JavaScript - Size: 272 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0
stdlib-js/constants-float16-sqrt-eps
Square root of half-precision floating-point epsilon.
Language: JavaScript - Size: 272 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 2 - Forks: 0
stdlib-js/constants-float16-num-bytes
Size (in bytes) of a half-precision floating-point number.
Language: JavaScript - Size: 278 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 2 - Forks: 0
petamoriken/float16
Stage 3 IEEE 754 half-precision floating-point ponyfill
Language: JavaScript - Size: 8.66 MB - Last synced: 20 days ago - Pushed: 21 days ago - Stars: 95 - Forks: 7
Maratyszcza/FP16
Conversion to/from half-precision floating point formats
Language: C++ - Size: 123 KB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 305 - Forks: 84
shibatch/tlfloat
Template library for floating point operations
Language: C++ - Size: 282 KB - Last synced: 21 days ago - Pushed: 21 days ago - Stars: 0 - Forks: 0
SomeRandomiOSDev/Half
Swift Half-Precision Floating Point
Language: Swift - Size: 209 KB - Last synced: 26 days ago - Pushed: 7 months ago - Stars: 12 - Forks: 2
stillwater-sc/universal
Large collection of number systems providing custom arithmetic and mixed-precision algorithms for AI, Machine Learning, Computer Vision, Signal Processing, CAE, EDA, control, optimization, estimation, and approximation.
Language: C++ - Size: 116 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 382 - Forks: 59
x448/float16
float16 provides IEEE 754 half-precision format (binary16) with correct conversions to/from float32
Language: Go - Size: 189 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 61 - Forks: 7
canbula/ieee754
Python module which finds the IEEE-754 representation of a floating point number.
Language: Python - Size: 85.9 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 24 - Forks: 5
yowidin/fast-half-float
Fast Half precision Floating point operations for C++
Language: C++ - Size: 8.79 KB - Last synced: 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
bindog/pytorch-model-parallel
A memory balanced and communication efficient FullyConnected layer with CrossEntropyLoss model parallel implementation in PyTorch
Language: Python - Size: 85 KB - Last synced: 3 months ago - Pushed: about 4 years ago - Stars: 74 - Forks: 20
nitronoid/floatingPoint
Language: C++ - Size: 29.3 KB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0
jamesalbert/halfprec
Half-precision assembly interface for C
Language: Assembly - Size: 9.77 KB - Last synced: about 2 months ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0
hma02/cublasHgemm-P100
Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm
Language: Cuda - Size: 18.6 KB - Last synced: 8 months ago - Pushed: almost 5 years ago - Stars: 35 - Forks: 11
steven-varga/h5cpp
C++17 templates between [stl::vector | armadillo | eigen3 | ublas | blitz++] and HDF5 datasets
Language: C++ - Size: 21.9 MB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 139 - Forks: 32
fengwang/float16_t (fork of acgessler/half_float)
C++20 implementation of a 16-bit floating-point type mimicking most of the IEEE 754 behavior. Single file and header-only.
Language: C++ - Size: 204 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 19 - Forks: 5
higham/chop
Round matrix elements to lower precision in MATLAB
Language: MATLAB - Size: 52.7 KB - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 30 - Forks: 11
DW0RKiN/Floating-point-Library-for-Z80
Floating-Point Arithmetic Library for Z80
Language: Assembly - Size: 8.32 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 19 - Forks: 3
minhhn2910/cuda-half2
Convert CUDA programs from float data type to half or half2 with SIMDization
Language: C++ - Size: 144 MB - Last synced: 10 months ago - Pushed: about 5 years ago - Stars: 19 - Forks: 6
DivergentClouds/subleq-linear
An implementation of the Subleq OISC using only linear operations on half-precision (16-bit) IEEE-754 floats (and a loop).
Language: Zig - Size: 12.7 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
imciner2/ChopBLAS
Basic linear algebra routines implemented using the chop rounding function
Language: MATLAB - Size: 1.7 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1
georgy7/toyfloat
A library that encodes 3 to 16 bits wide floating-point numbers.
Language: Go - Size: 1.52 MB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
enp1s0/cuMpSGEMM
Fast SGEMM emulation on Tensor Cores
Language: Cuda - Size: 396 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 3 - Forks: 0
artyom-beilis/float16
Half-float library for C and for Z80
Language: C - Size: 22.5 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 24 - Forks: 6
neslib/Neslib.Half
Half-Precision Floating-Point for Delphi
Language: Pascal - Size: 65.4 KB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 9 - Forks: 3
enp1s0/curand_fp16
FP16 pseudo random number generator on GPU
Language: Cuda - Size: 30.3 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
joeltg/fp16
Half-precision 16-bit floating point numbers
Language: JavaScript - Size: 397 KB - Last synced: 9 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
LeDuy-Vu/CS-147-Project
Implement arithmetic operations to handle half-precision numbers in MIPS instructions.
Language: Assembly - Size: 23.4 KB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
dyeo/dym
The DYM Math Library for Graphics and Game Programming
Language: C++ - Size: 440 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 2
jizhuoran/caffe-android-opencl-fp16
Optimised Caffe with OpenCL support for less powerful devices such as mobile phones
Language: C++ - Size: 50.3 MB - Last synced: almost 1 year ago - Pushed: over 5 years ago - Stars: 17 - Forks: 3
oleks/binary16
Emulating binary, half-precision IEEE-754 (2008) floats
Language: C - Size: 29.3 KB - Last synced: over 1 year ago - Pushed: about 7 years ago - Stars: 1 - Forks: 0