An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: floating-point

death-rayz/constants-float64-max-nth-factorial

Maximum nth factorial when stored in double-precision floating-point format.

Language: JavaScript - Size: 43.9 KB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

Rag322/constants-float32-catalan

Catalan's constant.

Language: JavaScript - Size: 47.9 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

mpmath/mpmath

Python library for arbitrary-precision floating-point arithmetic

Language: Python - Size: 17.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,022 - Forks: 190

DeshmukH9921/constants-float32-eulergamma

The Euler-Mascheroni constant.

Language: JavaScript - Size: 47.9 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

giocip/LINUX_num7

num7 ISO C++14 Standard 64-BIT LIBRARY, ARBITRARY-PRECISION GENERAL PURPOSE ARITHMETIC-LOGIC DECIMAL CLASS FOR AMD64 ARCHITECTURE

Language: C++ - Size: 2.84 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

gangmavaddsaw/constants-float32-fourth-root-eps

Fourth root of single-precision floating-point epsilon.

Language: JavaScript - Size: 43.9 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

chrissimpkins/vectora

A Rust library for n-dimensional vector computation with real and complex scalar data

Language: Rust - Size: 299 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 9 - Forks: 0

VoidStarKat/half-rs

Half-precision floating point types f16 and bf16 for Rust.

Language: Rust - Size: 622 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 250 - Forks: 61

opencompl/fp.lean

Floating Point Semantics Mechanization for Lean

Language: Lean - Size: 3.11 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

isaaqq98/constants-float32-max-safe-nth-double-factorial

Maximum safe nth double factorial when stored in single-precision floating-point format.

Language: JavaScript - Size: 43 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

stdlib-js/math-base-special-copysignf

Return a single-precision floating-point number with the magnitude of x and the sign of y.

Language: Python - Size: 758 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

fmtlib/fmt

A modern formatting library

Language: C++ - Size: 16 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 21,737 - Forks: 2,626

herbie-fp/herbie

Optimize floating-point expressions for accuracy

Language: HTML - Size: 81.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 798 - Forks: 38

x448/float16

float16 provides IEEE 754 half-precision format (binary16) with correct conversions to/from float32

Language: Go - Size: 181 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 75 - Forks: 8

stdlib-js/number-float32-base

Base utilities for single-precision floating-point numbers.

Language: JavaScript - Size: 1.53 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-sqrt-half

Square root of 1/2 as a single-precision floating-point number.

Language: JavaScript - Size: 140 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-exponent-bias

The bias of a single-precision floating-point number's exponent.

Language: JavaScript - Size: 331 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

sandialabs/elaenia

Automated Error Analysis of Numerical Software for High-Consequence Systems

Language: OCaml - Size: 472 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 6 - Forks: 1

giocip/ANDROID_num7

C++ ARBITRARY PRECISION ARITHMETIC-LOGIC DECIMAL LIBRARY FOR ANDROID 7.0 up

Language: C++ - Size: 971 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

feiyingjun/constants-float64-max-nth-double-factorial

Maximum nth double factorial when stored in double-precision floating-point format.

Language: JavaScript - Size: 43.9 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

szcompressor/cuSZp

Fast GPU error-bounded lossy compressor for floating-point data.

Language: Cuda - Size: 97.7 KB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 36 - Forks: 13

herbie-fp/odyssey

A platform for exploring floating-point expressions :boat:

Language: TypeScript - Size: 40.2 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 22 - Forks: 1

apytypes/apytypes

APyTypes - Algorithmic data types for Python

Language: C++ - Size: 14.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 22 - Forks: 2

abdk-consulting/abdk-libraries-solidity

Open-Source Libraries for Solidity by ABDK Consulting

Language: Solidity - Size: 21.5 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 410 - Forks: 114

io7m-com/ieee754b16

Functions for converting to/from IEEE754 binary16 values

Language: Java - Size: 1.63 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 1

invision-trading/num

A Java library that abstracts the mathematical operations on real decimal numbers represented in computer memory as floating-point binary numbers or arbitrary-precision decimal numbers.

Language: Java - Size: 269 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

JeffreySarnoff/ArbNumerics.jl

extended precision math, accurate and performant

Language: Julia - Size: 16 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 85 - Forks: 17

edf-hpc/verrou

floating-point errors checker

Language: C - Size: 8.97 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 56 - Forks: 14

28921ijkfd/constants-float32-max-nth-factorial

Maximum nth factorial when stored in single-precision floating-point format.

Language: JavaScript - Size: 43.9 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

keshavsaharia/numbers

Number Representations & States

Language: MDX - Size: 24.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

stdlib-js/number-float32-base-exponent

Return an integer corresponding to the unbiased exponent of a single-precision floating-point number.

Language: Python - Size: 400 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

JuliaMath/ChangePrecision.jl

macro to change the default floating-point precision in Julia code

Language: Julia - Size: 44.9 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 39 - Forks: 7

LLNL/FPChecker

A dynamic analysis tool to detect floating-point errors in HPC applications.

Language: Python - Size: 8.36 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 34 - Forks: 4

stdlib-js/number-float32-base-significand

Return an integer corresponding to the significand of a single-precision floating-point number.

Language: Python - Size: 557 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

Alexhuszagh/rust-lexical

Fast numeric to- and from-string conversion routines.

Language: Rust - Size: 124 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 326 - Forks: 40

giocip/WINDOWS_num7

num7 ISO C++14 Standard 64-BIT LIBRARY, ARBITRARY-PRECISION GENERAL PURPOSE ARITHMETIC-LOGIC DECIMAL CLASS FOR WINDOWS 10/11

Language: C++ - Size: 1.42 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

devlinzhou/deterministic_float

fast soft float-point for deterministic computing,高性能、一致性计算的软件浮点数

Language: C++ - Size: 513 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 45 - Forks: 10

Jonny-exe/binary-fractions

A Python package for floating-point binary fractions. Do math in base 2!

Language: Python - Size: 375 KB - Last synced at: about 20 hours ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 4

user1095108/dpp

decimal floating-point number library

Language: C++ - Size: 13.9 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 1

thoughtworks/hardposit-chisel3

Chisel library for Unum Type-III Posit Arithmetic

Language: C++ - Size: 4.31 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 38 - Forks: 10

apache/commons-numbers

Apache Commons Numbers

Language: Java - Size: 29.7 MB - Last synced at: 2 days ago - Pushed at: 8 days ago - Stars: 74 - Forks: 60

joeycumines/floater

Package floater is not the shit in the toilet. Utils for math/big.

Language: Go - Size: 189 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

afterdusk/flop

IEEE 754-style floating-point converter

Language: TypeScript - Size: 1.31 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 1

powturbo/TurboPFor-Integer-Compression

Fastest Integer Compression

Language: C - Size: 5.9 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 799 - Forks: 113

artecs-group/PERCIVAL

Open-Source Posit RISC-V Core with Quire Capability

Language: C++ - Size: 33.5 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 56 - Forks: 11

2268977258/32-bit-Floating-Point-Adder

32位单精度浮点数加法器是一种专门用于执行符合IEEE 754标准的32位单精度浮点数加法运算的数字电路。这种加法器在现代计算机系统中扮演着重要角色,特别是在处理需要高精度计算的任务时,如科学计算、图形处理、机器学习等应用领域。这个小项目实现了一个符合IEEE 754 单精度浮点数标准(32 位)的浮点数加法器的完整设计。该设计的目标是通过Verilog实现一个能够处理两输入浮点数的加法运算模块。

Language: Verilog - Size: 257 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 4 - Forks: 0

stdlib-js/constants-float32-max-safe-nth-double-factorial

Maximum safe nth double factorial when stored in single-precision floating-point format.

Language: JavaScript - Size: 198 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

lifthrasiir/hexf

Hexadecimal float support for Rust

Language: Rust - Size: 43 KB - Last synced at: 3 days ago - Pushed at: 18 days ago - Stars: 38 - Forks: 12

nberlette/math

Standalone zero-dependency implementation of the entire `Math` namespace, compatible with any JS runtime.

Language: TypeScript - Size: 135 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-max

Maximum single-precision floating-point number.

Language: JavaScript - Size: 359 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 3 - Forks: 1

stdlib-js/constants-float32-max-base2-exponent

The maximum biased base 2 exponent for a single-precision floating-point number.

Language: JavaScript - Size: 209 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-log2-e

Base 2 logarithm of Euler's number.

Language: JavaScript - Size: 186 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-log10-e

Base 10 logarithm of Euler's number.

Language: JavaScript - Size: 186 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-ln-two

Natural logarithm of 2 as a single-precision floating-point number.

Language: JavaScript - Size: 203 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-ln-two-pi

Natural logarithm of 2π.

Language: JavaScript - Size: 180 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-ln-pi

Natural logarithm of π as a single-precision floating-point number

Language: JavaScript - Size: 205 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float32-half-ln-two

One half times the natural logarithm of 2 as a single-precision floating-point number.

Language: JavaScript - Size: 208 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float32-gamma-lanczos-g

Arbitrary constant `g` to be used in Lanczos approximation functions.

Language: JavaScript - Size: 186 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-fourth-root-eps

Fourth root of single-precision floating-point epsilon.

Language: JavaScript - Size: 170 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-sqrt-eps

Square root of half-precision floating-point epsilon.

Language: JavaScript - Size: 325 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-num-bytes

Size (in bytes) of a single-precision floating-point number.

Language: JavaScript - Size: 311 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float16-smallest-subnormal

Smallest positive half-precision floating-point subnormal number.

Language: JavaScript - Size: 355 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-ninf

Single-precision floating-point negative infinity.

Language: JavaScript - Size: 510 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float32-eulergamma

The Euler-Mascheroni constant.

Language: JavaScript - Size: 189 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-smallest-normal

Smallest positive normalized half-precision floating-point number.

Language: JavaScript - Size: 350 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float32-min-safe-integer

Minimum safe single-precision floating-point integer.

Language: JavaScript - Size: 315 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-eps

Difference between one and the smallest value greater than one that can be represented as a single-precision floating-point number.

Language: JavaScript - Size: 529 KB - Last synced at: 6 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-min-base2-exponent

The minimum biased base 2 exponent for a normal single-precision floating-point number.

Language: JavaScript - Size: 208 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float32-e

Euler's number.

Language: JavaScript - Size: 205 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-pinf

Half-precision floating-point positive infinity.

Language: JavaScript - Size: 340 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-cbrt-eps

Cube root of single-precision floating-point epsilon.

Language: JavaScript - Size: 544 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-min-base2-exponent-subnormal

The minimum biased base 2 exponent for a subnormal single-precision floating-point number.

Language: JavaScript - Size: 234 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-num-bytes

Size (in bytes) of a half-precision floating-point number.

Language: JavaScript - Size: 313 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-min-base10-exponent

The minimum base 10 exponent for a normal single-precision floating-point number.

Language: JavaScript - Size: 226 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-ninf

Half-precision floating-point negative infinity.

Language: JavaScript - Size: 409 KB - Last synced at: about 5 hours ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float32-catalan

Catalan's constant.

Language: JavaScript - Size: 183 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-min-safe-integer

Minimum safe half-precision floating-point integer.

Language: JavaScript - Size: 354 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-min-base10-exponent-subnormal

The minimum base 10 exponent for a subnormal single-precision floating-point number.

Language: JavaScript - Size: 212 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-max-safe-integer

Maximum safe single-precision floating-point integer.

Language: JavaScript - Size: 301 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float16-max

Maximum half-precision floating-point number.

Language: JavaScript - Size: 330 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-apery

Apéry's constant.

Language: JavaScript - Size: 184 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-max-safe-integer

Maximum safe half-precision floating-point integer.

Language: JavaScript - Size: 332 KB - Last synced at: 10 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-max-base2-exponent-subnormal

The maximum biased base 2 exponent for a subnormal single-precision floating-point number.

Language: JavaScript - Size: 121 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-exponent-bias

The bias of a half-precision floating-point number's exponent.

Language: JavaScript - Size: 319 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-max-base10-exponent

The maximum base 10 exponent for a single-precision floating-point number.

Language: JavaScript - Size: 204 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-eps

Difference between one and the smallest value greater than one that can be represented as a half-precision floating-point number.

Language: JavaScript - Size: 340 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float32-max-base10-exponent-subnormal

The maximum base 10 exponent for a subnormal single-precision floating-point number.

Language: JavaScript - Size: 225 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float16-cbrt-eps

Cube root of half-precision floating-point epsilon.

Language: JavaScript - Size: 319 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float32-max-safe-nth-factorial

Maximum safe nth factorial when stored in single-precision floating-point format.

Language: JavaScript - Size: 207 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float32-max-nth-factorial

Maximum nth factorial when stored in single-precision floating-point format.

Language: JavaScript - Size: 0 Bytes - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

brendanzab/approx

Approximate floating point equality comparisons and assertions

Language: Rust - Size: 119 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 164 - Forks: 36

stdlib-js/constants-float64-max-safe-nth-double-factorial

Maximum safe nth double factorial when stored in double-precision floating-point format.

Language: JavaScript - Size: 293 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

HPCguy/Squint

Squint: A peephole optimizer for stack VM compilers

Language: C - Size: 628 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 25 - Forks: 1

stdlib-js/constants-float64-max-nth-factorial

Maximum nth factorial when stored in double-precision floating-point format.

Language: JavaScript - Size: 176 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

Luis-Varona/uniquetol-rs

A Rust toolbox for isolating unique values in n-dimensional arrays of imprecise floating-point data within a given tolerance.

Language: Rust - Size: 34.2 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

stdlib-js/constants-float64-sqrt-two

Square root of 2.

Language: JavaScript - Size: 319 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float64-sqrt-two-pi

Square root of 2π.

Language: JavaScript - Size: 340 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float64-sqrt-three

Square root of 3.

Language: JavaScript - Size: 347 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

stdlib-js/constants-float64-sqrt-pi

Square root of π.

Language: JavaScript - Size: 320 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

stdlib-js/constants-float64-sqrt-half

Square root of 1/2.

Language: JavaScript - Size: 354 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0