Floating Point in C --- CS:APP

時間 2019-11-13

原文原文鏈接

All versions of C provide two different floating-point data types: float and double. On machines that support IEEE floating point, these data types correspond to single-and double-precision floating point. In addition, the machines use the round-to-even rounding mode. Unfortunately, since the C standards do not require the machine to use IEEE floating point, there are no standard methods to change the rounding mode or to get special values such as -0, +00,-00, or NaN. express

Most systems provide a combination of include ('.h') files and procedure libraries to provide access to these features, but the details vary from one system to another. For example, the GNU compiler GCC defines program constants INFINITY and NAN when the following sequence occurs in the program file: app

#define _GNU_SOURCE 1 ide

#define <math.h> ui

More recent versions of C, including ISO C99, include a third floating-point data type, long double. For many machines and compilers, this data type is equivalent to the double data type. For Intel-compatible machines, however, GCC implements this data type using an 80-bit "extended precision" format, providing a much larger range and precision than does the standard 64-bit format. this

When casting values between int, float, and double formats, the program changes the numeric values and the bit representations as follows (assuming a 32-bit int): spa

From int to float, the number cannot overflow, but it may be rounded.
From int or float to double, the exact numeric value can be preserved because double has both greater range(i.e., the range of representable values), as well as greater precision(i.e., the number of significant bits).
From double to float, the value can overflow to +00 or -00, since the range is smaller. Otherwise, it may be rounded, because the precision is smaller.
From float or double to int the value will be rounded toward zero. For examople, 1.999 will be converted to 1, while -1.999 will be converted to -1. Furthermore, the value may overflow. The C standards do not specify a fixed result for this case. Intel-compatible microprocessors designate thebit pattern[10...00] as an integer indefinite value. Any conversion from floating point to integer that cannot assign a reasonable integer approximation yields this value. Thus, the expression (int) +1e10 yields -21483648, generating a negative value from a positive one.

1. C# floating Point types
2. Floating Point Math
3. Floating-Point Unit Registers.
4. UVa11809 - Floating-Point Numbers
5. phpredis Floating point exception
6. 【UVA】 11809 --- Floating-Point Numbers
7. Xilinx Floating-Point Operator IP
8. Floating-Point Hazard【求導公式】
9. 【微學習】 floating-point | 200406
10. Point-In-Polygon Algorithm
更多相關文章...
• SQL IN 操作符 - SQL 教程
• Swift for-in 循環 - Swift 教程
• C# 中 foreach 遍歷的用法
• Kotlin學習（一）基本語法

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。