Study guides

☆☆

Q: Represent (178.1875)10 in single and double precision floating point format?

Write your answer...

Submit

Still have questions?

Related questions

A float is used to represent a single-precision floating point value. A double is used to represent a double-precision floating point value. Generally, a float is 32-bits in length while a double is 64-bits in length, however the actual lengths are implementation-defined.

In C++ we use the float, double and long double data types to represent real numbers with various degrees of precision. The type double provides at least as much precision as a float while a long double provides at least as much precision as a double. On most implementations, a double has twice the precision of a float, however the actual precision is implementation defined.

Depends on the format IEEE double precision floating point is 64 bits. But all sorts of other sizes have been used IBM 7094 double precision floating point was 72 bits CDC 6600 double precision floating point was 120 bits Sperry UNIVAC 1110 double precision floating point was 72 bits the DEC VAX had about half a dozen different floating point formats varying from 32 bits to 128 bits the IBM 1620 had floating point sizes from 4 decimal digits to 102 decimal digits (yes digits not bits).

Yes, double is a data type, it means double precision floating point number.

The double data type is a fundamental numeric data type that can represent floating point values. Depending on implementation, it is normally 8 bytes in size, with a precision around 18 decimal digits.

Floating point types are used to represent fractional numbers. In both C and Java the names for these types are float and double. double offers greater precision than float.

Basically you use a double-precision floating point number for the real part, a double-precision floating point number for the imaginary part, and write methods for any operation you want to include (such as addition, etc.; trigonometric functions, exponential function).

Character or small integerShort IntegerIntegerLong integerBooleanFloating point numbersDouble precision floating point numberLong double precision floating point numberWide characterTo get a better idea on C++ data types, see related links below.

Character or small integerShort IntegerIntegerLong integerBooleanFloating point numbersDouble precision floating point numberLong double precision floating point numberWide characterTo get a better idea on C++ data types, see related links below.

Floating point numbers are always stored according to the underlying architecture. The programming language is immaterial, it must use the same representations as the hardware itself, or at least provide an abstraction of it. C does not provide any abstractions for built-in data types. Most modern computers use the standard IEEE 754 representation, which caters for single-precision (equivalent to float in C), double-precision (double) and extended-precision (long double).

If you are referring to normalization of floating point numbers, it is to maintain the most precision of the number possible. Leading zeros in floating point representation is lost precision, thus normalization removes the leading zeros by shifting left and adjusting the exponent. If the calculation was done in a hidden extended precision register (like IEEE 80-bit format) extra precision bits may be shifted in to the LSBs before restoring the result to a standard single or double precision register, reducing loss of precision.

no you cannot increase the precision of double itself in C

People also asked