Floats and Ints — CMPT 166 Fall 2016 1 documentation (2024)

Manipulating numbers is one of the most important and useful things a computer can do. We’ll look at two number types in this course: int for integers (i.e. whole numbers), and float for floating point numbers, i.e. numbers with a decimal point in them.

Integers

In Processing, an int is a whole number (i.e. an integer), such as:

4
-6
10
183993

These are called int literals to distinguish them from variables of type int. For example:

int age = 18;

This statement does two things:

  • It declares age to be a variable of type int.
  • It assigns the value 18 to age. Since this is the first value assigned to age, we say it initializes age.

Keep in mind that age is a variable of type int, while 18 is an int literal. We usually refer to both as ints, but sometimes the distinction matters.

Integer Arithmetic

You can perform arithmetic on ints using the basic arithmetic operators. For example:

int averageLifespan = 82; // for Canadians
println(2014 + averageLifespan - 18); // 2078

Or:

int minsInOneHour = 60;
int hoursInOneDay = 24;
int daysInOneYear = 365;
int minsInOneYear = minsInOneHour * hoursInOneDay * daysInOneYear;
println(minsInOneYear); // 525600, about half a million minutes in a year

Integer Division

Dividing two ints always returns an int in Processing. For example:

println(1 / 2); // 0
println(3 / 2); // 1
println(4 / 2); // 2

It might come as a surprise that 1 / 2 evaluates to 0. The “correct” answer is 0.5, but the problem is that 0.5 is not an int (it’s a float). So, after Processing divides two ints, if the result has any non-zero digits to the right of the decimal point, they get truncated (i.e. chopped off). This is called integer division: dividing two ints always returns an int.
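As a rough illustration of truncation, here is a small sketch in plain Java (the language Processing is built on). The class and method names are made up for this example, and the cast to float is just one common way to get the fractional answer when you want it:

```java
// Illustrative sketch in plain Java; IntDivisionDemo and its methods are hypothetical names.
class IntDivisionDemo {
    // Integer division: any fractional part of the quotient is discarded.
    static int intQuotient(int a, int b) {
        return a / b;
    }

    // Converting one operand to float first gives floating point division instead.
    static float floatQuotient(int a, int b) {
        return (float) a / b;
    }

    public static void main(String[] args) {
        System.out.println(intQuotient(1, 2));   // 0
        System.out.println(intQuotient(7, 2));   // 3
        System.out.println(intQuotient(-7, 2));  // -3 (truncation goes toward zero)
        System.out.println(floatQuotient(1, 2)); // 0.5
    }
}
```

Note that truncation chops toward zero, so -7 / 2 is -3 rather than -4.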

There is one exception, though. Dividing by 0 is an undefined operation in mathematics, and in Processing it causes an error, e.g.:

println(5 / 0); // ArithmeticException: / by zero

This statement crashes the program and issues the error message “ArithmeticException: / by zero”.
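If a divisor might be 0, one option is to check it before dividing. The following plain-Java sketch shows that idea, along with what catching the error looks like. The names divideOrDefault and throwsOnZero are hypothetical helpers invented for this example, not part of Processing:

```java
// Illustrative sketch; SafeDivideDemo and its methods are hypothetical names.
class SafeDivideDemo {
    // Returns fallback instead of crashing when the divisor is 0.
    static int divideOrDefault(int a, int b, int fallback) {
        if (b == 0) {
            return fallback;
        }
        return a / b;
    }

    // Shows that int division by zero really does throw ArithmeticException.
    static boolean throwsOnZero(int a) {
        int zero = 0;
        try {
            int unused = a / zero;
            return false;
        } catch (ArithmeticException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println(divideOrDefault(10, 2, -1)); // 5
        System.out.println(divideOrDefault(5, 0, -1));  // -1
        System.out.println(throwsOnZero(5));            // true
    }
}
```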

Min and Max Integers

It is important to know that there is a min int and a max int:

println(Integer.MIN_VALUE); // -2147483648
println(Integer.MAX_VALUE); // 2147483647

As this shows, the maximum value for an int in Processing is, exactly, the number 2147483647, which is a little over 2.1 billion. Similarly, the smallest int is a little less than -2.1 billion.

This means that you cannot represent, say, the number 2.5 billion using an int. It can also result in some weird calculations, e.g. here’s what happens if you add 1 to the maximum int:

println(2147483647 + 1); // -2147483648

This is pretty disturbing when you think about it: by adding two positive numbers together we got a negative number!
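To make the wrap-around concrete, here is a plain-Java sketch. It also uses Math.addExact, a standard Java method that throws an ArithmeticException instead of silently wrapping. The class and helper names are invented for this example:

```java
// Illustrative sketch; OverflowDemo and its methods are hypothetical names.
class OverflowDemo {
    // Ordinary int addition silently wraps around on overflow.
    static int wrappingAdd(int a, int b) {
        return a + b;
    }

    // Math.addExact throws ArithmeticException when the true sum does not fit in an int.
    static boolean additionOverflows(int a, int b) {
        try {
            Math.addExact(a, b);
            return false;
        } catch (ArithmeticException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println(wrappingAdd(Integer.MAX_VALUE, 1));       // -2147483648
        System.out.println(additionOverflows(Integer.MAX_VALUE, 1)); // true
        System.out.println(additionOverflows(2014, 82));             // false
    }
}
```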

Here’s another strange behaviour:

println(2147483648); // literal out of range

2147483648 is one more than the max int, so it is not an int. This program doesn’t even run: you get a “literal out of range” error when you compile this statement.

Another fact about ints is that there is one more negative int than positive int. The smallest int is \(-2^{31} = -2147483648\), but the biggest int is only \(2^{31}-1 = 2147483647\). One place where this might be a small problem is the abs(x) function, which should return x if x >= 0, and -x if x < 0. But what is abs(-2147483648)? The correct answer is 2147483648, but 2147483648 is not an int: it’s too big! In this one case, the abs function cannot possibly return the right answer, and so it does this:

println(abs(-2147483648)); // -2147483648
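Java's own Math.abs behaves the same way for this one value. One possible workaround, sketched below in plain Java, is to widen the value to a long before negating it. The name safeAbs is a hypothetical helper made up for this example:

```java
// Illustrative sketch; AbsDemo and safeAbs are hypothetical names.
class AbsDemo {
    // Widen to long before negating, so that -(-2147483648) has room to fit.
    static long safeAbs(int x) {
        long wide = x;
        return wide < 0 ? -wide : wide;
    }

    public static void main(String[] args) {
        System.out.println(Math.abs(Integer.MIN_VALUE)); // -2147483648
        System.out.println(safeAbs(Integer.MIN_VALUE));  // 2147483648
        System.out.println(safeAbs(-18));                // 18
    }
}
```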

Note

Processing has a bigger integer type called long, which can represent any integer in the range -9,223,372,036,854,775,808 to 9,223,372,036,854,775,807. That’s about -9.2 quintillion to 9.2 quintillion.

You may be interested to know that 9223372036854775807 has its own Wikipedia page, but -9,223,372,036,854,775,808 doesn’t.

If you need integers bigger than long, then Java (on which Processing is based) has a special type called BigInteger that can represent arbitrarily long integers that are limited only by the amount of memory in your computer.
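Here is a short plain-Java sketch of both types. The class and method names are invented for this example. A long literal is written with an L suffix, and BigInteger values are combined with methods such as add rather than with the + operator:

```java
import java.math.BigInteger;

// Illustrative sketch; BigNumbersDemo and onePastLongMax are hypothetical names.
class BigNumbersDemo {
    // One more than the largest long: too big for long, fine for BigInteger.
    static BigInteger onePastLongMax() {
        return BigInteger.valueOf(Long.MAX_VALUE).add(BigInteger.ONE);
    }

    public static void main(String[] args) {
        long big = 2147483648L;               // the L suffix marks a long literal
        System.out.println(big + 1);          // 2147483649
        System.out.println(Long.MAX_VALUE);   // 9223372036854775807
        System.out.println(onePastLongMax()); // 9223372036854775808
    }
}
```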

The Mod Operator

There’s one other useful int operator you should know about: the % operator, which is called the mod operator, or the remainder operator. For instance:

println(5 % 2); // 1
println(6 % 2); // 0
println(14 % 8); // 6

The expression 5 % 2 is read “5 mod 2”, and it calculates the remainder when 5 is divided by 2: since 2 goes into 5 two times with 1 left over, 5 % 2 equals 1.

One application of % is to test if a number is even or odd. For example, 178 % 2 is 0, which means 178 is even (because 2 goes into 178 exactly 89 times with 0 left over). In general, if n is a positive int and n % 2 is 0, then n must be even. The only other possibility is that n % 2 is 1, which means n is odd.
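The even/odd test can be sketched in plain Java as follows. The helper names isEven and isOdd are made up for this example. Because the reasoning above assumes n is positive, isOdd is written as "not even" so that it also behaves sensibly for negative numbers, where % can give -1:

```java
// Illustrative sketch; ParityDemo, isEven and isOdd are hypothetical names.
class ParityDemo {
    static boolean isEven(int n) {
        return n % 2 == 0;
    }

    // Written as "remainder is not 0" because -7 % 2 is -1, not 1.
    static boolean isOdd(int n) {
        return n % 2 != 0;
    }

    public static void main(String[] args) {
        System.out.println(isEven(178)); // true
        System.out.println(isOdd(5));    // true
        System.out.println(-7 % 2);      // -1
        System.out.println(isOdd(-7));   // true
    }
}
```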

Here’s an example of how you might use % in animation. This program makes a ball wrap around the screen without using an if-statement:

float x;

void setup() {
  size(500, 500);
}

void draw() {
  background(255);
  ellipse(x, 250, 100, 100);
  x += 2;
  x = x % 500; // wrap back to 0 when x reaches the right edge
}
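To see the wrap-around without opening a window, the position update can be pulled out into a small function and run in plain Java. This is only a sketch: the name advance and its parameters are invented for this example, and float literals need an f suffix in plain Java:

```java
// Illustrative sketch; WrapAroundDemo and advance are hypothetical names.
class WrapAroundDemo {
    // One animation step: move right by step pixels, wrapping around at width.
    static float advance(float x, float step, float width) {
        return (x + step) % width;
    }

    public static void main(String[] args) {
        float x = 494.0f;
        x = advance(x, 2.0f, 500.0f);
        System.out.println(x); // 496.0
        x = advance(x, 2.0f, 500.0f);
        System.out.println(x); // 498.0
        x = advance(x, 2.0f, 500.0f);
        System.out.println(x); // 0.0 (wrapped back to the left edge)
    }
}
```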

Floating Point Numbers

In Processing, a float is a number with a decimal point in it, such as:

4.5
-61.2
0.0
-3.0
183.993

These are examples of float literals, which are so named to distinguish them from variables of type float. For example:

float speed = 1.8;

Here, speed is a variable of type float, while 1.8 is a float literal. Just as for ints, this statement does two things:

  • It declares speed to be a variable of type float.
  • It assigns speed the initial value of 1.8.

Floating Point Arithmetic

In most cases, floating point arithmetic works like regular arithmetic. For example:

println(1.2 + 3.2); // 4.4
println(6.0 - 3.344); // 2.656
println(2.1 * 3.14); // 6.594
println(-18.6 / 29.1); // -0.63917524
println(10.0 / 2.0); // 5.0

Notice that 10.0 / 2.0 evaluates to 5.0, which is a float. Whenever you divide two floats, the result is always a float.

However, there are a few important details you need to be aware of.

Division by 0.0

Dividing a number by 0.0 is undefined mathematically, but for a float we get a surprising result:

println(5.6 / 0.0); // Infinity

There is no error message here: the actual result is a special float value called infinity. Another special float value occurs in this case:

println(5.6 / 0.0 - 5.6 / 0.0); // NaN

Again, this does not cause an error, but instead prints the special float value NaN, which means “not a number”.

If you think about this, it leads to a strange conclusion: if x is a float, then it is possible that x - x is not equal to 0.0!

The rules for when and how floats use these special values are quite technical and beyond the scope of the course. The important thing for us is to know that these values exist and can occur in ordinary calculations.
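If you ever need to check for these values, Java provides the standard methods Float.isInfinite and Float.isNaN. The sketch below (plain Java, with an invented class name and helper) shows them in use. Comparing with == does not work for NaN, because NaN is not equal to anything, not even itself:

```java
// Illustrative sketch; SpecialFloatsDemo and divide are hypothetical names.
class SpecialFloatsDemo {
    static float divide(float a, float b) {
        return a / b;
    }

    public static void main(String[] args) {
        float inf = divide(5.6f, 0.0f);
        float nan = inf - inf;

        System.out.println(inf);                   // Infinity
        System.out.println(nan);                   // NaN
        System.out.println(Float.isInfinite(inf)); // true
        System.out.println(Float.isNaN(nan));      // true
        System.out.println(nan == nan);            // false
    }
}
```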

Min and Max floats

The smallest positive and the largest float values are as follows:

println(Float.MIN_VALUE); // 1.4E-45
println(Float.MAX_VALUE); // 3.4028235E38

Notice a few things here:

  • The numbers 1.4E-45 and 3.4028235E38 are written in exponential notation, and are equivalent to \(1.4 \times 10^{-45}\) and \(3.4028235 \times 10^{38}\).
  • Float.MIN_VALUE is 1.4E-45. Despite its name, it is not the most negative float: it is, approximately, the smallest positive number that we can represent as a float. (It should not be confused with machine epsilon, which is the gap between 1.0 and the next larger float, about 1.19E-7.) Positive numbers much smaller than 1.4E-45 are rounded to 0.0.
  • The max value, 3.4028235E38, is a huge number with 39 digits in it. However, only the first 8 or so digits are significant: a float stores only about 7 to 8 decimal digits of precision, so the digits after that carry no useful information.
  • The smallest (i.e. most negative) float is -3.4028235E38, which is just the negation of the max float.
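A few of these limits can be checked directly. The following plain-Java sketch (the class and helper names are invented for this example) prints the smallest positive float, the most negative float, and the gap between 1.0 and the next larger float using the standard Math.ulp method. The last line shows the limited precision in action: 16777216 is \(2^{24}\), and adding 1 to it as a float changes nothing:

```java
// Illustrative sketch; FloatLimitsDemo and addOne are hypothetical names.
class FloatLimitsDemo {
    static float addOne(float x) {
        return x + 1.0f;
    }

    public static void main(String[] args) {
        System.out.println(Float.MIN_VALUE);              // 1.4E-45
        System.out.println(-Float.MAX_VALUE);             // -3.4028235E38
        System.out.println(Math.ulp(1.0f));               // 1.1920929E-7
        System.out.println(Float.MIN_VALUE / 2 == 0.0f);  // true
        System.out.println(addOne(16777216.0f) == 16777216.0f); // true
    }
}
```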

Rounding Errors

A major problem with floating point numbers is that they are often unavoidably inaccurate. For example, in mathematics \(\frac{1}{3} = 0.3333 \dots\), where the \(\ldots\) means there are an infinite number of 3s after the decimal point. But a Processing float can’t have an infinite number of digits:

println(1.0 / 3.0); // 0.33333334

As you can see, there are a finite number of digits, plus the final digit has been rounded to 4. So it is not exactly equal to \(\frac{1}{3}\), but is instead a little bit bigger.

For many programs, round-off errors don’t make any noticeable difference. But sometimes they can be the source of serious bugs that are very hard to fix. There is an entire sub-field of computer science called numerical analysis that studies how to do accurate and efficient floating point arithmetic on machines.

In this course, we will usually just ignore round-off errors and hope that our floating point calculations are accurate enough.
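Round-off errors can also accumulate. In the plain-Java sketch below, adding 0.1 to itself ten times does not give exactly 1.0. A common workaround, shown here only as one possible approach, is to compare floats using a small tolerance instead of ==. The names sumOfTenths and nearlyEqual, and the tolerance 0.000001, are assumptions made for this example:

```java
// Illustrative sketch; RoundOffDemo, sumOfTenths and nearlyEqual are hypothetical names.
class RoundOffDemo {
    // Adds 0.1 to itself ten times using float arithmetic.
    static float sumOfTenths() {
        float total = 0.0f;
        for (int i = 0; i < 10; i++) {
            total += 0.1f;
        }
        return total;
    }

    // Treats two floats as equal if they differ by less than tolerance.
    static boolean nearlyEqual(float a, float b, float tolerance) {
        return Math.abs(a - b) < tolerance;
    }

    public static void main(String[] args) {
        float total = sumOfTenths();
        System.out.println(total);                               // 1.0000001
        System.out.println(total == 1.0f);                       // false
        System.out.println(nearlyEqual(total, 1.0f, 0.000001f)); // true
    }
}
```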

Mixing ints and floats

You can often use ints and floats together without a problem. For example:

println(4.0 + 5); // 9.0

In the expression 4.0 + 5, 4.0 is of type float, and 5 is of type int. Processing doesn’t actually know how to add float and int, so it automatically converts 5 into the float 5.0. This makes the expression equivalent to 4.0 + 5.0, which is 9.0.

You can also assign an int to a float without error, e.g.:

float temperature = 21; // 21 is an int, but temperature is a float

This works because Processing automatically converts 21 to 21.0.

By default, you cannot assign a float to an int:

int age = 5.5; // compiler error

This statement fails to compile because 5.5 is of type float, and you are not allowed to store a float in an int variable. However, you can explicitly convert 5.5 to an int like this:

int age = int(5.5);
println(age); // 5
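The int() function is specific to Processing. In plain Java the same conversion is written as a cast, (int), and float literals need an f suffix. The sketch below (with invented class and method names) shows that the conversion truncates rather than rounds, and uses the standard Math.round method for comparison:

```java
// Illustrative sketch; ConversionDemo and its methods are hypothetical names.
class ConversionDemo {
    // Casting a float to int chops off the fractional part.
    static int truncate(float x) {
        return (int) x;
    }

    // An int mixed with a float is converted to float automatically.
    static float addMixed(float a, int b) {
        return a + b;
    }

    public static void main(String[] args) {
        System.out.println(truncate(5.5f));     // 5
        System.out.println(truncate(-5.5f));    // -5
        System.out.println(truncate(6.9f) + 3); // 9
        System.out.println(Math.round(5.5f));   // 6
        System.out.println(addMixed(4.0f, 5));  // 9.0
    }
}
```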

Summary Table

                     int               float
Sample literals      4, -5, 0          -4.0, 3.14, 0.0
Min                  -2147483648       -3.4028235E38
Max                  2147483647        3.4028235E38
Smallest positive    1                 1.4E-45
When dividing by 0   run-time error    infinity or NaN
Special values       none              infinity, NaN

Questions

  1. Give an example of:

    • a positive int literal
    • a negative int literal
    • an int literal that is neither positive nor negative
    • a statement that declares a new int variable and initializes it to 15
  2. What does this print?

    println(5 * (1/2 + 1/3 + 1/4 + 1/5));
  3. In regular arithmetic, \(\frac{5}{\frac{1}{2}} = 5 \cdot \frac{2}{1} = 10\). What does the equivalent expression, 5 / (1 / 2), evaluate to in Processing?

  4. What is the biggest possible int? Your answer should be accurate to within about 2 million.

  5. Suppose n is a positive int. Is the expression n + 1 always greater than 0? Why, or why not?

  6. What is the name of the % operator?

  7. What are the values of the following expressions?

    • 8 % 2
    • 8 % 3
    • 17 % 4
    • (100 % 2) + (100 % 3) + (100 % 4)
  8. Give an example of:

    • a positive float literal greater than 100
    • a positive float literal between 0 and 1
    • a statement that declares a new float variable and initializes it to 3.14
  9. How many digits are there in the maximum possible float value? Your answer should be correct within 1 digit.

  10. What does NaN stand for?

  11. What does this print?

    println(91.22 / 0.0);
  12. Give a simple example of an expression involving floats that suffers from a round-off error.

  13. What does this print?

    println(91.22 / 0);
  14. What does this print?

    println(int(6.9) + 3);
  15. Answer “true” or “false” for each of the following questions:

    1. If n is a positive int, then n + 1 is also a positive int.
    2. If a and b are both of type int, and a > b is true, then a + 1 > b is also true.
    3. If n is a positive int, then n - n is 0.
    4. If x is a float, then x - x is 0.0.