Variables, Strings, and Numbers#

In this section, you will learn to store information in variables. You will learn about three types of data: strings, which are lists of characters, integers, which are numbers like 2 and 3, and floats, which are numbers like 2.0 and 2.5.

Variables#

A variable holds a value, for example

message = "Hello world!"
print(message)
Hello world!

You can change the value of a variable at any point. When you do, the original value is overwritten.

message = "Hello world!"
print(message)

message = "I'm learning to program"
print(message)
Hello world!
I'm learning to program

Variable naming rules#

  • Variable names can only have letters, numbers, and underscores. Variable names can start with a letter or an underscore, but cannot start with a number.

  • Spaces are not allowed in variable names, so we use underscores instead of spaces. For example, use student_name instead of “student name”.

  • You cannot use Python keywords as variable names.

  • Variable names should be descriptive, without being too long. For example, cur_trial is better than just trial, and my_current_trial.

  • Be careful about using the lowercase letter l and the uppercase letter O in places where they could be confused with the numbers 1 and 0.

  • Python conventions dictate separating naming clauses with underscores (e.g., cur_trial). An alternate naming convention, and one that I will use in the class because of old habits is mixed case (e.g., curTrial)

NameError#

There is a common error when using variables, that you will almost certainly encounter at some point. Take a look at this code, and see if you can figure out why it causes an error.

message = "Python was created by Guido van Rossum way back in 1991."
print(mesage)
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
/home/ehmatthes/development_resources/project_notes/intro_programming/notebooks/<ipython-input-12-7966723379c3> in <module>()
      1 message = "Thank you for sharing Python with the world, Guido!"
----> 2 print(mesage)

NameError: name 'mesage' is not defined

Let’s look through this error message. First, we see it is a NameError. Then we see the file that caused the error, and a green arrow shows us what line in that file caused the error. Then we get some more specific feedback, that “name ‘mesage’ is not defined”.

You may have already spotted the source of the error. We spelled message two different ways. Python does not care whether we use the variable name “message” or “mesage”. Python only cares that the spellings of our variable names match every time we use them.

This is pretty important, because it allows us to have a variable “name” with a single name in it, and then another variable “names” with a bunch of names in it.

We can fix NameErrors by making sure all of our variable names are spelled consistently.

```{note} In nearly in every case, capitalization matters! The variable names message, Message, and messagE are – to Python – as different from one another as message and rhinoceros Of course to humans message and Message are quite similar and liable to get confused, which is a reason to not use variable names that differ only in capitalization!


Strings#

Strings are lists of characters. Let’s look at some examples.

Single and double quotes#

Strings are contained by either single or double quotes.

my_string = "This is a double-quoted string."
my_string = 'This is a single-quoted string.'

This lets us make strings that contain quotations.

quote = "Linus Torvalds once said, 'Any program is only as good as it is useful.'"

What if we want to have a string with both single and double quotes in it? The following won’t work. Can you see why?

my_string = "Here's a string with a "quote in it""
print(my_string)
  Input In [1]
    my_string = "Here's a string with a "quote in it""
                                         ^
SyntaxError: invalid syntax

To make it work we have to “escape” the quotes that would otherwise tell the Python interpreter that ths string is to be terminated.

my_string = "Here's a string with a \"quote in it\""
print (my_string)
Here's a string with a "quote in it"

Getting a string’s length and checking if it contains something#

Python represents strings as lists of characters. Because of this, all the functions that apply to lists (which you’ll larn about in the next notebook), apply to strings as well. This allows you to do things like this:

print (f"There are {len('aeiou')} vowels in English")
There are 5 vowels in English
if 'e' in 'aeiou':
    print ("yep, there's an e")
yep, there's an e

Here’s a simple string search. We’ll cover more complex string searches later in the semester.

print ('f' in 'aeiou')
False

Changing case#

You can easily change the case of a string, to present it the way you want it to look.

first_name = 'eric'

print(first_name)
print(first_name.title())
eric
Eric

It is often good to store data in lower case, and then change the case as you want to for presentation. This catches some TYpos. It also makes sure that ‘eric’, ‘Eric’, and ‘ERIC’ are not considered three different people.

Some of the most common cases are lower, title, and upper.

first_name = 'eric'

print(first_name)
print(first_name.title())
print(first_name.upper())

first_name = 'Eric'
print(first_name.lower())
eric
Eric
ERIC
eric

You will see this syntax quite often, where a variable name is followed by a dot and then the name of an action, followed by a set of parentheses. The parentheses may be empty, or they may contain some values.

variable_name.action()

In this example, the word “action” is the name of a method. A method is something that can be done to a variable. The methods ‘lower’, ‘title’, and ‘upper’ are all functions that have been written into the Python language, which do something to strings. Later on, you will learn to write your own methods.

Combining strings (concatenation)#

It is often very useful to be able to combine strings into a message or page element that we want to display. Again, this is easier to understand through an example.

first_name = 'ada'
last_name = 'lovelace'

full_name = first_name + ' ' + last_name

print(full_name.title())
Ada Lovelace

The plus sign combines two strings into one, which is called “concatenation”. You can use as many plus signs as you want in composing messages. In fact, many web pages are written as giant strings which are put together through a long series of string concatenations.

first_name = 'ada'
last_name = 'lovelace'
full_name = first_name + ' ' + last_name

message = full_name.title() + ' ' + "was considered the world's first computer programmer."

print(message)
Ada Lovelace was considered the world's first computer programmer.

If you don’t know who Ada Lovelace is, you might want to go read what Wikipedia or the Computer History Museum have to say about her. Her life and her work are also the inspiration for the Ada Initiative.

Whitespace#

The term “whitespace” refers to characters that the computer is aware of, but are invisible to readers. The most common whitespace characters are spaces, tabs, and newlines.

Spaces are easy to create, because you have been using them as long as you have been using computers. Tabs and newlines are represented by special character combinations.

The two-character combination “\t” makes a tab appear in a string. Tabs can be used anywhere you like in a string.

print("Hello everyone!")
Hello everyone!
print("\tHello everyone!")
	Hello everyone!
print("Hello \teveryone!")
Hello 	everyone!

The combination “\n” makes a newline appear in a string. You can use newlines anywhere you like in a string.

print("Hello everyone!")
Hello everyone!
print("\nHello everyone!")
Hello everyone!
print("Hello \neveryone!")
Hello 
everyone!
print("----")
print("\n\n\nHello everyone!")
----



Hello everyone!

We’ll talk again about newlines when we talk about writing to files.

Stripping whitespace#

Sometimes you’ll wnat to get rid of whitespace characters (spaces, tabs, and newlines) that precede or follow the string (for example, when reading from files or when accepting free-response from users).

You can strip whitespace from the left side, the right side, or both sides of a string.

name = ' eric '

print(name.lstrip())
print(name.rstrip())
print(name.strip())
eric 
 eric
eric

It’s hard to see exactly what is happening, so let’s add some characters to make it a bit clearer.

name = ' eric '

print('-' + name.lstrip() + '-')
print('-' + name.rstrip() + '-')
print('-' + name.strip() + '-')
-eric -
- eric-
-eric-

Numbers#

Integers#

You can do all of the basic operations with integers, and everything should behave as you expect. Addition and subtraction use the standard plus and minus symbols. Multiplication uses the asterisk, and division uses a forward slash. Exponents use two asterisks.

print 3+2
print 3-2
print 3**2
print 3 % 2
1

This last one (%) is a modulus operator. It returns the remainder after division: 3 mod 2 is 1 because when we divide 3 by 2, the remainder is 1. Remember this. It comes in handy!

You can use parenthesis to modify the standard order of operations.

standard_order = 2+3*4
print(standard_order)
14
my_order = (2+3)*4
print(my_order)
20

Floating-Point numbers (floats)#


Floating-point numbers refer to any number with a decimal point. Most of the time, you can think of floating point numbers as decimals, and they will behave as you expect them to.

print(0.1+0.1)
0.2

However, sometimes you will get an answer with an unexpectly long decimal part:

print(0.1+0.2)
0.30000000000000004

This happens because of the way computers represent numbers internally; this has nothing to do with Python itself. Basically, we are used to working in powers of ten, where one tenth plus two tenths is just three tenths. But computers work in powers of two. So your computer has to represent 0.1 in a power of two, and then 0.2 as a power of two, and express their sum as a power of two. There is no exact representation for 0.3 in powers of two, and we see that in the answer to 0.1+0.2.

Python tries to hide this kind of stuff when possible. Don’t worry about it much for now; just don’t be surprised by it, and know that we will learn to clean up our results a little later on.

You can also get the same kind of result with other operations.

print(3*0.1)
0.30000000000000004

Integers in Python 3#

There are a couple differences in the way Python 2 and Python 3 handle numbers. In Python 2, dividing two integers always resulted in an integer, while Python 3 always returns a float.

Python 3 does float division by default:

print(4/2)
2.0
print(3/2)
1.5

If you are getting numerical results that you don’t expect, or that don’t make sense, check if the version of Python you are using is treating integers differently than you expect.

Combining data-types#

Python is a typed language meaning that every variable has a defined type. You can check its type like this:

type(4)
type(3.0)
type('asd')
type(False)

If you want to combine types, you need to convert them appopriately. Sometimes the conversions happen behind the scenes:

a=3
b='s'
print (a,b)
 3 s

Other times they don’t’:

a=3
b='s'
print (a+b)
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-8-b035952ec3f1> in <module>()
      1 a=3
      2 b='s'
----> 3 print a+b

TypeError: unsupported operand type(s) for +: 'int' and 'str'

…this is not arbitrary. The design philosophy is – if there’s ambiguity in how an operation will be interpreted by users, then make it illegal. Otherwise, allow it (this is a very different philosophy from a language like Perl which is much more syntactically permissive)

Some of Python’s operators are “overloaded” meaning that what they do depends on the type of the variables the operator is working with:

a=3
b=5
print (a*b)
15
a='q'
b=3
print (a*b)
qqq
a='q'
b='m'
print (a*b)
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
/Users/glupyan/gitRepos/psych750.github.io/notebooks/var_string_num.ipynb Cell 84 in <cell line: 3>()
      <a href='vscode-notebook-cell:/Users/glupyan/gitRepos/psych750.github.io/notebooks/var_string_num.ipynb#Y145sZmlsZQ%3D%3D?line=0'>1</a> a='q'
      <a href='vscode-notebook-cell:/Users/glupyan/gitRepos/psych750.github.io/notebooks/var_string_num.ipynb#Y145sZmlsZQ%3D%3D?line=1'>2</a> b='m'
----> <a href='vscode-notebook-cell:/Users/glupyan/gitRepos/psych750.github.io/notebooks/var_string_num.ipynb#Y145sZmlsZQ%3D%3D?line=2'>3</a> print (a*b)

TypeError: can't multiply sequence by non-int of type 'str'
a='q'
b='m'
print (a+b)
qm

The ‘+’ and ‘*’ operators are overloaded: when given integers or floats, they do normal addition and multiplication (that’s what you would expect, right?). Adding a string and an integer throws a TypeError because it doesn’t make sense… how would you add a number to a string? Multiplying a string by an integer does have a sensible and unambiguous interpretation (to programmers anyway): just repeat the string that number of times

Note

Multiplying a string by a will not work though. What’s ‘a’ times 3.5??

Finally, multiplying two strings isn’t defined (what’s ‘s’ times ‘q’??). But adding two strings does have an unambiguous interpretation: concatenation!

Commenting your code#

As you begin to write more complicated code, you will have to spend more time thinking about how to code solutions to the problems you want to solve. Once you come up with an idea, you will spend a fair amount of time troubleshooting your code, and revising your overall approach.

Comments allow you to write in English, within your program. In Python, any line that starts with a pound (#) symbol is ignored by the Python interpreter.

# This line is a comment.
print("This line is not a comment, it is code.")
This line is not a comment, it is code.

For multi-line comments (e.g., for explaining what a function does), you can use a triple single quote ''' but this is generally reserved for documenting code rather than writing simple comments. The triple quote can be useful for temporarily commenting out a chunk of your code during debugging.

'''{python}
for i in range(10):
    print "blah"
'''

Note

In VS Code (as well as Sublime Text), you can comment multiple lines by highlight them and pressing ⌘/ . This will prefix thm with the # character.

What makes a good comment?#

  • It is short and to the point, but a complete thought. Most comments should be written in complete sentences.

  • It explains your thinking, so that when you return to the code later you will understand how you were approaching the problem.

  • It explains your thinking, so that others who work with your code will understand your overall approach to a problem.

  • It explains particularly difficult sections of code in detail.

When should you write comments?#

  • When you have to think about code before writing it.

  • When you are likely to forget later exactly how you were approaching a problem.

  • When there is more than one way to solve a problem.

  • When others are unlikely to anticipate your way of thinking about a problem.

Writing good comments is one of the clear signs of a good programmer. If you have any real interest in taking programming seriously, start using comments now. You will see them throughout the examples in these notebooks.

Zen of Python#

=== The Python community is incredibly large and diverse. People are using Python in science, in medicine, in robotics, on the internet, and in any other field you can imagine. This diverse group of thinkers has developed a collective mindset about how programs should be written. If you want to understand Python and the community of Python programmers, it is a good idea to learn the ways Python programmers think.

A set of guiding principles that is written right into the language:

import this
The Zen of Python, by Tim Peters

Beautiful is better than ugly.
Explicit is better than implicit.
Simple is better than complex.
Complex is better than complicated.
Flat is better than nested.
Sparse is better than dense.
Readability counts.
Special cases aren't special enough to break the rules.
Although practicality beats purity.
Errors should never pass silently.
Unless explicitly silenced.
In the face of ambiguity, refuse the temptation to guess.
There should be one-- and preferably only one --obvious way to do it.
Although that way may not be obvious at first unless you're Dutch.
Now is better than never.
Although never is often better than *right* now.
If the implementation is hard to explain, it's a bad idea.
If the implementation is easy to explain, it may be a good idea.
Namespaces are one honking great idea -- let's do more of those!

There is a lot here. Let’s just take a few lines, and see what they mean for you as a new programmer.

Beautiful is better than ugly.

Python programmers recognize that good code can actually be beautiful. If you come up with a particularly elegant or efficient way to solve a problem, especially a difficult problem, other Python programmers will respect your work and may even call it beautiful. There is beauty in high-level technical work.

Explicit is better than implicit.

It is better to be clear about what you are doing, than come up with some shorter way to do something that is difficult to understand.

Simple is better than complex.
Complex is better than complicated.

Keep your code simple whenever possible, but recognize that we sometimes take on really difficult problems for which there are no easy solutions. In those cases, accept the complexity but avoid complication.

Readability counts.

There are very few interesting and useful programs these days that are written and maintained entirely by one person. Write your code in a way that others can read it as easily as possible, and in a way that you will be able to read and understand it 6 months from now. This includes writing good comments in your code.

There should be one-- and preferably only one --obvious way to do it.

There are many ways to solve most problems that come up in programming. However, most problems have a standard, well-established approach. Save complexity for when it is needed, and solve problems in the most straightforward way possible.

Now is better than never.

No one ever writes perfect code. If you have an idea you want to implement it, write some code that works. Release it, let it be used by others, and then steadily improve it.