# Files

Python uses file objects to interact with external files on your computer. These can be any sort of file you have, whether an audio file, a text file, emails, Excel documents, etc. Note: You will probably need to install certain libraries or modules to interact with those various file types, but they are easily available. (We will cover downloading modules later on in the course.)

Python has a built-in `open` function that allows us to open and work with basic file types. First we will need a file, though. We're going to use some IPython magic to create a text file!

## IPython Writing a File 
#### This magic command is specific to Jupyter notebooks! Alternatively, quickly create a simple .txt file with a text editor such as Sublime Text.

In [37]:
%%writefile test.txt     
Hello, this is a quick test file.

Overwriting test.txt


In [1]:
%%writefile test1.txt     
Hello, this is a quick test file.

Writing test1.txt


## Python Opening a file

Let's begin by opening the file test1.txt, which is located in the same directory as this notebook. For now we will work with files located in the same directory as the notebook or .py script you are using.

It is very easy to get an error on this step: if Python cannot find the file, `open` raises a `FileNotFoundError`. Here the file exists, so the call succeeds:

In [3]:
myfile = open('test1.txt')

In [4]:
myfile

<_io.TextIOWrapper name='test1.txt' mode='r' encoding='cp1252'>

To avoid a `FileNotFoundError`, make sure your .txt file is saved in the same location as your notebook. To check your notebook location, use **pwd**:

In [39]:
pwd

'C:\\Users\\ejaz\\Desktop\\Two month AI\\Complete-Python-3-Bootcamp-master\\Complete-Python-3-Bootcamp-master\\00-Python Object and Data Structure Basics'
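
Outside a notebook, the same check can be done in plain Python. The following is a minimal sketch, not part of the original notebook: `os.getcwd()` reports the working directory, and `no_such_file_12345.txt` is a hypothetical file name assumed not to exist, used only to show what the error looks like.

```python
import os

# Plain-Python counterpart of the pwd magic: the directory that
# relative file names are resolved against
current_dir = os.getcwd()
print(current_dir)

# Hypothetical file name, assumed not to exist in this directory
missing_name = 'no_such_file_12345.txt'

try:
    f = open(missing_name)
except FileNotFoundError as err:
    # open() could not find the file
    found = False
    print('Could not open file:', err)
else:
    # Only reached if the file does exist after all
    found = True
    f.close()
```

If `found` ends up `False`, the file name or the working directory is the first thing to check.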

**Alternatively, to grab files from any location on your computer, simply pass in the entire file path.**

On Windows, you need to double each backslash (`\\`) so Python doesn't treat the backslash as the start of an escape sequence. A file path takes the form:

    myfile = open("C:\\Users\\YourUserName\\Home\\Folder\\myfile.txt")

On macOS and Linux, paths use forward slashes instead:

    myfile = open("/Users/YourUserName/Folder/myfile.txt")
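
Two other ways of writing paths may be useful here. This is an illustrative sketch only: the paths are the placeholder paths from above, and nothing is opened on disk. A raw string (prefixed with `r`) keeps backslashes literal, and the standard `pathlib` module can pick a path apart.

```python
from pathlib import PurePosixPath, PureWindowsPath

# A raw string keeps each backslash literal, so no doubling is needed
escaped = "C:\\Users\\YourUserName\\Home\\Folder\\myfile.txt"
raw = r"C:\Users\YourUserName\Home\Folder\myfile.txt"
print(escaped == raw)  # True

# Pure paths only manipulate the text of a path; they never touch the disk,
# so these placeholder paths do not need to exist
win_path = PureWindowsPath(raw)
posix_path = PurePosixPath("/Users/YourUserName/Folder/myfile.txt")

print(win_path.name)      # myfile.txt
print(posix_path.parent)  # /Users/YourUserName/Folder
```

Either string form can be passed to `open` in the same way as the double-backslash version.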

In [5]:
# Open the test.txt we made earlier
my_file = open('test.txt')

In [6]:
my_file

<_io.TextIOWrapper name='test.txt' mode='r' encoding='cp1252'>

In [7]:
# We can now read the file
my_file.read()

'First Line\nSecond Line\n'

In [8]:
# But what happens if we try to read it again?
my_file.read()

''

This happens because the reading "cursor" sits at the end of the file after the first read, so there is nothing left to read. We can reset the "cursor" like this:

In [11]:
# Seek to the start of file (index 0)
my_file.seek(0)

0

In [12]:
# Now read again
my_file.read()

'First Line\nSecond Line\n'
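
The whole read, read again, seek, read sequence can be reproduced in one self-contained sketch. The file name `cursor_demo.txt` and the temporary directory are assumptions made so that the example does not touch test.txt.

```python
import os
import tempfile

# Create a small sample file in a temporary directory (assumed location)
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, 'cursor_demo.txt')
f = open(sample_path, 'w')
f.write('First Line\nSecond Line\n')
f.close()

f = open(sample_path)
first_read = f.read()    # reads everything; the cursor is now at the end
second_read = f.read()   # nothing left to read, so this is an empty string
f.seek(0)                # move the cursor back to the start
position = f.tell()      # tell() reports the cursor position: 0 at the start
third_read = f.read()    # the full contents again
f.close()

print(repr(second_read))         # ''
print(position)                  # 0
print(first_read == third_read)  # True
```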

You can read a file into a list of its lines using the `readlines` method. Use caution with large files, since everything will be held in memory. We will learn how to iterate over large files later in the course.

In [45]:
# Readlines returns a list of the lines in the file
my_file.seek(0)
my_file.readlines()

['Hello, this is a quick test file.\n']
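
For comparison, here is a small sketch of `readlines` next to its single-line counterpart `readline`, which returns one line per call. The file name `lines_demo.txt` and the temporary directory are assumptions for illustration.

```python
import os
import tempfile

# Create a two-line sample file in a temporary directory (assumed location)
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, 'lines_demo.txt')
f = open(sample_path, 'w')
f.write('First Line\nSecond Line\n')
f.close()

# readlines() loads every line into one list, newlines included
f = open(sample_path)
all_lines = f.readlines()
f.close()

# readline() returns one line per call and '' once the file is exhausted
f = open(sample_path)
first_line = f.readline()
second_line = f.readline()
end_marker = f.readline()
f.close()

print(all_lines)         # ['First Line\n', 'Second Line\n']
print(repr(end_marker))  # ''
```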

When you have finished using a file, it is always good practice to close it.

In [46]:
my_file.close()
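
A common way to make sure a file is always closed, even if an error occurs partway through, is the `with` statement. The sketch below goes slightly beyond this section; `with_demo.txt` and the temporary directory are assumptions for illustration.

```python
import os
import tempfile

# Sample file in a temporary directory (assumed location)
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, 'with_demo.txt')

# The file is closed automatically when the indented block ends
with open(sample_path, 'w') as f:
    f.write('Hello, this is a quick test file.\n')

with open(sample_path) as f:
    contents = f.read()

print(f.closed)  # True

# Reading from a closed file raises a ValueError
try:
    f.read()
    read_failed = False
except ValueError as err:
    read_failed = True
    print('Cannot read:', err)
```

With this pattern there is no separate `close()` call to forget.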

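To write to a file, reopen it in a mode that permits writing, such as `w+`. This mode allows both writing and reading, and it erases any existing content:

In [49]:
# Write a line, then read the file
my_file = open('test.txt', 'w+')
my_file.write('This is a new line')
my_file.seek(0)
my_file.read()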

'This is a new line'

In [50]:
my_file.close()  # always do this when you're done with a file

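Opening the file in a mode such as `a+` adds new text to the end of the existing content instead of erasing it:

In [52]:
my_file = open('test.txt', 'a+')  # append and read
my_file.write('\nThis is text being appended to test.txt\nAnd another line here.')
my_file.seek(0)
print(my_file.read())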

This is a new line
This is text being appended to test.txt
And another line here.


In [53]:
my_file.close()
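
The second argument to `open` is the mode, which controls whether a file is read, overwritten, or appended to. As a rough sketch of the three most common modes (`'r'`, `'w'`, and `'a'`), using an assumed file name `modes_demo.txt` in a temporary directory:

```python
import io
import os
import tempfile

# Sample file in a temporary directory (assumed location)
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, 'modes_demo.txt')

# 'w' creates the file, or erases it first if it already exists
f = open(sample_path, 'w')
f.write('old contents')
f.close()

f = open(sample_path, 'w')
f.write('This is a new line')   # replaces 'old contents'
f.close()

# 'a' keeps the existing text and writes at the end
f = open(sample_path, 'a')
f.write('\nAppended line')
f.close()

# 'r' (the default) is read-only, so writing is refused
f = open(sample_path, 'r')
final_text = f.read()
try:
    f.write('x')
    write_failed = False
except io.UnsupportedOperation:
    write_failed = True
f.close()

print(final_text)
print(write_failed)  # True
```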

## Iterating through a File

Let's get a quick preview of a for loop by iterating over a text file. First let's make a new text file with some IPython magic:

In [55]:
%%writefile test.txt
First Line
Second Line

Overwriting test.txt


Now we can use a little bit of control flow to tell the program to loop through every line of the file and do something:

In [56]:
for line in open('test.txt'):
    print(line)

First Line

Second Line



Don't worry about fully understanding this yet; for loops are coming up soon. But we'll break down what we did above. We said that for every line in this text file, go ahead and print that line. It's important to note a few things here:

1. We could have called the "line" object anything (see example below).
2. By not calling `.read()` on the file, the whole text file was not stored in memory.
3. Notice the indent on the second line for print. This whitespace is required in Python.

In [57]:
# Pertaining to the first point above
for asdf in open('test.txt'):
    print(asdf)

First Line

Second Line
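
The blank lines in the output above appear because each line read from the file keeps its trailing newline, and `print` adds another one. One possible way to avoid this, sketched here with an assumed file name `loop_demo.txt` in a temporary directory, is to strip the newline before printing:

```python
import os
import tempfile

# Sample file in a temporary directory (assumed location)
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, 'loop_demo.txt')
with open(sample_path, 'w') as f:
    f.write('First Line\nSecond Line\n')

cleaned = []
with open(sample_path) as f:
    # Iterating over the file object yields one line at a time
    for line in f:
        text = line.rstrip('\n')  # drop the trailing newline
        cleaned.append(text)
        print(text)
```

Calling `print(line, end='')` would have a similar effect, since it stops `print` from adding its own newline.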



We'll learn a lot more about this later, but up next: Sets and Booleans!