# <u>Module 0</u> - Python Crash Course

[Python](https://www.python.org/) is a versatile and powerful programming language that has gained immense popularity in the world of software development. Known for its simplicity and readability, Python has become one of the go-to languages for both beginners and experienced developers.

First, download the example.

In [1]:
import os

# Check whether the example file (read.txt) already exists.
if not os.path.exists("read.txt"):
    # URL of the zip data file to download.
    url = "https://github.com/ntsourakis/Machine-Learning-Techniques-for-Text-Seminar/raw/main/module-00/data.zip"

    # If it doesn't exist, download the zip file.
    !wget {url}

    # Unzip the file into the "data" folder.
    !unzip -q "data.zip"

'wget' is not recognized as an internal or external command,
operable program or batch file.
'unzip' is not recognized as an internal or external command,
operable program or batch file.


## Why Python?

One of the key reasons behind Python's widespread adoption is its clear and concise syntax, which emphasizes readability and reduces the cost of program maintenance. This makes Python an excellent choice for individuals entering the world of programming, as it allows them to focus on problem-solving rather than getting bogged down by complex syntax.

Moreover, Python has a vast and active community of developers who contribute to its extensive collection of libraries and frameworks. This rich ecosystem enables developers to easily access pre-built modules, saving time and effort in the development process.

<hr/>

## Applications of Python

Python finds applications in a wide range of fields, including web development, data science, artificial intelligence, machine learning, automation, and more. The language's versatility makes it suitable for various domains, and its ease of integration with other languages and tools further enhances its appeal.

Whether you are a beginner learning to code or an experienced developer looking to build complex applications, Python provides the tools and resources needed to accomplish your goals. Its versatility, coupled with an extensive community and robust ecosystem, positions Python as a top choice for individuals and organizations seeking a reliable and efficient programming language.

<hr/>

## Variables

In Python, a variable is a name that refers to a value (an object) stored in the computer's memory. Think of it as a label that you can use to refer to a specific piece of data. Unlike some other programming languages, Python does not require explicit declaration of the variable type. You can simply assign a value to a variable, and Python will determine its type dynamically.

In the code below, we assign different types of values to variables (`int`, `str`, `float`, and `bool`). The variable names (`age`, `name`, `pi_value`, and `is_student`) are user-defined, and they can be chosen according to the context of your program.

In [2]:
# Example of variable assignment.
age = 25
name = "John Doe"
pi_value = 3.14
is_student = True
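
To see dynamic typing in action, the following minimal sketch uses the built-in `type()` function to inspect a name before and after it is rebound to a different kind of value. The name `value` is made up for this illustration.

```python
# Inspect the type that Python inferred for the assigned value.
value = 25
first_type = type(value)    # <class 'int'>

# The same name can later refer to a value of another type.
value = "twenty-five"
second_type = type(value)   # <class 'str'>

print(first_type, second_type)
```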

<hr/>

## Built-in Data Types

Python comes with several built-in data types that define the nature of a variable. Here are some common ones:

`int`: Integer type for whole numbers.

In [3]:
age = 25

`float`: Floating-point type for decimal numbers.

In [4]:
pi_value = 3.14

`str`: String type for text.

In [5]:
name = "John Doe"

`bool`: Boolean type for representing truth values (True or False).

In [6]:
is_student = True

`list`: Ordered collection of items.

In [7]:
numbers = [1, 2, 3, 4, 5]

`tuple`: Immutable ordered collection of items.

In [8]:
coordinates = (4, 7)

`dict`: Dictionary type for key-value pairs.

In [9]:
person_info = {'name': 'John', 'age': 25, 'is_student': True}

`set`: A collection of unique elements with no duplicate values.

In [10]:
number_set = {1, 2, 3, 4, 5}

Understanding these built-in data types is essential for effective programming in Python, as they provide the foundation for manipulating and organizing data in your programs. As you become more familiar with Python, you'll discover additional data types and structures that enhance the language's flexibility and expressiveness.
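
As a small illustration of how the collection types differ, the sketch below builds each one from the same values. The variable names are chosen for this example only.

```python
values = [3, 1, 3, 2]

as_list = list(values)    # keeps order and duplicates
as_tuple = tuple(values)  # same content, but cannot be modified
as_set = set(values)      # duplicates are dropped
lookup = {"first": values[0], "last": values[-1]}  # access by key

print(as_list)            # [3, 1, 3, 2]
print(as_tuple)           # (3, 1, 3, 2)
print(len(as_set))        # 3
print(lookup["last"])     # 2
```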

<hr/>

## Printing with Built-in Types

In Python, the `print()` function is a versatile tool for displaying information. It allows you to output data of various types to the console. Let's explore different ways to use the `print()` function with built-in types.

### 1. Printing Variables

You can directly print the value of a variable using the `print()` function:

In [11]:
name = "John"
age = 30
print(name)  # Output: John
print(age)   # Output: 30

John
30


### 2. Concatenation in Print

Concatenate multiple values within the `print()` function using the `+` operator:

In [12]:
name = "John"
age = 30
print("Name:", name + ", Age:", age)  # Output: Name: John, Age: 30

Name: John, Age: 30


### 3. Formatting Strings

Use string formatting to embed variables within a string:

In [13]:
name = "John"
age = 30
print("Name: {}, Age: {}".format(name, age))  # Output: Name: John, Age: 30

Name: John, Age: 30


Or, using f-strings (formatted string literals):

In [14]:
name = "John"
age = 30
print(f"Name: {name}, Age: {age}")  # Output: Name: John, Age: 30

Name: John, Age: 30


### 4. Printing Multiple Values
Print multiple values using commas within the `print()` function:

In [15]:
name = "John"
age = 30
print("Name:", name, "Age:", age)  # Output: Name: John Age: 30

Name: John Age: 30


### 5. Printing with Separator
Specify a separator between values using the `sep` parameter:

In [16]:
name = "John"
age = 30
print(name, age, sep=" | ")  # Output: John | 30

John | 30


### 6. Printing with End
Control the end character with the `end` parameter:

In [17]:
name = "John"
age = 30
print("Name:", name, end=" | ")
print("Age:", age)  # Output: Name: John | Age: 30

Name: John | Age: 30


Now, try the following and reflect on the output:

In [18]:
print(5)
print("5")
print(5 + 5)
print("5 + 5")
print("5" + "5")
print(5*2)
print("5"*2)

5
5
10
5 + 5
55
10
55
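
One way to read the output above: `+` and `*` behave differently for numbers and for strings. The sketch below restates that and also shows what happens when the two types are mixed with `+`, which Python rejects with a `TypeError`. The variable names are illustrative.

```python
number_sum = 5 + 5        # arithmetic addition -> 10
text_sum = "5" + "5"      # string concatenation -> "55"
repeated = "5" * 2        # string repetition -> "55"

try:
    mixed = "5" + 5       # str + int is not allowed
except TypeError as error:
    mixed = None
    print("Cannot mix str and int:", error)

# Convert explicitly to choose the behavior you want.
as_number = int("5") + 5  # 10
as_text = "5" + str(5)    # "55"
```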


These techniques provide flexibility in formatting and presenting data when using the `print()` function in Python. Choose the method that best suits your needs for displaying information in your programs.

<hr/>

## Operators in Python

Operators in Python are special symbols or keywords that perform operations on operands. Operands can be variables, values, or expressions. Python supports various types of operators, including arithmetic, comparison, logical, assignment, and more. Let's explore some of the commonly used operators in Python.

### 1. Arithmetic Operators

Arithmetic operators perform basic mathematical operations.

* Addition (`+`):

In [19]:
result = 5 + 3  # Result: 8
result

8

* Subtraction (`-`):

In [20]:
result = 5 - 3  # Result: 2
result

2

* Multiplication (`*`):

In [21]:
result = 5 * 3  # Result: 15
result

15

* Division (`/`):

In [22]:
result = 10 / 2  # Result: 5.0 (float)
result

5.0

* Floor Division (`//`):

In [23]:
result = 10 // 3  # Result: 3 (integer; the quotient is rounded down)
result

3

* Modulus (`%`):

In [24]:
result = 10 % 3  # Result: 1 (remainder of the division)
result

1

* Exponentiation (`**`):

In [25]:
result = 2 ** 3  # Result: 8 (2 raised to the power of 3)
result

8
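
Floor division and modulus are linked: for integers `a` and `b`, `a == (a // b) * b + a % b`. The sketch below checks this identity and includes a negative operand, where `//` rounds toward negative infinity rather than toward zero. The built-in `divmod()` returns both results at once.

```python
a, b = 10, 3
quotient, remainder = divmod(a, b)   # same as (a // b, a % b)
print(quotient, remainder)           # 3 1

# With a negative operand, // rounds down (toward negative infinity).
negative_quotient = -7 // 2          # -4, not -3
negative_remainder = -7 % 2          # 1, so that -4 * 2 + 1 == -7

identity_holds = (a // b) * b + a % b == a
print(identity_holds)                # True
```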

### 2. Comparison Operators
Comparison operators are used to compare values and return Boolean results.

* Equal to (`==`):

In [26]:
result = (5 == 5)  # Result: True
result

True

* Not equal to (`!=`):

In [27]:
result = (5 != 3)  # Result: True
result

True

* Greater than (`>`):

In [28]:
result = (5 > 3)  # Result: True
result

True

* Less than (`<`):

In [29]:
result = (5 < 3)  # Result: False
result

False

* Greater than or equal to (`>=`):

In [30]:
result = (5 >= 5)  # Result: True
result

True

* Less than or equal to (`<=`):

In [31]:
result = (5 <= 3)  # Result: False
result

False

### 3. Logical Operators

Logical operators perform logical operations on Boolean values.

* Logical AND (`and`):

In [32]:
result = (True and False)  # Result: False
result

False

* Logical OR (`or`):

In [33]:
result = (True or False)  # Result: True
result

True

* Logical NOT (`not`):

In [34]:
result = not True  # Result: False
result

False
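
A detail worth knowing about `and` and `or` is that they short-circuit: the right-hand operand is evaluated only when it is needed. The following sketch uses a deliberately invalid division as the right-hand operand to show that it never runs.

```python
# `and` stops at the first falsy operand, so the division is never evaluated.
result_and = False and (1 / 0)     # False, no ZeroDivisionError

# `or` stops at the first truthy operand.
result_or = True or (1 / 0)        # True

# The operators return one of their operands, not necessarily a bool.
fallback_name = "" or "anonymous"  # "anonymous"

print(result_and, result_or, fallback_name)
```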

These are just a few examples of the many operators available in Python. Understanding these operators is crucial for effective programming, allowing you to manipulate and compare values in your code efficiently.

<hr/>

## Type Casting

`Type casting`, also known as type conversion, refers to the process of converting one data type into another. Python provides built-in functions for type casting, allowing you to change the type of a variable or value as needed. Here are some common type casting functions in Python:

### 1. `int()`
Converts a value to an integer.

In [35]:
float_number = 3.14
integer_number = int(float_number)
print(integer_number)  # Output: 3

3


### 2. `float()`
Converts a value to a floating-point number.

In [36]:
int_number = 5
float_number = float(int_number)
print(float_number)  # Output: 5.0

5.0


### 3. `str()`
Converts a value to a string.

In [37]:
number = 123
str_number = str(number)
print(str_number)  # Output: 123

123


### 4. `bool()`
Converts a value to a boolean.

In [38]:
non_zero_number = 42
is_true = bool(non_zero_number)
print(is_true)  # Output: True

True


### 5. `list()`, `tuple()`, `set()`
Converts an iterable (like a string or list) to a list, tuple, or set, respectively.

In [39]:
text = "Python"
list_text = list(text)
tuple_text = tuple(text)
set_text = set(text)

print(list_text)  # Output: ['P', 'y', 't', 'h', 'o', 'n']
print(tuple_text)  # Output: ('P', 'y', 't', 'h', 'o', 'n')
print(set_text)  # Output: the same six characters in arbitrary order, e.g. {'o', 'y', 'n', 'h', 'P', 't'}

['P', 'y', 't', 'h', 'o', 'n']
('P', 'y', 't', 'h', 'o', 'n')
{'o', 'y', 'n', 'h', 'P', 't'}


### 6. `dict()`
Converts a sequence of key-value pairs to a dictionary.

In [40]:
pairs = [('a', 1), ('b', 2), ('c', 3)]
dictionary = dict(pairs)
print(dictionary)  # Output: {'a': 1, 'b': 2, 'c': 3}

{'a': 1, 'b': 2, 'c': 3}


### 7. `complex()`
Converts a real number to a complex number.

In [41]:
real_number = 2
complex_number = complex(real_number)
print(complex_number)  # Output: (2+0j)

(2+0j)


What is the result of the following?

In [42]:
result = int(float("3.2"))

Type casting is a valuable tool in Python, allowing you to ensure compatibility between different data types and perform operations that require consistent types. However, it's essential to be aware of potential data loss or unexpected behavior when converting between certain types, especially when precision may be affected, as in the case of converting from a float to an int.
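
The following sketch illustrates a few of the surprises mentioned above. The values are arbitrary examples.

```python
truncated_up = int(3.99)      # 3: the fractional part is dropped, not rounded
truncated_down = int(-3.99)   # -3: truncation is toward zero
rounded = round(3.99)         # 4: use round() when rounding is intended

# A string holding a decimal number cannot be passed to int() directly.
try:
    int("7.9")
except ValueError as error:
    print("int('7.9') failed:", error)

via_float = int(float("7.9"))  # 7: convert to float first, then truncate

# Any non-empty string is truthy, even the text "False".
text_flag = bool("False")      # True
empty_flag = bool("")          # False
```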

<hr/>

## String Manipulation

String manipulation is a fundamental aspect of programming, and Python provides a rich set of tools for working with strings. Here are some common techniques and methods for string manipulation:

### 1. Concatenation 

Concatenation is the process of combining strings. You can use the `+` operator to concatenate two or more strings:

In [43]:
first_name = "John"
last_name = "Doe"
full_name = first_name + " " + last_name
print(full_name)  # Output: John Doe

John Doe


### 2. String Interpolation 

String interpolation allows you to embed variables within a string. There are multiple ways to achieve this, such as using the `%` operator or the `.format()` method:

In [44]:
name = "Alice"
age = 28

# Using % operator.
message = "Hello, %s! You are %d years old." % (name, age)

# Using .format() method.
message = "Hello, {}! You are {} years old.".format(name, age)

print(message)
# Output: Hello, Alice! You are 28 years old.

Hello, Alice! You are 28 years old.


In Python 3.6 and later, you can use f-strings for a more concise and readable syntax:

In [45]:
message = f"Hello, {name}! You are {age} years old."

### 3. String Slicing

You can extract substrings from a string using slicing. The syntax is `string[start:stop]`, where `start` is the index of the starting character, and `stop` is the index of the character just after the end of the desired substring:

In [46]:
sentence = "Python is a powerful programming language."

# Extracting a substring.
substring = sentence[0:6]
print(substring)  # Output: Python

Python
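
Slices also accept omitted bounds, negative indices, and an optional step (`string[start:stop:step]`). A short sketch, using an illustrative variable `word`:

```python
word = "Python"

first_two = word[:2]        # "Py"     (start defaults to 0)
from_third = word[2:]       # "thon"   (stop defaults to the end)
last_char = word[-1]        # "n"      (negative indices count from the end)
every_other = word[::2]     # "Pto"    (step of 2)
reversed_word = word[::-1]  # "nohtyP" (step of -1 walks backwards)

print(first_two, from_third, last_char, every_other, reversed_word)
```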


### 4. String Methods

Python provides the built-in `len()` function and a variety of string methods for manipulation, including:

* `len()`: Built-in function that returns the length of a string.

* `lower()`, `upper()`: Converts a string to lowercase or uppercase.

* `strip()`: Removes leading and trailing whitespace.

* `replace()`: Replaces a substring with another substring.

In [47]:
text = "   Python Programming   "
print(len(text))           # Output: 24
print(text.lower())        # Output: python programming (the surrounding spaces are kept)
print(text.strip())        # Output: Python Programming
print(text.replace('P', 'J'))  # Output:   Jython Jrogramming   

24
   python programming   
Python Programming
   Jython Jrogramming   
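
Strings are immutable, so each of these methods returns a new string and leaves the original untouched. That also means calls can be chained, as in this small sketch (the name `slug` is illustrative):

```python
text = "   Python Programming   "

# Each method returns a new string, so calls can be chained.
slug = text.strip().lower().replace(" ", "_")
print(slug)          # python_programming

# The original string is unchanged.
print(len(text))     # 24

# lstrip() and rstrip() trim only one side.
left_trimmed = text.lstrip()    # "Python Programming   "
right_trimmed = text.rstrip()   # "   Python Programming"
```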


### 5. String Splitting and Joining

Use the `split()` method to split a string into a list of substrings based on a delimiter. Conversely, the `join()` method joins a list of strings into a single string:

In [48]:
csv_data = "apple,orange,banana,grape"
fruits_list = csv_data.split(',')
print(fruits_list)  # Output: ['apple', 'orange', 'banana', 'grape']

# Joining the list into a string
joined_string = '-'.join(fruits_list)
print(joined_string)  # Output: apple-orange-banana-grape

['apple', 'orange', 'banana', 'grape']
apple-orange-banana-grape
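
A few variations on `split()` and `join()` that come up often, shown as a sketch with made-up sample strings:

```python
# With no argument, split() splits on any run of whitespace.
words = "  machine   learning for\ttext ".split()
print(words)   # ['machine', 'learning', 'for', 'text']

# join() expects strings, so convert other types first.
scores = [90, 85, 77]
line = ",".join(str(score) for score in scores)
print(line)    # 90,85,77

# A second argument (maxsplit) limits the number of splits.
key, value = "name=John=Doe".split("=", 1)
print(key, value)  # name John=Doe
```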


### 6. Checking and Formatting

* `startswith()`, `endswith()`: Check if a string starts or ends with a specific substring.

* `in` keyword: Check if a substring is present in a string.

* `format()` method: Format strings with placeholders.

In [49]:
email = "user@example.com"
print(email.startswith("user"))  # Output: True
print("@" in email)              # Output: True

# String formatting
name = "Alice"
age = 30
formatted_string = "Name: {}, Age: {}".format(name, age)
print(formatted_string)
# Output: Name: Alice, Age: 30

True
True
Name: Alice, Age: 30
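
The example above does not exercise `endswith()`, and placeholders can also carry a format specification. The sketch below covers both; the file name and numbers are arbitrary.

```python
filename = "report.txt"
is_text_file = filename.endswith(".txt")         # True
is_image = filename.endswith((".png", ".jpg"))   # False; a tuple of suffixes is allowed

# Format specifications control how a value is rendered.
price = 3.14159
two_decimals = "{:.2f}".format(price)   # "3.14"
padded = f"{7:03d}"                     # "007"
aligned = f"{'id':>5}"                  # "   id"

print(is_text_file, is_image, two_decimals, padded, aligned)
```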


Understanding these string manipulation techniques will empower you to effectively work with text data in Python, whether you're processing user inputs, parsing files, or formatting output.

<hr/>

## List Manipulation

Lists are a versatile and widely used data structure in Python, providing dynamic arrays to store and manipulate collections of items. Here are some common techniques and methods for list manipulation:

### 1. Creating Lists

You can create lists by enclosing items in square brackets `[]`:

In [50]:
numbers = [1, 2, 3, 4, 5]
fruits = ["apple", "orange", "banana", "grape"]

### 2. Accessing Elements

Access elements in a list using indexing. Remember that Python uses 0-based indexing:

In [51]:
first_number = numbers[0]  # Access the first element
print(first_number)        # Output: 1

1


### 3. Slicing Lists

Slice a list to extract a subset of elements:

In [52]:
subset = numbers[1:4]  # Elements at index 1, 2, 3
print(subset)          # Output: [2, 3, 4]

[2, 3, 4]
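
As with strings, list slices support negative indices and a step. A slice is also a new list, which the sketch below demonstrates:

```python
numbers = [1, 2, 3, 4, 5]

last = numbers[-1]            # 5
last_two = numbers[-2:]       # [4, 5]
every_second = numbers[::2]   # [1, 3, 5]

# A slice is a new list, so changing it leaves the original intact.
copy_of_numbers = numbers[:]
copy_of_numbers[0] = 99
print(numbers[0], copy_of_numbers[0])  # 1 99
```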


### 4. Modifying Lists

Lists are mutable, meaning you can modify them after creation.

* Appending Elements:

In [53]:
fruits.append("kiwi")  # Append "kiwi" to the end
print(fruits)          # Output: ['apple', 'orange', 'banana', 'grape', 'kiwi']

['apple', 'orange', 'banana', 'grape', 'kiwi']


* Inserting Elements:

In [54]:
fruits.insert(2, "pear")  # Insert "pear" at index 2
print(fruits)             # Output: ['apple', 'orange', 'pear', 'banana', 'grape', 'kiwi']

['apple', 'orange', 'pear', 'banana', 'grape', 'kiwi']


* Removing Elements:

In [55]:
fruits.remove("orange")  # Remove the first occurrence of "orange"
print(fruits)            # Output: ['apple', 'pear', 'banana', 'grape', 'kiwi']

['apple', 'pear', 'banana', 'grape', 'kiwi']


* Pop and Delete:

In [56]:
popped_item = fruits.pop(1)  # Remove and return the element at index 1
del fruits[0]                # Delete the element at index 0
print(popped_item, fruits)   # Output: pear ['banana', 'grape', 'kiwi']

pear ['banana', 'grape', 'kiwi']
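
Two details of the removal methods are easy to miss: `pop()` without an index removes the last element, and `remove()` raises a `ValueError` if the value is absent. A sketch with a fresh, illustrative list:

```python
fruits = ["apple", "pear", "banana"]

last_item = fruits.pop()     # with no index, pop() removes the last element
print(last_item, fruits)     # banana ['apple', 'pear']

# remove() raises ValueError when the value is not in the list.
try:
    fruits.remove("mango")
except ValueError:
    print("mango is not in the list")

# Checking membership first avoids the exception.
if "pear" in fruits:
    fruits.remove("pear")
print(fruits)                # ['apple']
```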


### 5. List Concatenation and Repetition

Combine lists using concatenation (`+`) or repeat a list using repetition (`*`):

In [57]:
combined_list = numbers + fruits
repeated_list = numbers * 3
print(combined_list)
# Output: [1, 2, 3, 4, 5, 'banana', 'grape', 'kiwi']
print(repeated_list)
# Output: [1, 2, 3, 4, 5, 1, 2, 3, 4, 5, 1, 2, 3, 4, 5]

[1, 2, 3, 4, 5, 'banana', 'grape', 'kiwi']
[1, 2, 3, 4, 5, 1, 2, 3, 4, 5, 1, 2, 3, 4, 5]
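
The sketch below contrasts `+` with the in-place `extend()` method and points out a common surprise when `*` is applied to a list that contains other lists. The variable names are illustrative.

```python
numbers = [1, 2, 3]

# + builds a new list; extend() (or +=) grows the existing one in place.
combined = numbers + [4, 5]   # numbers is still [1, 2, 3]
numbers.extend([4, 5])        # numbers is now [1, 2, 3, 4, 5]

# Repeating a list of lists copies references, not the inner lists.
shared_rows = [[0, 0]] * 2
shared_rows[0][0] = 1
print(shared_rows)            # [[1, 0], [1, 0]]

# A comprehension creates independent inner lists.
independent_rows = [[0, 0] for _ in range(2)]
independent_rows[0][0] = 1
print(independent_rows)       # [[1, 0], [0, 0]]
```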


### 6. List Comprehension

List comprehensions provide a concise way to create lists:

In [58]:
squared_numbers = [x ** 2 for x in numbers]
print(squared_numbers)  # Output: [1, 4, 9, 16, 25]

[1, 4, 9, 16, 25]
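
A comprehension can also include a condition. The sketch below filters while transforming and shows the equivalent explicit loop for comparison.

```python
numbers = [1, 2, 3, 4, 5]

# Comprehension with a filter: squares of the even numbers only.
even_squares = [x ** 2 for x in numbers if x % 2 == 0]
print(even_squares)   # [4, 16]

# Equivalent explicit loop.
even_squares_loop = []
for x in numbers:
    if x % 2 == 0:
        even_squares_loop.append(x ** 2)

print(even_squares == even_squares_loop)  # True
```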


### 7. Sorting and Reversing Lists

Sort a list in ascending or descending order, or reverse the order:

In [59]:
numbers.sort()        # Sort in ascending order
fruits.sort(reverse=True)  # Sort in descending order
print(numbers)        # Output: [1, 2, 3, 4, 5]
print(fruits)         # Output: ['kiwi', 'grape', 'banana']

[1, 2, 3, 4, 5]
['kiwi', 'grape', 'banana']
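
The cell above sorts in place. As a complementary sketch, the built-in `sorted()` returns a new list, `key` customizes the ordering, and `reverse()` flips the current order without sorting. The sample list is made up.

```python
fruits = ["kiwi", "banana", "fig"]

# sorted() returns a new list and leaves the original untouched.
alphabetical = sorted(fruits)          # ['banana', 'fig', 'kiwi']
by_length = sorted(fruits, key=len)    # ['fig', 'kiwi', 'banana']

# reverse() flips the current order in place without sorting.
fruits.reverse()
print(fruits)                          # ['fig', 'banana', 'kiwi']

# sort() works in place and returns None.
returned = fruits.sort()
print(returned, fruits)                # None ['banana', 'fig', 'kiwi']
```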


### 8. List Membership and Count

Check if an item is present in a list using the `in` keyword, and count occurrences with the `count()` method:

In [60]:
print("kiwi" in fruits)      # Output: True
print(numbers.count(3))       # Output: 1

True
1


Understanding these list manipulation techniques will enhance your ability to work with dynamic collections of data in Python, whether you're dealing with numerical data, text data, or a combination of both.

<hr/>

## Conditional Statements

Conditional statements allow you to control the flow of your program based on specific conditions. In Python, you can use the `if`, `elif` (else if), and `else` statements for this purpose.

### 1. `if` Statement

The `if` statement checks a condition, and if it is true, the indented block of code beneath it is executed:

In [61]:
x = 10

if x > 5:
    print("x is greater than 5")

x is greater than 5


### 2. `if`-`else` Statement

The `if`-`else` statement adds an alternative block of code to execute when the condition is false:

In [62]:
x = 3

if x > 5:
    print("x is greater than 5")
else:
    print("x is not greater than 5")

x is not greater than 5


### 3. `if`-`elif`-`else` Statement

The `if`-`elif`-`else` statement allows you to check multiple conditions:

In [63]:
x = 5

if x > 5:
    print("x is greater than 5")
elif x == 5:
    print("x is equal to 5")
else:
    print("x is less than 5")

x is equal to 5


### 4. Nested `if` Statements

You can nest `if` statements to check conditions within conditions:

In [64]:
x = 10
y = 5

if x > 5:
    print("x is greater than 5")
    
    if y > 2:
        print("y is also greater than 2")
    else:
        print("y is not greater than 2")

x is greater than 5
y is also greater than 2


### 5. Logical Operators (`and`, `or`, `not`)

Combine conditions using logical operators:

In [65]:
age = 25

if age >= 18 and age <= 30:
    print("You are between 18 and 30 years old")

if age < 18 or age > 65:
    print("You are either under 18 or over 65")

if not age > 30:
    print("You are not older than 30")

You are between 18 and 30 years old
You are not older than 30
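
Python also lets comparisons be chained, which gives a more compact way to write the range check above. A brief sketch:

```python
age = 25

# Chained comparison: reads like the mathematical notation 18 <= age <= 30.
in_range_chained = 18 <= age <= 30
in_range_and = age >= 18 and age <= 30

# `not` binds more loosely than comparisons, so these two are the same.
not_older = not age > 30
not_older_explicit = not (age > 30)

print(in_range_chained, in_range_and, not_older, not_older_explicit)
# True True True True
```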


### 6. Ternary Conditional Expression

Use a ternary conditional expression for a concise way to write simple `if`-`else` statements:

In [66]:
x = 8

message = "x is greater than 5" if x > 5 else "x is not greater than 5"
print(message)

x is greater than 5


Conditional statements are essential for creating dynamic and responsive programs. They allow your code to make decisions and respond to different situations, making your programs more flexible and capable of handling various scenarios.

<hr/>


## Loops

Loops are essential for repeating a block of code multiple times. In Python, there are two main types of loops: `for` loops and `while` loops.

### 1. `for` Loop

The `for` loop is used for iterating over a sequence (such as a list, tuple, string, or range). It executes a block of code for each item in the sequence:

Example with a List:

In [67]:
fruits = ["apple", "banana", "cherry"]

for fruit in fruits:
    print(fruit)

apple
banana
cherry


Example with a Range:

In [68]:
for i in range(5):
    print(i)

0
1
2
3
4


### 2. `while` Loop

The `while` loop continues to execute a block of code as long as a specified condition is true:

In [69]:
count = 0

while count < 5:
    print(count)
    count += 1

0
1
2
3
4


### 3. Loop Control Statements

a. `break` Statement

The `break` statement is used to exit the loop prematurely, regardless of whether the loop condition is true or false:

In [70]:
for number in range(10):
    if number == 5:
        break
    print(number)

0
1
2
3
4


b. `continue` Statement

The `continue` statement is used to skip the rest of the code inside the loop for the current iteration and move to the next iteration:

In [71]:
for number in range(10):
    if number % 2 == 0:
        continue
    print(number)

1
3
5
7
9


c. `else` Clause in Loops

Python allows an `else` clause to be associated with a loop. The `else` block is executed when the loop finishes normally, that is, when it ends without encountering a `break` statement:

In [72]:
for i in range(5):
    print(i)
else:
    print("Loop completed without a break")

0
1
2
3
4
Loop completed without a break
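
The loop `else` clause is most useful together with `break`, for example in a search. In the sketch below, `find_first_negative` is a hypothetical helper written for this illustration; its `else` block runs only when no `break` occurred.

```python
def find_first_negative(values):
    """Hypothetical helper: describe the first negative number, if any."""
    for value in values:
        if value < 0:
            message = f"Found negative value: {value}"
            break
    else:
        # Runs only if the loop finished without hitting break.
        message = "No negative values"
    return message

print(find_first_negative([3, -1, 4]))  # Found negative value: -1
print(find_first_negative([3, 1, 4]))   # No negative values
```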


### 4. Nested Loops

You can have loops inside loops, known as nested loops:

In [73]:
for i in range(3):
    for j in range(2):
        print(f"({i}, {j})")

(0, 0)
(0, 1)
(1, 0)
(1, 1)
(2, 0)
(2, 1)


### 5. Iterating Over Dictionaries

You can use the `items()` method to iterate over key-value pairs in a dictionary:

In [74]:
person = {"name": "Alice", "age": 30, "city": "Wonderland"}

for key, value in person.items():
    print(f"{key}: {value}")

name: Alice
age: 30
city: Wonderland


### 6. `enumerate()` Function

The `enumerate()` function is used to iterate over a sequence and keep track of the index:

In [75]:
fruits = ["apple", "banana", "cherry"]

for index, fruit in enumerate(fruits):
    print(f"Index {index}: {fruit}")

Index 0: apple
Index 1: banana
Index 2: cherry
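
`enumerate()` accepts a `start` argument, and the related built-in `zip()` walks several sequences in parallel. A sketch with made-up prices:

```python
fruits = ["apple", "banana", "cherry"]
prices = [0.5, 0.25, 3.0]

# start=1 makes the counter begin at 1 instead of 0.
numbered = [f"{position}. {fruit}" for position, fruit in enumerate(fruits, start=1)]
print(numbered)    # ['1. apple', '2. banana', '3. cherry']

# zip() pairs up items from several sequences.
price_list = [(fruit, price) for fruit, price in zip(fruits, prices)]
print(price_list)  # [('apple', 0.5), ('banana', 0.25), ('cherry', 3.0)]
```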


Loops are fundamental to programming and are used for a variety of tasks, from iterating over data to implementing control flow in your programs. Understanding how to effectively use loops is key to writing efficient and readable code.

<hr/>

## Tuples

Tuples are a versatile and immutable data type in Python. They are similar to lists but with a key difference: once a tuple is created, its elements cannot be changed or modified. Tuples are created using parentheses `()`.

### 1. Creating Tuples

In [76]:
# Creating an empty tuple.
empty_tuple = ()

# Creating a tuple with elements.
fruits = ("apple", "banana", "cherry")

### 2. Accessing Elements

Tuples support indexing and slicing, similar to lists:

In [77]:
print(fruits[0])   # Output: apple
print(fruits[1:3])  # Output: ('banana', 'cherry')

apple
('banana', 'cherry')


### 3. Immutable Nature

Once a tuple is created, you cannot modify its elements. However, you can create a new tuple with modifications:

In [78]:
# Trying to modify a tuple (will raise an error).
# fruits[0] = "orange"

# Creating a new tuple with modifications.
modified_fruits = fruits + ("orange", "grape")
print(modified_fruits)  # Output: ('apple', 'banana', 'cherry', 'orange', 'grape')

('apple', 'banana', 'cherry', 'orange', 'grape')
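
To see the error that the commented-out line would produce, the sketch below catches it with `try`/`except`. It also shows one possible workaround, converting to a list and back, which creates a new tuple rather than changing the old one.

```python
fruits = ("apple", "banana", "cherry")

try:
    fruits[0] = "orange"          # item assignment is not supported
except TypeError as error:
    print("Cannot modify a tuple:", error)

# One workaround: convert to a list, change it, and convert back.
as_list = list(fruits)
as_list[0] = "orange"
updated_fruits = tuple(as_list)
print(updated_fruits)             # ('orange', 'banana', 'cherry')
print(fruits)                     # ('apple', 'banana', 'cherry')
```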


### 4. Tuple Packing and Unpacking

Tuple packing is the process of creating a tuple by grouping comma-separated values (the surrounding parentheses are optional but common). Tuple unpacking is the reverse, where values in a tuple are assigned to variables:

In [79]:
# Tuple packing.
coordinates = (3, 5)

# Tuple unpacking.
x, y = coordinates
print(x, y)  # Output: 3 5

3 5
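
A few more packing and unpacking patterns, sketched with illustrative names:

```python
# Packing works without parentheses; the commas create the tuple.
point = 3, 5
single = (42,)          # a one-element tuple needs a trailing comma

# Unpacking on both sides swaps two variables without a temporary.
x, y = point
x, y = y, x
print(x, y)             # 5 3

# A starred name collects the remaining items into a list.
first, *rest = (1, 2, 3, 4)
print(first, rest)      # 1 [2, 3, 4]
```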


### 5. Tuple Methods

Tuples have a few built-in methods:

* `count()`: Returns the number of occurrences of a value.
* `index()`: Returns the index of the first occurrence of a value.

In [80]:
numbers = (1, 2, 3, 4, 2, 5)

print(numbers.count(2))  # Output: 2 (number of occurrences of 2)
print(numbers.index(4))  # Output: 3 (index of the first occurrence of 4)

2
3


### 6. Iterating Over Tuples

You can use a `for` loop to iterate over the elements of a tuple:

In [81]:
fruits = ("apple", "banana", "cherry")

for fruit in fruits:
    print(fruit)

apple
banana
cherry


### 7. Advantages of Tuples

__Immutable__: Tuples provide data integrity and are suitable for situations where the data should not be changed.

__Performance__: Tuples can be faster than lists for certain operations because of their immutability.

__Valid Dictionary Key__: Tuples can be used as keys in dictionaries, unlike lists.
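
The dictionary-key point can be sketched as follows. The `board` dictionary and its contents are invented for illustration; the list attempt fails because lists are mutable and therefore unhashable.

```python
# Tuples of hashable items can serve as dictionary keys.
board = {
    (0, 0): "rook",
    (0, 1): "knight",
}
print(board[(0, 1)])   # knight

# Lists are mutable and therefore unhashable.
try:
    invalid = {[0, 0]: "rook"}
except TypeError as error:
    invalid = None
    print("Lists cannot be keys:", error)
```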

### 8. When to Use Tuples

Use tuples when you have a collection of items that should remain constant throughout the program's execution. For example, representing coordinates, RGB values, or dates.

In [82]:
rgb_values = ((255, 0, 0), (0, 255, 0), (0, 0, 255))

Understanding the characteristics and use cases of tuples will allow you to choose the appropriate data structure for your specific programming needs.

<hr/>

## Dictionaries

Dictionaries are a powerful and flexible data structure in Python, allowing you to store and retrieve data in key-value pairs. Each key in a dictionary must be unique, and the values can be of any data type. Dictionaries are created using curly braces `{}`.

### 1. Creating Dictionaries

In [83]:
# Creating an empty dictionary.
empty_dict = {}

# Creating a dictionary with key-value pairs.
person = {"name": "John", "age": 30, "city": "New York"}

### 2. Accessing Values
Access values in a dictionary using their corresponding keys:

In [84]:
print(person["name"])  # Output: John
print(person["age"])   # Output: 30

John
30


### 3. Modifying and Adding Items
Dictionaries are mutable, so you can modify values or add new key-value pairs:

In [85]:
# Modifying a value.
person["age"] = 31

# Adding a new key-value pair.
person["gender"] = "Male"

print(person)
# Output: {'name': 'John', 'age': 31, 'city': 'New York', 'gender': 'Male'}

{'name': 'John', 'age': 31, 'city': 'New York', 'gender': 'Male'}


### 4. Dictionary Methods
Dictionaries have various built-in methods for manipulation:

* `keys()`: Returns a view of all keys.
* `values()`: Returns a view of all values.
* `items()`: Returns a view of all key-value pairs.
* `get()`: Returns the value for a given key, with a default value if the key is not present.

In [86]:
print(person.keys())    # Output: dict_keys(['name', 'age', 'city', 'gender'])
print(person.values())  # Output: dict_values(['John', 31, 'New York', 'Male'])
print(person.items())   # Output: dict_items([('name', 'John'), ('age', 31), ('city', 'New York'), ('gender', 'Male')])

# Using get() to avoid KeyError.
print(person.get("country", "USA"))  # Output: USA (default value when key 'country' is not present)

dict_keys(['name', 'age', 'city', 'gender'])
dict_values(['John', 31, 'New York', 'Male'])
dict_items([('name', 'John'), ('age', 31), ('city', 'New York'), ('gender', 'Male')])
USA
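
For contrast, the sketch below shows the `KeyError` that `get()` avoids, along with two ways to remove entries (`pop()` and `del`). It uses a fresh `person` dictionary so it can run on its own.

```python
person = {"name": "John", "age": 31}

# Square brackets raise KeyError for a missing key.
try:
    person["country"]
except KeyError as error:
    print("Missing key:", error)

# get() returns None (or the given default) instead.
country = person.get("country")                     # None
country_or_default = person.get("country", "USA")   # "USA"

# pop() removes a key and returns its value; del removes without returning.
age = person.pop("age")
del person["name"]
print(age, person)                                  # 31 {}
```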


### 5. Checking if a Key Exists
You can use the `in` keyword to check if a key exists in a dictionary:

In [87]:
print("age" in person)     # Output: True
print("country" in person) # Output: False

True
False


### 6. Nested Dictionaries
Dictionaries can contain other dictionaries, creating a nested structure:

In [88]:
contacts = {
    "John": {"phone": "123-456-7890", "email": "john@example.com"},
    "Alice": {"phone": "987-654-3210", "email": "alice@example.com"}
}
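
Values in a nested dictionary are reached by chaining keys. The sketch below repeats the `contacts` structure so it is self-contained; the "Bob" entry and the replacement phone number are made up.

```python
contacts = {
    "John": {"phone": "123-456-7890", "email": "john@example.com"},
    "Alice": {"phone": "987-654-3210", "email": "alice@example.com"},
}

# Chain keys to reach an inner value.
alice_email = contacts["Alice"]["email"]
print(alice_email)                     # alice@example.com

# Update an inner value, or add a whole new entry.
contacts["John"]["phone"] = "555-000-1111"
contacts["Bob"] = {"phone": "555-222-3333", "email": "bob@example.com"}

# Iterate over the outer and inner levels.
for name, details in contacts.items():
    print(name, "->", details["email"])
```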

### 7. Dictionary Comprehensions
Similar to list comprehensions, you can use dictionary comprehensions to create dictionaries in a concise manner:

In [89]:
squares = {x: x**2 for x in range(1, 6)}
print(squares)
# Output: {1: 1, 2: 4, 3: 9, 4: 16, 5: 25}

{1: 1, 2: 4, 3: 9, 4: 16, 5: 25}
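
Dictionary comprehensions can draw on any iterable and can include a condition, as in this sketch with an illustrative word list:

```python
names = ["apple", "kiwi", "banana"]

# Map each word to its length.
lengths = {name: len(name) for name in names}
print(lengths)          # {'apple': 5, 'kiwi': 4, 'banana': 6}

# Add a condition to keep only some entries.
long_names = {name: size for name, size in lengths.items() if size > 4}
print(long_names)       # {'apple': 5, 'banana': 6}

# Swap keys and values (safe here because the lengths are unique).
by_length = {size: name for name, size in lengths.items()}
print(by_length[4])     # kiwi
```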


### 8. When to Use Dictionaries
Dictionaries are suitable when you have a set of unique keys mapped to corresponding values. They are efficient for quick data retrieval and are commonly used for representing structured data like JSON.

Understanding how to create, access, and manipulate dictionaries is crucial for working with complex data structures and building efficient programs in Python.

<hr/>

## Sets
A set is a built-in data type in Python that represents an unordered collection of unique elements. Non-empty sets are defined by enclosing elements in curly braces, such as `{1, 2, 3}`; an empty pair of braces `{}` creates a dictionary, so an empty set is written `set()`. Sets are widely used for tasks that involve mathematical set operations.

### 1. Creating Sets
You can create a set using curly braces or the `set()` constructor:

In [90]:
# Creating a set using curly braces
my_set = {1, 2, 3, 4, 5}

# Creating a set using the set() constructor
another_set = set([3, 4, 5, 6, 7])
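
Two points are worth checking when creating sets: empty braces produce a dictionary, and duplicates are discarded. A sketch with made-up data:

```python
empty_braces = {}        # this is an empty dict, not a set
empty_set = set()        # this is an empty set
print(type(empty_braces).__name__, type(empty_set).__name__)  # dict set

# Duplicates disappear when a set is built.
votes = ["yes", "no", "yes", "yes"]
unique_votes = set(votes)
print(len(unique_votes))  # 2
```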

### 2. Adding and Removing Elements
Sets are mutable, allowing you to add and remove elements:

In [91]:
# Adding an element
my_set.add(6)

# Removing an element
my_set.remove(3)

### 3. Set Operations
Sets support various operations for combining and manipulating sets:

* __Union__ (`|`): Combines elements from two sets, excluding duplicates.
* __Intersection__ (`&`): Returns elements common to both sets.
* __Difference__ (`-`): Returns elements present in the first set but not in the second.
* __Symmetric Difference__ (`^`): Returns elements present in either of the sets, but not in both.

In [92]:
set1 = {1, 2, 3, 4}
set2 = {3, 4, 5, 6}

union_set = set1 | set2        # {1, 2, 3, 4, 5, 6}
intersection_set = set1 & set2  # {3, 4}
difference_set = set1 - set2    # {1, 2}
symmetric_difference_set = set1 ^ set2  # {1, 2, 5, 6}
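
Each operator has an equivalent method, and sets also offer subset and disjointness checks. The sketch below verifies the equivalences rather than relying on printed set order, which is not guaranteed.

```python
set1 = {1, 2, 3, 4}
set2 = {3, 4, 5, 6}

# Each operator has a method counterpart.
same_union = set1.union(set2) == set1 | set2                  # True
same_intersection = set1.intersection(set2) == set1 & set2    # True
same_difference = set1.difference(set2) == set1 - set2        # True

# Subset and disjointness checks.
is_subset = {3, 4}.issubset(set1)          # True
no_overlap = {1, 2}.isdisjoint(set2)       # True

# Sets are unordered, so sort before printing for a stable display.
print(sorted(set1 | set2))                 # [1, 2, 3, 4, 5, 6]
```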

### 4. Membership Testing
You can use the `in` keyword to check if an element is present in a set:

In [93]:
print(3 in my_set)  # Output: False (3 was removed earlier)
print(4 in my_set)  # Output: True

False
True


### 5. Set Methods
Sets have several built-in methods for common set operations:

* `add()`: Adds an element to the set.
* `remove()`: Removes a specified element from the set (raises an error if the element is not present).
* `discard()`: Removes a specified element from the set (does not raise an error if the element is not present).
* `clear()`: Removes all elements from the set.
* `copy()`: Returns a shallow copy of the set.

In [94]:
my_set.add(7)
my_set.remove(2)
my_set.discard(10)  # No error even if 10 is not present
my_set.clear()
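
The difference between `remove()` and `discard()` only shows up when the element is missing. The sketch below triggers that case and also demonstrates `copy()`, using a fresh set so it runs independently.

```python
my_set = {1, 2, 3}

try:
    my_set.remove(10)        # 10 is absent, so this raises KeyError
except KeyError:
    print("remove() failed: 10 is not in the set")

my_set.discard(10)           # silently does nothing

# copy() gives an independent set; clearing it leaves the original alone.
backup = my_set.copy()
backup.clear()
print(sorted(my_set), backup)   # [1, 2, 3] set()
```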

### 6. Frozen Sets
Python also supports an immutable version of sets called frozen sets, created using the `frozenset()` constructor. Frozen sets cannot be modified after creation.

In [95]:
frozen_set = frozenset([1, 2, 3, 4])
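
As a sketch of what immutability means in practice: mutating methods are absent, non-mutating operations still work, and a frozen set can be used where a hashable value is required, such as a dictionary key. The `labels` dictionary is invented for illustration.

```python
frozen_set = frozenset([1, 2, 3, 4])

# Mutating methods such as add() do not exist on a frozenset.
try:
    frozen_set.add(5)
except AttributeError as error:
    print("Cannot modify:", error)

# Non-mutating operations still work and return new frozensets.
evens = frozen_set & {2, 4, 6}
print(sorted(evens))                   # [2, 4]

# Because they are hashable, frozensets can be dictionary keys.
labels = {frozenset(["a", "b"]): "pair"}
print(labels[frozenset(["b", "a"])])   # pair
```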

Sets are a valuable tool for dealing with collections of unique elements, and they provide efficient methods for performing set operations. Understanding how to use sets can simplify tasks that involve checking for uniqueness, combining datasets, and performing set arithmetic.

<hr/>

## Functions
Functions are blocks of reusable code that perform a specific task. They are fundamental to organizing and structuring code in a modular way. In Python, functions are defined using the `def` keyword.

### 1. Defining a Function

In [96]:
def greet(name):
    """This function greets the person passed in as a parameter."""
    print(f"Hello, {name}!")

# Calling the function.
greet("John")

Hello, John!


### 2. Function Parameters and Arguments
Functions can take parameters (input values) to perform operations. Parameters are specified in the function definition. Arguments are the actual values passed to the function when it is called.

In [97]:
def add_numbers(a, b):
    """This function adds two numbers."""
    result = a + b
    return result

# Calling the function with arguments.
sum_result = add_numbers(3, 5)
print(sum_result)  # Output: 8

8


### 3. Default Parameter Values
You can provide default values for parameters, making them optional when calling the function:

In [98]:
def greet(name, greeting="Hello"):
    """This function greets a person with a specified greeting."""
    print(f"{greeting}, {name}!")

# Calling the function with and without the second argument.
greet("John")           # Output: Hello, John!
greet("Alice", "Hi")    # Output: Hi, Alice!

Hello, John!
Hi, Alice!
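
Arguments can also be passed by name. The sketch below uses a hypothetical variant, `make_greeting`, which returns the text instead of printing it so that the result can be inspected.

```python
def make_greeting(name, greeting="Hello"):
    """Return a greeting for the given name (hypothetical helper)."""
    return f"{greeting}, {name}!"

# Positional arguments are matched by position.
positional = make_greeting("Alice", "Hi")

# Keyword arguments are matched by name, so their order does not matter.
by_keyword = make_greeting(greeting="Welcome", name="Bob")

print(positional)
print(by_keyword)
```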


### 4. Return Statement
Functions can return values using the `return` statement. The function exits when a `return` statement is encountered:

In [99]:
def square(x):
    """This function returns the square of a number."""
    return x ** 2

result = square(4)
print(result)  # Output: 16

16


### 5. Multiple Return Values
Functions can return multiple values as a tuple:

In [100]:
def get_coordinates():
    """This function returns a tuple of coordinates."""
    x = 3
    y = 5
    return x, y

coordinates = get_coordinates()
print(coordinates)  # Output: (3, 5)


(3, 5)
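
The returned tuple can also be unpacked directly into separate variables, which is often more convenient than indexing. A short sketch:

```python
def get_coordinates():
    """This function returns a tuple of coordinates."""
    x = 3
    y = 5
    return x, y

# Tuple unpacking assigns each returned value to its own name.
x, y = get_coordinates()
print(x)
print(y)
```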


### 6. Variable Scope
Variables defined inside a function have local scope and are not accessible outside the function. However, variables defined outside functions have global scope:

In [101]:
global_variable = 10

def print_global_variable():
    """This function prints the global variable."""
    print(global_variable)

print_global_variable()  # Output: 10

10
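
The other direction is worth seeing too. In the sketch below (all names are illustrative), a local variable is not visible outside its function, and the `global` keyword is needed to rebind a global name from inside a function.

```python
counter = 0  # Global variable.

def show_local():
    message = "I am local"  # Exists only while the function runs.
    return message

def increment():
    global counter  # Rebind the global name instead of creating a local one.
    counter += 1

show_local()
try:
    print(message)
except NameError:
    print("message is not defined outside show_local()")

increment()
print(counter)
```

Without the `global` declaration, `counter += 1` would raise `UnboundLocalError`, because assignment inside a function creates a local name by default.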


### 7. Lambda Functions (Anonymous Functions)
Lambda functions are small, anonymous functions defined using the `lambda` keyword. They are often used for short-term operations:

In [102]:
multiply = lambda x, y: x * y
result = multiply(3, 4)
print(result)  # Output: 12

12
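
In practice, lambdas are most often passed directly to another function rather than assigned to a name. A small sketch with made-up data:

```python
words = ["banana", "kiwi", "apple", "fig"]

# Use a lambda as the sort key: order the words by their length.
by_length = sorted(words, key=lambda word: len(word))
print(by_length)

# Use a lambda with map() to square each number.
squares = list(map(lambda n: n ** 2, [1, 2, 3]))
print(squares)
```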


### 8. Docstrings
Docstrings are used to document functions. They are written in triple quotes as the first statement of the function body, immediately after the `def` line:

In [103]:
def calculate_area(radius):
    """This function calculates the area of a circle."""
    area = 3.14 * radius ** 2
    return area
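
A docstring is stored on the function object and can be read back at runtime. The sketch below uses a hypothetical function, `circle_area`, and takes the value of pi from the standard `math` module rather than the approximation 3.14.

```python
import math

def circle_area(radius):
    """Return the area of a circle with the given radius."""
    return math.pi * radius ** 2

# The docstring is available through the __doc__ attribute.
print(circle_area.__doc__)

# help() prints the signature together with the docstring.
help(circle_area)
```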

### 9. Recursion
A function can call itself, a concept known as recursion:

In [104]:
def factorial(n):
    """This function calculates the factorial of a number."""
    if n == 0 or n == 1:
        return 1
    else:
        return n * factorial(n - 1)
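
To see the recursion at work, the function can be called with a small number. The definition is repeated here so that the sketch runs on its own.

```python
def factorial(n):
    """This function calculates the factorial of a number."""
    if n == 0 or n == 1:
        return 1  # Base case: stops the recursion.
    return n * factorial(n - 1)  # Recursive case: a smaller subproblem.

# factorial(5) expands to 5 * 4 * 3 * 2 * 1.
result = factorial(5)
print(result)
```

Every recursive function needs a base case that is eventually reached. As written, a negative argument would never reach it, and Python would stop with a `RecursionError`.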

Functions are crucial for building modular and reusable code. They enhance code readability, simplify maintenance, and promote code organization. Understanding how to define, call, and work with functions is essential for effective Python programming.

<hr/>

## Exception Handling
Exception handling allows you to gracefully manage errors that might occur during the execution of your program. In Python, exceptions are raised when an error occurs, and you can use `try`, `except`, `else`, and `finally` blocks to handle them.

### 1. `try`-`except` Blocks
Use the `try` block to enclose the code that might raise an exception. The `except` block specifies how to handle the exception:

In [105]:
try:
    # Code that might raise an exception.
    result = 10 / 0
except ZeroDivisionError:
    print("Cannot divide by zero!")

Cannot divide by zero!


### 2. Handling Multiple Exceptions
You can handle multiple exceptions by providing multiple `except` blocks or a tuple of exception types:

In [106]:
try:
    # Code that might raise an exception.
    result = int("abc")
except ValueError:
    print("Invalid conversion to integer.")
except ZeroDivisionError:
    print("Cannot divide by zero!")

Invalid conversion to integer.


### 3. `else` Block
The `else` block contains code that will be executed if no exceptions are raised in the `try` block:

In [107]:
try:
    result = 10 / 2
except ZeroDivisionError:
    print("Cannot divide by zero!")
else:
    print("Division successful. Result:", result)

Division successful. Result: 5.0


### 4. `finally` Block
The `finally` block contains code that will be executed regardless of whether an exception was raised or not. It is often used for cleanup operations:

In [108]:
try:
    # Code that might raise an exception.
    result = 10 / 0
except ZeroDivisionError:
    print("Cannot divide by zero!")
finally:
    print("This code always executes.")

Cannot divide by zero!
This code always executes.


### 5. Raising Exceptions
You can manually raise exceptions using the `raise` statement. This is useful when you want to indicate an error condition in your code:

In [109]:
age = 10
# TODO: Uncomment the line below.
#age = -5

if age < 0:
    raise ValueError("Age cannot be negative.")
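
A common pattern is to raise the exception inside a function and let the caller decide how to handle it. The following is a minimal sketch; `validate_age` is a hypothetical helper.

```python
def validate_age(age):
    """Return age unchanged, or raise ValueError if it is negative."""
    if age < 0:
        raise ValueError("Age cannot be negative.")
    return age

message = None
try:
    validate_age(-5)
except ValueError as error:
    # str(error) gives the text passed to the exception.
    message = str(error)
    print(f"Validation failed: {message}")
```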

### 6. Custom Exceptions
You can create your own exception classes by inheriting from the `Exception` class. This allows you to define specific types of exceptions for your application:

In [110]:
class CustomError(Exception):
    pass

try:
    raise CustomError("This is a custom exception.")
except CustomError as e:
    print(f"Caught an exception: {e}")

Caught an exception: This is a custom exception.


### 7. Handling Multiple Exceptions in One Block
You can handle multiple exceptions in a single `except` block by using parentheses:

In [111]:
try:
    # Code that might raise an exception.
    result = int("abc") / 0
except (ValueError, ZeroDivisionError) as e:
    print(f"An error occurred: {e}")

An error occurred: invalid literal for int() with base 10: 'abc'


Exception handling is crucial for writing robust and error-tolerant code. It allows your program to respond gracefully to unexpected situations and provides a mechanism to communicate errors to the user or log them for later analysis.

<hr/>

## Regular Expressions
Regular expressions, often referred to as regex or regexp, are powerful tools for pattern matching and text manipulation. Python provides the `re` module, which allows you to work with regular expressions.

### 1. Importing the `re` Module

In [112]:
import re

### 2. Basic Patterns
* __Literal Characters__: Match literal characters in a string.

In [113]:
pattern = re.compile(r"hello")

* __Character Classes (`[...]`)__: Match any character within the brackets.

In [114]:
pattern = re.compile(r"[aeiou]")

* __Dot (`.`)__: Match any character except a newline.

In [115]:
pattern = re.compile(r"gr.y")

### 3. Quantifiers
`*`: Match zero or more occurrences.

In [116]:
pattern = re.compile(r"ab*c")

`+`: Match one or more occurrences.

In [117]:
pattern = re.compile(r"ab+c")

`?`: Match zero or one occurrence.

In [118]:
pattern = re.compile(r"ab?c")

`{n}`: Match exactly n occurrences.

In [119]:
pattern = re.compile(r"ab{2}c")

`{n,}`: Match n or more occurrences.

In [120]:
pattern = re.compile(r"ab{2,}c")

`{n,m}`: Match between n and m occurrences.

In [121]:
pattern = re.compile(r"ab{2,4}c")
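
To see how the quantifiers above differ, one option is to test each pattern against the same few strings. The sketch below uses `re.fullmatch()`, which succeeds only when the whole string matches; the `candidates` list is made up for illustration.

```python
import re

candidates = ["ac", "abc", "abbc", "abbbbbc"]
patterns = [r"ab*c", r"ab+c", r"ab?c", r"ab{2}c", r"ab{2,}c", r"ab{2,4}c"]

# For each pattern, collect the candidate strings it matches completely.
results = {}
for pattern in patterns:
    results[pattern] = [text for text in candidates if re.fullmatch(pattern, text)]
    print(f"{pattern:<10} -> {results[pattern]}")
```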

### 4. Anchors and Boundaries
`^`: Anchors the regex at the start of the string.

In [122]:
pattern = re.compile(r"^start")

`$`: Anchors the regex at the end of the string.

In [123]:
pattern = re.compile(r"end$")

`\b`: Matches a word boundary.

In [124]:
pattern = re.compile(r"\bword\b")

### 5. Character Classes and Escape Sequences
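`\d`: Matches any digit (`[0-9]`; for `str` patterns this also includes other Unicode decimal digits unless the `re.ASCII` flag is set).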

In [125]:
pattern = re.compile(r"\d+")

`\D`: Matches any non-digit.

`\w`: Matches any word character (alphanumeric + underscore).

`\W`: Matches any non-word character.

`\s`: Matches any whitespace character.

`\S`: Matches any non-whitespace character.
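
The escape sequences above can be tried out on a short string. This is a small sketch with a made-up sentence:

```python
import re

text = "Order 42 shipped on 2023-11-20 to user_7."

digits = re.findall(r"\d+", text)   # Runs of digits.
words = re.findall(r"\w+", text)    # Runs of word characters.
tokens = re.split(r"\s+", text)     # Split on runs of whitespace.

print(digits)
print(words)
print(tokens)
```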

### 6. Groups and Capturing
__Groups (`(...)`)__: Create a group.

In [126]:
pattern = re.compile(r"(\d+)-(\w+)")

__Capturing Groups__: Extract matched groups using `group()` or `groups()`.

In [127]:
match = pattern.match("123-abc")
print(match.group(1))  # Output: 123
print(match.group(2))  # Output: abc

123
abc


### 7. Flags
The `re` module supports flags to modify the behavior of regular expressions. Common flags include:

`re.IGNORECASE` or `re.I`: Perform case-insensitive matching.

`re.MULTILINE` or `re.M`: Allow `^` and `$` to match the start/end of each line.

`re.DOTALL` or `re.S`: Allow `.` to match any character, including newline.
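
The effect of each flag can be checked on a small multi-line string. The sketch below uses made-up text; flags are combined with the `|` operator.

```python
import re

text = "Python\npython\nPYTHON"

# Without flags, matching is case-sensitive.
case_sensitive = re.findall(r"python", text)
ignore_case = re.findall(r"python", text, re.IGNORECASE)

# Without re.M, ^ and $ refer to the whole string, so nothing matches.
whole_string = re.findall(r"^p\w+$", text, re.I)
each_line = re.findall(r"^p\w+$", text, re.I | re.M)

# Without re.DOTALL, '.' does not match the newline between the lines.
dot_default = re.search(r"Python.python", text)
dot_all = re.search(r"Python.python", text, re.DOTALL)

print(case_sensitive, ignore_case)
print(whole_string, each_line)
print(dot_default is None, dot_all is not None)
```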

### 8. Using `re` Functions
`re.match(pattern, string, flags=0)`: Matches the pattern at the beginning of the string.

`re.search(pattern, string, flags=0)`: Searches the entire string for a match.

`re.findall(pattern, string, flags=0)`: Returns a list of all non-overlapping matches.

`re.finditer(pattern, string, flags=0)`: Returns an iterator of match objects.

`re.sub(pattern, replacement, string, count=0, flags=0)`: Replaces occurrences of the pattern with the replacement string.
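
The functions listed above can be compared on the same input. A short sketch with a made-up string:

```python
import re

text = "cat bat rat"

# match() only looks at the start of the string, so "bat" is not found.
at_start = re.match(r"bat", text)

# search() scans the whole string and returns the first match.
first_span = re.search(r"bat", text).span()

# findall() returns the matched strings; finditer() yields match objects.
all_matches = re.findall(r"[cbr]at", text)
positions = [m.start() for m in re.finditer(r"[cbr]at", text)]

# sub() replaces matches; count limits the number of replacements.
replaced = re.sub(r"[cbr]at", "hat", text, count=2)

print(at_start, first_span)
print(all_matches, positions)
print(replaced)
```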

### 9. Example

In [128]:
import re

# Matching an email address.
email_pattern = re.compile(r"\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b")

text = "Contact us at info@example.com for more information."

match = email_pattern.search(text)
if match:
    print("Email found:", match.group())
else:
    print("No email found.")

Email found: info@example.com


Regular expressions are a powerful tool for string manipulation and pattern matching. Understanding their syntax and capabilities is valuable for tasks such as data validation, text parsing, and data extraction.

<hr/>

## File Handling
File handling in Python allows you to work with files, read from them, and write to them. Python provides built-in functions and methods to perform various operations on files.

### 1. Opening and Closing Files
Use the `open()` function to open a file. Its two most commonly used arguments are the file path and the mode (read, write, append, etc.). It returns a file object.

In [129]:
# Opening a file for reading.
file_path = "read.txt"
file = open(file_path, "r")

# Closing the file.
file.close()

### 2. Reading from a File
There are different methods to read from a file:

`read()`: Reads the entire content of the file.

In [130]:
file = open("read.txt", "r")
content = file.read()
print(content)
file.close()

With the ever-increasing demand for machine learning and programming professionals, it's prime time to invest in the field. 
This book will help you in this endeavor, focusing specifically on text data and human language by steering a middle path among the various textbooks that present complicated theoretical concepts or focus disproportionately on Python code.
A good metaphor this work builds upon is the relationship between an experienced craftsperson and their trainee.
Based on the current problem, the former picks a tool from the toolbox, explains its utility, and puts it into action. 
This approach will help you to identify at least one practical use for each method or technique presented. 
The content unfolds in ten chapters, each discussing one specific case study. 
For this reason, the book is solution-oriented.
It's accompanied by Python code in the form of Jupyter notebooks to help you obtain hands-on experience.
A recurring pattern in the chapters of this book is helping yo

`readline()`: Reads a single line from the file.

In [131]:
file = open("read.txt", "r")
line = file.readline()
print(line)
file.close()

With the ever-increasing demand for machine learning and programming professionals, it's prime time to invest in the field. 



`readlines()`: Reads all lines from the file and returns them as a list.

In [132]:
file = open("read.txt", "r")
lines = file.readlines()
print(lines)
file.close()

["With the ever-increasing demand for machine learning and programming professionals, it's prime time to invest in the field. \n", 'This book will help you in this endeavor, focusing specifically on text data and human language by steering a middle path among the various textbooks that present complicated theoretical concepts or focus disproportionately on Python code.\n', 'A good metaphor this work builds upon is the relationship between an experienced craftsperson and their trainee.\n', 'Based on the current problem, the former picks a tool from the toolbox, explains its utility, and puts it into action. \n', 'This approach will help you to identify at least one practical use for each method or technique presented. \n', 'The content unfolds in ten chapters, each discussing one specific case study. \n', 'For this reason, the book is solution-oriented.\n', "It's accompanied by Python code in the form of Jupyter notebooks to help you obtain hands-on experience.\n", 'A recurring pattern 

### 3. Writing to a File
Open the file in write mode (`"w"`) and use the `write()` method to write to it. If the file does not exist, it will be created. If it already exists, its content will be overwritten.

In [133]:
file = open("write.txt", "w")
file.write("Hello, this is a sample file.")
file.close()

### 4. Appending to a File
To add content to an existing file without overwriting the existing content, open the file in append mode (`"a"`).

In [134]:
file = open("write.txt", "a")
file.write("\nThis is additional content.")
file.close()
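
To check the combined effect of write and append mode, the file can be read back afterwards. The sketch below works in a temporary directory (an assumption made here so that no files are left behind) and uses the `with` statement to close each file automatically.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, "write.txt")

    # "w" creates the file (or truncates an existing one).
    with open(path, "w") as file:
        file.write("Hello, this is a sample file.")

    # "a" keeps the existing content and writes at the end.
    with open(path, "a") as file:
        file.write("\nThis is additional content.")

    with open(path, "r") as file:
        lines = file.read().splitlines()

print(lines)
```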

### 5. Using the `with` Statement (Context Manager)
The `with` statement ensures that the file is properly closed after the operations are performed. It is recommended for file handling.

In [135]:
with open("read.txt", "r") as file:
    content = file.read()
    print(content)
# File is automatically closed after exiting the 'with' block.

With the ever-increasing demand for machine learning and programming professionals, it's prime time to invest in the field. 
This book will help you in this endeavor, focusing specifically on text data and human language by steering a middle path among the various textbooks that present complicated theoretical concepts or focus disproportionately on Python code.
A good metaphor this work builds upon is the relationship between an experienced craftsperson and their trainee.
Based on the current problem, the former picks a tool from the toolbox, explains its utility, and puts it into action. 
This approach will help you to identify at least one practical use for each method or technique presented. 
The content unfolds in ten chapters, each discussing one specific case study. 
For this reason, the book is solution-oriented.
It's accompanied by Python code in the form of Jupyter notebooks to help you obtain hands-on experience.
A recurring pattern in the chapters of this book is helping yo

### 6. File Modes
`"r"`: Read (default mode).

`"w"`: Write (creates a new file or truncates an existing file).

`"a"`: Append (opens a file for writing, but content is appended to the end).

`"b"`: Binary mode (e.g., `"rb"`, `"wb"`, `"ab"`).

`"x"`: Exclusive creation (creates a new file but fails if the file already exists).
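
Exclusive creation mode is the least familiar of these, so here is a minimal sketch of how it behaves. The file name is hypothetical, and a temporary directory is used so that the example cleans up after itself.

```python
import os
import tempfile

already_exists = False

with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, "new_file.txt")

    # The first open succeeds because the file does not exist yet.
    with open(path, "x") as file:
        file.write("created once")

    # The second open fails instead of overwriting the file.
    try:
        with open(path, "x") as file:
            file.write("never written")
    except FileExistsError:
        already_exists = True
        print("The file already exists; nothing was overwritten.")

    with open(path, "r") as file:
        content = file.read()

print(content)
```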

### 7. Working with Binary Files
To work with binary files, use modes like `"rb"` (read binary) or `"wb"` (write binary).

In [136]:
with open("binary_file.bin", "wb") as bin_file:
    bin_file.write(b"Binary data")

### 8. Exception Handling for File Operations
It's good practice to handle exceptions, especially when working with files.

In [137]:
try:
    with open("read.txt", "r") as file:
        content = file.read()
        print(content)
except FileNotFoundError:
    print("File not found.")
except Exception as e:
    print(f"An error occurred: {e}")

With the ever-increasing demand for machine learning and programming professionals, it's prime time to invest in the field. 
This book will help you in this endeavor, focusing specifically on text data and human language by steering a middle path among the various textbooks that present complicated theoretical concepts or focus disproportionately on Python code.
A good metaphor this work builds upon is the relationship between an experienced craftsperson and their trainee.
Based on the current problem, the former picks a tool from the toolbox, explains its utility, and puts it into action. 
This approach will help you to identify at least one practical use for each method or technique presented. 
The content unfolds in ten chapters, each discussing one specific case study. 
For this reason, the book is solution-oriented.
It's accompanied by Python code in the form of Jupyter notebooks to help you obtain hands-on experience.
A recurring pattern in the chapters of this book is helping yo

File handling is a fundamental aspect of programming, enabling you to interact with external data. Understanding how to read, write, and manipulate files is essential for various applications, including data processing and file-based storage.

<hr/>

## User Input
User input is a crucial aspect of interactive programming, allowing users to provide data to a program during its execution. In Python, you can use the `input()` function to receive user input.

### 1. Basic User Input
Use the `input()` function to get input from the user. The input is always returned as a string.

In [138]:
# Basic input.
user_name = input("Enter your name: ")
print("Hello, " + user_name + "!")

Hello, Nikos!


### 2. Converting Input to Other Data Types
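If you need the user input as a different data type, you can use type conversion functions like `int()` or `float()`. Be careful with `bool()`: any non-empty string, including `"False"`, converts to `True`, so compare the text explicitly instead.

In [139]:
# Converting input to integer.
user_age = int(input("Enter your age: "))

# Converting input to float.
user_height = float(input("Enter your height (in meters): "))

# Converting input to boolean. bool() would return True for any non-empty
# string, so compare the normalized text instead.
is_student = input("Are you a student? (True/False): ").strip().lower() == "true"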
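
Note that `bool()` applied to a string only checks whether the string is empty. For yes/no answers, a small parsing function is one possible approach; `parse_bool` below is a hypothetical helper, and the accepted spellings are an assumption.

```python
def parse_bool(text):
    """Interpret common yes/no answers as a boolean (hypothetical helper)."""
    normalized = text.strip().lower()
    if normalized in {"true", "yes", "y", "1"}:
        return True
    if normalized in {"false", "no", "n", "0"}:
        return False
    raise ValueError(f"Cannot interpret {text!r} as a boolean.")

print(bool("False"))        # Any non-empty string is truthy.
print(parse_bool("False"))  # The helper looks at the actual text.
```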

### 3. Handling User Input Errors
It's important to handle potential errors when working with user input, especially when converting to other data types.

In [140]:
# Handling ValueError for invalid integer input.
try:
    user_age = int(input("Enter your age: "))
except ValueError:
    print("Invalid input. Please enter a valid integer.")
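
Often you want to ask again until the input is valid. The sketch below wraps this in a hypothetical helper, `ask_for_int`; its `read` parameter defaults to `input()` and is replaced by simulated replies here so that the example runs unattended.

```python
def ask_for_int(prompt, read=input):
    """Keep asking until the reply parses as an integer (hypothetical helper)."""
    while True:
        reply = read(prompt)
        try:
            return int(reply)
        except ValueError:
            print(f"{reply!r} is not a valid integer. Please try again.")

# Simulated replies stand in for a real user.
replies = iter(["abc", "4.5", "42"])
age = ask_for_int("Enter your age: ", read=lambda prompt: next(replies))
print(age)
```

In an interactive session, calling `ask_for_int("Enter your age: ")` without the `read` argument would prompt the user as usual.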

### 4. Using `eval()` Function
The `eval()` function evaluates the user's input as a Python expression. Because it executes whatever code the user types, it is a security risk and should never be used on untrusted input.

In [141]:
user_input = eval(input("Enter an expression: "))
print("Result:", user_input)

Result: 1
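
When only literal values such as numbers, strings, lists, or dictionaries are expected, `ast.literal_eval()` from the standard library is a safer alternative, since it refuses anything that is not a literal. A minimal sketch, using fixed strings in place of real user input:

```python
import ast

# A literal is parsed into the corresponding Python object.
parsed = ast.literal_eval("[1, 2, 3]")
print(parsed)

# Anything that is not a literal, such as a function call, is rejected.
rejected = False
try:
    ast.literal_eval("__import__('os').getcwd()")
except (ValueError, SyntaxError):
    rejected = True
    print("Rejected: not a literal value.")
```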


### 5. Prompting for Multiple Inputs
You can prompt for multiple inputs in a single line by using the `split()` method.

In [142]:
# Prompting for multiple inputs.
user_inputs = input("Enter two numbers separated by a space: ").split()
num1, num2 = map(float, user_inputs)
print("Sum:", num1 + num2)

Sum: 3.0


### 6. Using `strip()` to Remove Whitespace
When taking user input, it's a good practice to use `strip()` to remove leading and trailing whitespace.

In [143]:
user_input = input("Enter something: ").strip()
print("You entered:", user_input)

You entered: Hi


### 7. Interactive User Input Loop
You can create an interactive loop to continuously receive input until a specific condition is met.

In [144]:
while True:
    user_input = input("Enter something (type 'exit' to quit): ")
    
    if user_input.lower() == 'exit':
        break
    
    print("You entered:", user_input)

You entered: Hello!
You entered: How are you?
You entered: Bye....


User input is a dynamic way to interact with your programs. It enables customization, parameterization, and real-time interaction, making your applications more versatile and user-friendly. However, when processing user input, always consider input validation and error handling to ensure robustness.

<hr/>

## What we have learned …

| | | | | |
| --- | --- | --- | --- | --- |
| **Variables** | **Built-in Data Types** | **Printing with Built-in Types** | **Operators** | **Type Casting** |
| **String Manipulation** | **List Manipulation** | **Conditional Statements** | **Loops in Python** | **Tuples** |
| **Dictionaries** | **Sets** | **Functions** | **Exception Handling** | **Regular Expressions** |
| **File Handling** | **User Input** | | | |

## Author Information

- **Author:** Nikos Tsourakis
- **Email:** nikos@tsourakis.net
- **Website:** [tsourakis.net](https://tsourakis.net)
- **Date:** November 20, 2023