# Functions Overview

*Ben Shaver (DC), Douglas Strodtman (SaMo)*

Here is the structure of a typical Python function:

```python
def function_name(argument1, argument2, ...):
    # Function body
    <do stuff to the arguments>
    print('something') # Optional
    return <something> # Optional
```
    
`def` is the keyword that tells Python we're trying to define a function. `if` and `for` are other keywords.

`function_name` is just the name of the function. Usually we assign an object to a variable name with the assignment operator `=`, like so: 
```python
x = something
```
But when we define a function, we don't need the assignment operator: the `def` statement itself binds the function to `function_name`.

The function arguments (also called parameters) are given inside the parentheses. The function user will provide these, and our function will operate on them according to the code in the function body. Note that the function body is indented by four spaces (or a tab in Jupyter Notebooks). This is similar to the indentation we see in `if`/`else` statements or `for` loops.

The `return` keyword sends a value back to the line where the function was called. It is optional, since your function may do useful things without actually returning anything, e.g. printing something or plotting a curve; in that case the function returns `None`. Executing `return` inside a function body **always** ends the execution of the function, so you may sometimes return nothing just to exit a `for` loop inside a function.
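
As a minimal sketch of exiting a loop with a bare `return`, consider the hypothetical helper below (the name `log_until_negative` and its `log` parameter are made up for illustration). It stops at the first negative number, and because nothing follows `return`, the result is `None`.

```python
def log_until_negative(numbers, log):
    """Append numbers to `log` until a negative number shows up."""
    for number in numbers:
        if number < 0:
            # Bare `return`: leaves the loop and the function at once
            return
        log.append(number)


seen = []
result = log_until_negative([3, 1, -4, 5], seen)

print(seen)    # [3, 1]
print(result)  # None
```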

In [1]:
def add_squares(a, b):
    a_sq = a ** 2
    b_sq = b ** 2
    return a_sq + b_sq

add_squares(2,3)

13

Note that you may `print` something inside a function, but what you print is **not** returned. 

In [2]:
def add_squares(a, b):
    a_sq = a ** 2
    b_sq = b ** 2
    print(str(a) + ' ** 2 + ' + str(b) + ' ** 2 = ')
    print(a_sq + b_sq)

add_squares(2,3)

2 ** 2 + 3 ** 2 = 
13
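
To make the difference concrete, here is a small sketch that captures the result of a printing function and of a returning function. The two function names are hypothetical variants of `add_squares`, introduced only for this comparison.

```python
def add_squares_printed(a, b):
    # Prints the sum but has no `return` statement
    print(a ** 2 + b ** 2)


def add_squares_returned(a, b):
    # Returns the sum without printing anything
    return a ** 2 + b ** 2


printed = add_squares_printed(2, 3)    # displays 13
returned = add_squares_returned(2, 3)  # displays nothing

print(printed)       # None
print(returned + 1)  # 14
```

Only the returned value can be stored and reused in later calculations.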


You can't do anything, **including print**, after you've `return`ed.

In [3]:
def add_squares(a, b):
    a_sq = a ** 2
    b_sq = b ** 2
    return a_sq + b_sq
    print('Did you expect this to print?')

add_squares(2,3)

13

The function above works as usual, but note that intermediate variables such as `a_sq` are not accessible outside the scope of the function:

In [4]:
print(a_sq)

NameError: name 'a_sq' is not defined

Most of the remaining complexity when it comes to functions has to do with the behavior of arguments.

### Default arguments:

In [5]:
def count_up(x, increment=1):
    return x + increment

count_up(1) # increment is optional

2

If `increment` is given without a keyword, Python performs _positional matching_, pairing the supplied values with the argument names in order:

In [6]:
count_up(1,1)

2

We may also want to be explicit:

In [7]:
count_up(x=1, increment=2)

3

In [8]:
count_up(increment=2, x=1)

3

In [9]:
count_up(x=1, increment=2) == count_up(increment=2, x=1)

True

But we can't pass a positional argument after a keyword argument:


In [10]:
count_up(increment=2, 1)

SyntaxError: positional argument follows keyword argument (<ipython-input-10-2f7cc1d828c1>, line 1)
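
The reverse order is allowed. A short sketch, reusing the `count_up` definition from above, with the positional argument first and the keyword argument after it:

```python
def count_up(x, increment=1):
    return x + increment


# Positional arguments first, keyword arguments after
mixed = count_up(1, increment=2)
print(mixed)  # 3
```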

### Conditional Arguments

We can give an argument a boolean default and then use it internally to switch the behavior of our functions.

In [11]:
def square_and_maybe_sum(num_list, add=True):
    if add:
        return sum([num ** 2 for num in num_list])
    print('Numbers squared but not added')
    return [num ** 2 for num in num_list]

**NOTE**: In this function we return a scalar when `add=True` and a list when `add=False`. **This is generally a bad idea, but Python won't stop you from making this mistake.**

In [12]:
square_and_maybe_sum([2, 3, 5], add=True)

38

In [13]:
square_and_maybe_sum([2, 3, 5], add=False)

Numbers squared but not added


[4, 9, 25]
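
One possible way to avoid the mixed return types noted above is to split the work into two functions, each with a single return type. This is only a sketch; the names `square_all` and `sum_of_squares` are hypothetical.

```python
def square_all(num_list):
    """Always returns a list of squares."""
    return [num ** 2 for num in num_list]


def sum_of_squares(num_list):
    """Always returns a single number."""
    return sum(square_all(num_list))


print(square_all([2, 3, 5]))      # [4, 9, 25]
print(sum_of_squares([2, 3, 5]))  # 38
```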

You can also take advantage of default arguments directly in your logic.

In [14]:
def sum_exponentiated_list(num_list, exp=None):
    if exp:
        return sum([num ** exp for num in num_list])
    elif exp == 0:
        return len(num_list)
    return sum(num_list)

In [15]:
sum_exponentiated_list([2,3,5])

10

In [16]:
sum_exponentiated_list([2,3,5], 2)

38

In [17]:
sum_exponentiated_list([2,3,5], -3)

0.17003703703703704

This function takes advantage of truthiness: `None` evaluates to `False` in boolean logic, while any nonzero number we pass in its place evaluates to `True`. `0` also evaluates to `False`, hence the `elif` statement.

In [18]:
sum_exponentiated_list([2,3,5], 0)

3
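
An alternative sketch, not the version used above, tests for `None` explicitly with `is None`. Under that assumption the special case for `0` is no longer needed, because `num ** 0` is `1` for every number in the list.

```python
def sum_exponentiated_list(num_list, exp=None):
    if exp is None:
        # No exponent given: plain sum
        return sum(num_list)
    # Any number, including 0, takes this branch
    return sum([num ** exp for num in num_list])


print(sum_exponentiated_list([2, 3, 5]))     # 10
print(sum_exponentiated_list([2, 3, 5], 2))  # 38
print(sum_exponentiated_list([2, 3, 5], 0))  # 3
```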

## Lambda Functions

These are not as difficult as you may think. They're just Python's way of creating quick little 'anonymous' functions for one-off use.

In [19]:
count_up = lambda x: x + 1

count_up(1)

2

Lambda functions most often take one input, although they can take more:

In [20]:
add_up = lambda x, y: x + y

add_up(1, 2)

3

_Apparently_, `lambda` functions can have no arguments:

In [21]:
test = lambda : print('foo')
bar = test()

foo


Take note, however: lambda functions are supposed to be anonymous! So you wouldn't normally assign them to a variable name in order to keep them around. Consider this case:

In [22]:
x = [x for x in range(5)] # Simple list comprehension
x

[0, 1, 2, 3, 4]

What if I want to compute $x^2 + 1$ for each value?

In [23]:
mapped_lambda = map(lambda x: x ** 2 + 1, range(1000000))

In [24]:
count = 0
for result in mapped_lambda:
    print(result)
    count += 1
    if count == 10:
        break

1
2
5
10
17
26
37
50
65
82


**We'll see a lot of `lambda` statements when we get into `map` and `apply` with Pandas.** It's not essential that you understand exactly how the built-in `map` works here.
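
For the curious, here is a small sketch of what the built-in `map` does with a lambda. It uses a short `range(5)` rather than the million-element range above, and the variable names are illustrative. In Python 3, `map` returns a lazy iterator, so values are only computed when something consumes them, such as `list` or a `for` loop.

```python
lazy_result = map(lambda x: x ** 2 + 1, range(5))

# Consuming the iterator computes the values
first_pass = list(lazy_result)
print(first_pass)  # [1, 2, 5, 10, 17]

# The iterator is now exhausted, so a second pass yields nothing
second_pass = list(lazy_result)
print(second_pass)  # []

# A list comprehension gives the same values without a lambda
comprehension = [x ** 2 + 1 for x in range(5)]
print(comprehension == first_pass)  # True
```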

### \*args and \*\*kwargs

Have you run into `*args` and `**kwargs`?

Neither of these is especially pythonic, and it's generally best to avoid them when defining your own functions. You may run into them in some of the packages we use, though.

Giving `*args` to your function allows it to have an arbitrary number of arguments.

In [25]:
def add_squares(*args):
    return sum([arg ** 2 for arg in args])

add_squares(2, 3, 5)

38

What type of thing is `args` inside the function?

Without going too deep into *why*, we can check the type and see that it's a tuple.

In [26]:
def print_args_type(*args):
    print(type(args))
print_args_type(1,2,3)

<class 'tuple'>


**Let's look at why this is bad**

We'll adapt the function we wrote above.

In [27]:
def sum_exponentiated_nums(exp=None, *args):
    if exp:
        return sum([num ** exp for num in args])
    elif exp == 0:
        return len(args)
    return sum(args)

What do we expect to get below?

In [28]:
sum_exponentiated_nums(2, 3, 5)

34

The function used our **first argument** as the `exp` and then captured the remainder of our arguments in `*args`. To get our desired functionality, we have to pass the exponent explicitly as the first argument:

In [29]:
sum_exponentiated_nums(2, 2, 3, 5)

38

**But it's horribly confusing.** Please, don't do this.
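
If you do end up with `*args`, one way to reduce the confusion is to put it first, which makes `exp` a keyword-only argument. This is a sketch of an alternative signature, not the function defined above.

```python
def sum_exponentiated_nums(*args, exp=None):
    if exp is None:
        # No exponent given: plain sum of all positional arguments
        return sum(args)
    return sum([num ** exp for num in args])


print(sum_exponentiated_nums(2, 3, 5))         # 10
print(sum_exponentiated_nums(2, 3, 5, exp=2))  # 38
```

Because every positional value now lands in `args`, the exponent can only be set by name, so the call reads unambiguously.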

`**kwargs` are in some ways worse. You'll run into them more than you want to in `matplotlib` and some other packages. **Unless you're building out an extremely complicated class or module with a lot of moving parts, you shouldn't use these** (and even then, you're better off using default arguments).

In [30]:
def print_value_and_kwarg(**kwargs):
    for k in kwargs:
        print(k, kwargs[k])

In [31]:
print_value_and_kwarg(arg1=2, arg2=3, arg3=4, arg4=5)

arg1 2
arg2 3
arg3 4
arg4 5


In [32]:
print_value_and_kwarg(arg3=4, arg4=5, arg1=2, arg2=3)

arg3 4
arg4 5
arg1 2
arg2 3


In [33]:
def print_kwargs_type(**kwargs):
    print(type(kwargs))
print_kwargs_type(x=1, y=2)

<class 'dict'>


Python processes `**kwargs` into a dictionary, which can then be used for setting conditional logic within functions. **Where used, this is generally a relic of code ported from other languages (or for backwards compatibility).** Be explicit instead.
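
As a rough illustration of why explicit arguments are safer, the sketch below compares two hypothetical functions, `scale_loose` and `scale_explicit`. With `**kwargs`, a misspelled keyword is silently ignored; with an explicit default argument, Python raises a `TypeError`.

```python
def scale_loose(value, **kwargs):
    # Unknown or misspelled keys are silently ignored
    factor = kwargs.get('factor', 1)
    return value * factor


def scale_explicit(value, factor=1):
    return value * factor


print(scale_loose(10, factr=3))      # 10: the typo goes unnoticed
print(scale_explicit(10, factor=3))  # 30

typo_caught = False
try:
    scale_explicit(10, factr=3)  # misspelled keyword
except TypeError as error:
    typo_caught = True
    print(error)
```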