In [83]:
#include <iostream>
#include <string>
#include <stdlib.h>

using namespace std;

# Functions

In C++, a function is a group of statements that is given a name, and which can be called from some point of the program. The most common syntax to define a function is:

    type name ( parameter1, parameter2, ...) { statements }

Where:
- type is the type of the value returned by the function.
- name is the identifier by which the function can be called.
- parameters (as many as needed): Each parameter consists of a type followed by an identifier, with each parameter being separated from the next by a comma. Each parameter looks very much like a regular variable declaration (for example: int x), and in fact acts within the function as a regular variable which is local to the function. The purpose of parameters is to allow passing arguments to the function from the location where it is called.
- statements is the function's body. It is a block of statements surrounded by braces { } that specify what the function actually does.

Let's have a look at an example:

In [21]:
int addition (int a, int b){
  return a+b;
}

In [22]:
int z = addition (5,3);
cout << "The result is " << z;

The result is 8

![function_argumests_value](../static/img/function_arguments.png)

In a complete program, this code would be divided into two functions: addition and main. Remember that no matter the order in which they are defined, a C++ program always starts by calling main. In fact, main is the only function called automatically, and the code in any other function is executed only if that function is called from main (directly or indirectly).

A function call is an expression that evaluates to the return value, if any, of the function:

![function_return_value](../static/img/function_return_value.png)

In [24]:
int a=5, b=3;
z = addition(a, b) + 2; //same as 8+2
cout << z;

10
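
Because a call evaluates to the function's return value, it can appear wherever an expression of that type is allowed, including as an argument to another call. The following is a minimal sketch of that idea; `sum_of_three` is a hypothetical helper that does not appear in the examples above:

```cpp
// addition is repeated here so the snippet is self-contained.
int addition(int a, int b)
{
    return a + b;
}

// Hypothetical helper: the inner call is evaluated first, and its
// return value becomes the first argument of the outer call.
int sum_of_three(int a, int b, int c)
{
    return addition(addition(a, b), c);
}
```

With these definitions, `sum_of_three(1, 2, 3)` first evaluates `addition(1, 2)` to 3 and then `addition(3, 3)` to 6.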

## Functions with no type. The use of void
The syntax shown above for functions:

    type name ( parameter1, parameter2, ...) { statements }

requires the declaration to begin with a type. This is the type of the value returned by the function. But what if the function does not need to return a value? In this case, the type to be used is `void`, which is a special type to represent the absence of value. For example, a function that simply prints a message may not need to return any value:

In [25]:
void printmessage ()
{
  cout << "I'm a function!";
}

void can also be used in the function's parameter list to explicitly specify that the function takes no actual parameters when called. For example, printmessage could have been declared as:

In [27]:
void printmessage (void)
{
  cout << "I'm a function!";
}

In C++, an empty parameter list can be used instead of void with the same meaning. The use of void in the parameter list was popularized by the C language, where it has traditionally been needed to declare that a function takes no parameters.
The parentheses that follow the function name are never optional, neither in the declaration nor in a call. Even when the function takes no parameters, at least an empty pair of parentheses must be appended to the function name.

In [28]:
printmessage();

I'm a function!

The parentheses are what differentiate functions from other kinds of declarations or statements. The following would not call the function:

In [31]:
printmessage;

## The return value of main

You may have noticed that the return type of main is int, but most examples in this and earlier chapters did not actually return any value from main.
Well, there is a catch: if the execution of main ends normally without encountering a return statement, the compiler assumes the function ends with an implicit return statement:

```cpp
return 0; 
```

Note that this applies only to main, for historical reasons. Every other function with a non-void return type should end with a proper return statement that includes a return value, even if the value is never used; flowing off the end of such a function without returning a value is undefined behavior.
When main returns zero (either implicitly or explicitly), the environment interprets this as meaning that the program ended successfully. Other values may be returned by main, and some environments make that value available to the caller in some way, although this behavior is not required and is not necessarily portable between platforms.
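
The rule that functions other than main need an explicit return value on every path can be illustrated with a small sketch. The function `sign` below is a hypothetical example, not part of the text above; each branch ends in a return statement, so execution never reaches the closing brace without a value:

```cpp
// Hypothetical example: returns -1, 0, or 1 depending on the sign of n.
// Every path through the function ends in a return statement.
int sign(int n)
{
    if (n < 0)
        return -1;
    if (n > 0)
        return 1;
    return 0;  // reached only when n == 0
}
```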

## Arguments passed by value and by reference

In the functions seen earlier, arguments have always been passed _by value_. This means that, when calling a function, what is passed to the function are the values of these arguments at the moment of the call, which are copied into the variables represented by the function parameters. For example, take:

In [32]:
int x=5, y=3, z;
z = addition (x, y);

In this case, function addition is passed 5 and 3, which are copies of the values of x and y, respectively. These values (5 and 3) are used to initialize the variables set as parameters in the function's definition, but any modification of these variables within the function has no effect on the values of the variables x and y outside it, because x and y themselves were not passed to the function in the call, only copies of their values at that moment.

In certain cases, though, it may be useful to access an external variable from within a function. To do that, arguments can be passed by reference, instead of by value. For example, the function duplicate in this code doubles the value of each of its three arguments, causing the variables used as arguments to actually be modified by the call:
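
To make the effect of copying visible, here is a minimal sketch with a function that modifies its own parameter. The name `add_one` is hypothetical and chosen only for illustration:

```cpp
// Hypothetical function: n is a local copy of the argument,
// so changing it does not touch the caller's variable.
int add_one(int n)
{
    n += 1;      // modifies only the local copy
    return n;
}
```

After `int x = 5; int y = add_one(x);`, the variable `y` holds 6 while `x` still holds 5.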

In [39]:
void duplicate (int& a, int& b, int& c)
{
    a*=2;
    b*=2;
    c*=2;
}

In [43]:
int x=1, y=2, z=3;
duplicate(x, y, z);
cout << "x=" << x << endl;
cout << "y=" << y << endl;
cout << "z=" << z << endl;

x=2
y=4
z=6


To gain access to its arguments, the function declares its parameters as references. In C++, references are indicated with an ampersand (&) following the parameter type, as in the parameters taken by duplicate in the example above.
When a variable is passed by reference, what is passed is no longer a copy but the variable itself: the function parameter becomes an alias of the argument passed to the function, and any modification made to the parameter within the function is reflected in the variable passed as the argument in the call.


![function_argumests_reference](../static/img/function_by_reference.png)
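
Another common use of reference parameters is exchanging the values of two variables. The sketch below assumes a hypothetical helper named `swap_values`; it is not part of the example above:

```cpp
// Hypothetical helper: exchanges the values of the two variables
// passed as arguments. a and b are aliases of the caller's variables.
void swap_values(int& a, int& b)
{
    int temp = a;
    a = b;
    b = temp;
}
```

After `int x = 1, y = 2; swap_values(x, y);`, `x` holds 2 and `y` holds 1. A call such as `swap_values(1, 2)` would not compile, since a literal cannot bind to a non-const reference.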

## Efficiency considerations and const references

Calling a function with parameters taken by value causes copies of the values to be made. This is a relatively inexpensive operation for fundamental types such as int, but if the parameter is of a large compound type, it may result in some overhead. For example, consider the following function:

In [45]:
string concatenate (string a, string b)
{
  return a+b;
}

This function takes two strings as parameters (by value), and returns the result of concatenating them. By passing the arguments by value, the function forces a and b to be copies of the arguments passed to the function when it is called. And if these are long strings, it may mean copying large quantities of data just for the function call.
But this copy can be avoided altogether if both parameters are made references:

In [46]:
string concatenate (string& a, string& b)
{
  return a+b;
}

Arguments by reference do not require a copy. The function operates directly on (aliases of) the strings passed as arguments, and, at most, it might mean the transfer of certain pointers to the function. In this regard, the version of concatenate taking references is more efficient than the version taking values, since it does not need to copy expensive-to-copy strings.

On the flip side, functions with reference parameters are generally perceived as functions that modify the arguments passed, because that is what reference parameters are actually for.
The solution is for the function to guarantee that its reference parameters are not going to be modified. This can be done by qualifying the parameters as constant:

In [47]:
string concatenate (const string& a, const string& b)
{
  return a+b;
}

By qualifying them as const, the function is forbidden from modifying the values of either a or b, but can still access their values as references (aliases of the arguments), without having to make actual copies of the strings.
Therefore, const references provide functionality similar to passing arguments by value, but with increased efficiency for parameters of large types. That is why they are extremely popular in C++ for arguments of compound types. Note, though, that for most fundamental types there is no noticeable difference in efficiency, and in some cases const references may even be less efficient!
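
As a further illustration, the sketch below shows a read-only function taking const references. The name `total_length` is hypothetical; the commented-out line marks the kind of statement the compiler would reject:

```cpp
#include <cstddef>
#include <string>

// Hypothetical function: reads both strings through const references,
// so no copies are made and neither argument can be modified here.
std::size_t total_length(const std::string& a, const std::string& b)
{
    // a += "x";  // would not compile: a refers to a const string
    return a.size() + b.size();
}
```

Unlike a non-const reference, a const reference can also bind to temporaries, so `total_length("ab", std::string("cde"))` is a valid call and yields 5.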

## Inline functions

Calling a function generally causes a certain overhead (stacking arguments, jumps, etc.), and thus for very short functions it may be more efficient to simply insert the code of the function where it is called, instead of performing the process of formally calling a function.
Preceding a function declaration with the inline specifier informs the compiler that inline expansion is preferred over the usual function call mechanism for that function. This does not change the behavior of the function at all; it merely suggests to the compiler that the code generated by the function body be inserted at each point where the function is called, instead of being invoked with a regular function call.
For example, the concatenate function above may be declared inline as:

In [48]:
inline string concatenate (const string& a, const string& b)
{
  return a+b;
}

This informs the compiler that when concatenate is called, the program prefers the function to be expanded inline, instead of performing a regular call. inline is only specified in the function declaration, not when it is called.
Note that most compilers already optimize code to generate inline functions when they see an opportunity to improve efficiency, even if not explicitly marked with the inline specifier. Therefore, this specifier merely indicates to the compiler that inlining is preferred for this function, although the compiler is free not to inline it and to optimize otherwise. In C++, optimization is a task delegated to the compiler, which is free to generate any code as long as the resulting behavior is the one specified by the code.

## Default values in parameters

Imagine we need to create a library for calculations that involve dividing numbers. Sometimes the divisor is not known in advance, but most of the time it is 2. You could think of creating these two functions:

In [74]:
int divide(int a, int b){
    return a/b;
}

In [73]:
int divide(int a){
    return a/2;
}

This way, users of the library can omit the second argument when the divisor is expected to be 2.

In [75]:
cout << divide(14, 5);

2

In [76]:
cout << divide(14);

7

In C++, functions can also have optional parameters, for which no arguments are required in the call, in such a way that, for example, a function with three parameters may be called with only two. For this, the function shall include a default value for its last parameter, which is used by the function when called with fewer arguments. For example:

In [77]:
int divide1(int a, int b=2){
    return a/b;
}

In [78]:
cout << divide1(14, 5);

2

In [79]:
cout << divide1(14);

7
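
More than one parameter can have a default value, as long as every parameter after the first defaulted one also has a default. A minimal sketch, using a hypothetical function `box_volume` that is not part of the library example above:

```cpp
// Hypothetical function: width and height are optional.
// Parameters with defaults must come after those without.
int box_volume(int length, int width = 1, int height = 1)
{
    return length * width * height;
}
```

Here `box_volume(2)` yields 2, `box_volume(2, 3)` yields 6, and `box_volume(2, 3, 4)` yields 24. Arguments are matched to parameters from left to right, so it is not possible to supply `height` while omitting `width`.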

## Declaring functions

In C++, identifiers can only be used in expressions once they have been declared. For example, some variable x cannot be used before being declared with a statement, such as:

```cpp
int x;
```

The same applies to functions. Functions cannot be called before they are declared. That is why, in a complete program, functions are typically defined before the main function, which is the function from which the other functions are called. If main were defined before the other functions, and those functions had not been declared beforehand, this would break the rule that functions shall be declared before being used, and the program would not compile.

__The prototype of a function can be declared without actually defining the function completely__, giving just enough details to allow the types involved in a function call to be known. Naturally, the function shall be defined somewhere else, like later in the code. But at least, once declared like this, it can already be called.
The declaration shall include all types involved (the return type and the type of its arguments), using the same syntax as used in the definition of the function, but replacing the body of the function (the block of statements) with an ending semicolon.
The parameter list does not need to include the parameter names, but only their types. Parameter names can nevertheless be specified, but they are optional, and do not need to necessarily match those in the function definition. For example, a function called protofunction with two int parameters can be declared with either of these statements:

In [80]:
int protofunction (int first, int second);

In [81]:
int protofunction (int, int);

Anyway, including a name for each parameter always improves legibility of the declaration.
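
Declarations become necessary when two functions call each other, because neither can be fully defined before the other. The following sketch uses two hypothetical functions, `is_even` and `is_odd`, to show a prototype being used before its definition appears:

```cpp
// Declarations (prototypes): enough for the compiler to check calls.
bool is_even(unsigned int n);
bool is_odd(unsigned int n);

// is_odd calls is_even, which is defined only further down;
// this compiles because is_even has already been declared.
bool is_odd(unsigned int n)
{
    if (n == 0)
        return false;
    return is_even(n - 1);
}

bool is_even(unsigned int n)
{
    if (n == 0)
        return true;
    return is_odd(n - 1);
}
```

Without the two declarations at the top, the call to `is_even` inside `is_odd` would be rejected, since `is_even` would not yet be known at that point.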

## Recursivity

Recursivity is the ability of a function to call itself. It is useful for some tasks, such as sorting elements, or calculating the factorial of numbers. For example, in order to obtain the factorial of a number (n!) the mathematical formula would be:

    n! = n * (n-1) * (n-2) * (n-3) ... * 1 

More concretely, 5! (factorial of 5) would be:

    5! = 5 * 4 * 3 * 2 * 1 = 120 
    
And a recursive function to calculate this in C++ could be:

In [104]:
int factorial (int a)
{
    if (a > 1)
        return (a * factorial (a-1));
    else
        return 1;
}

In [107]:
int number = 5;
cout << factorial (number);

120

Notice how in function factorial we included a call to itself, but only if the argument passed was greater than 1. Otherwise, the function would recurse indefinitely: once it reached 0, it would continue multiplying by all the negative numbers (probably provoking a stack overflow at some point during runtime).
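
The same pattern, a base case that stops the recursion plus a recursive step that moves toward it, applies to other problems as well. As a sketch, the hypothetical function `gcd_recursive` below computes a greatest common divisor with Euclid's algorithm; it is an additional illustration and not part of the factorial example:

```cpp
// Hypothetical example: greatest common divisor by Euclid's algorithm.
// The base case (b == 0) ends the recursion; each recursive call
// replaces (a, b) with (b, a % b), so the second value keeps shrinking.
unsigned int gcd_recursive(unsigned int a, unsigned int b)
{
    if (b == 0)
        return a;
    return gcd_recursive(b, a % b);
}
```

For instance, `gcd_recursive(48, 18)` calls itself with (18, 12), then (12, 6), then (6, 0), and returns 6.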

## Overloads and templates
### Overloaded functions
In C++, two different functions can have the same name if their parameters are different; either because they have a different number of parameters, or because any of their parameters are of a different type. For example: 

In [111]:
int operate (int a, int b)
{
  return (a*b);
}

In [122]:
double operate (double a, double b)
{
  return (a/b);
}

In [120]:
int x=5,y=2;
cout << operate (x,y) << '\n';

10


In [121]:
double n=5.0,m=2.0;
cout << operate (n,m) << '\n';

2.5


The compiler knows which one to call in each case by examining the types passed as arguments when the function is called. If it is called with two int arguments, it calls the function that has two int parameters, and if it is called with two doubles, it calls the one with two doubles.
In this example, both functions have quite different behaviors: the int version multiplies its arguments, while the double version divides them. This is generally not a good idea. Two functions with the same name are generally expected to have, at least, similar behavior, but this example demonstrates that it is entirely possible for them not to. Two overloaded functions (i.e., two functions with the same name) have entirely different definitions; they are, for all purposes, different functions that only happen to have the same name.

>Note that a function cannot be overloaded only by its return type. The parameter lists must differ, either in the number of parameters or in the type of at least one of them.
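
For contrast, here is a sketch of an overload set whose members share both a name and a purpose. The name `largest` is hypothetical; the third overload differs from the first in the number of parameters rather than in their types:

```cpp
// Hypothetical overload set: all three share a name and a purpose.
int largest(int a, int b)
{
    return (a > b) ? a : b;
}

double largest(double a, double b)
{
    return (a > b) ? a : b;
}

// Differs from the first overload in the number of parameters.
int largest(int a, int b, int c)
{
    return largest(largest(a, b), c);
}
```

The call `largest(3, 7)` selects the int version, `largest(2.5, 1.5)` selects the double version, and `largest(3, 9, 4)` selects the three-parameter version. A mixed call such as `largest(3, 2.5)` would be rejected as ambiguous, because each two-parameter overload needs exactly one conversion.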

### Function templates

Overloaded functions may have the same definition. For example:

In [124]:
int sum (int a, int b)
{
  return a+b;
}

In [125]:
double sum (double a, double b)
{
  return a+b;
}

Here, sum is overloaded with different parameter types, but with the exact same body.

The function sum could be overloaded for a lot of types, and it could make sense for all of them to have the same body. For cases such as this, C++ has the ability to define functions with generic types, known as function templates. Defining a function template follows the same syntax as a regular function, except that it is preceded by the template keyword and a series of template parameters enclosed in angle-brackets <>:

    template <template-parameters> function-declaration 
    
The template parameters are a series of parameters separated by commas. These parameters can be generic template types by specifying either the class or typename keyword followed by an identifier. This identifier can then be used in the function declaration as if it was a regular type. For example, a generic sum function could be defined as:

In [126]:
template <class T>
T sum (T a, T b)
{
  return a+b;
}

It makes no difference whether the generic type is specified with keyword class or keyword typename in the template argument list (they are 100% synonyms in template declarations).

In [142]:
template <typename T>
T sum (T a, T b)
{
    T result;
    result = a + b;
    return result;
}

Declaring `T` (a generic type within the template parameters enclosed in angle-brackets) allows `T` to be used anywhere in the function definition, just as any other type; it can be used as the type for parameters, as return type, or to declare new variables of this type. In all cases, it represents a generic type that will be determined at the moment the template is instantiated.

Instantiating a template is applying the template to create a function using particular types or values for its template parameters. This is done by calling the function template, with the same syntax as calling a regular function, but specifying the template arguments enclosed in angle brackets:

    name <template-arguments> (function-arguments)
    
For example, the sum function template defined above can be called with:

In [136]:
auto x = sum<int>(10,20);
cout << x;

30

The function sum<int> is just one of the possible instantiations of function template sum. In this case, by using int as template argument in the call, the compiler automatically instantiates a version of sum where each occurrence of T is replaced by int, as if it was defined as:
    
```cpp
int sum (int a, int b)
{
  return a+b;
}
```

In [141]:
int i=5, j=6, k;
double f=2.0, g=0.5, h;

k=sum<int>(i,j);
h=sum<double>(f,g);

cout << k << endl;
cout << h << endl;

11
2.5


It is possible to instead simply write:

In [147]:
k = sum(i, j);

cout << k;

11

without the type enclosed in angle brackets. Naturally, for that, the type shall be unambiguous. If sum is called with arguments of different types, the compiler may not be able to deduce the type of T automatically.
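
The deduction failure mentioned here can be sketched as follows. The template `add_pair` is a hypothetical stand-in with the same shape as sum, and `mixed_total` is a hypothetical helper used only to show the explicit form:

```cpp
// Hypothetical template with a single type parameter, like sum above.
template <typename T>
T add_pair(T a, T b)
{
    return a + b;
}

// add_pair(1, 2.5) would not compile: T would have to be both
// int and double. Naming the type explicitly resolves this, and
// the int argument is then converted to double.
double mixed_total()
{
    return add_pair<double>(1, 2.5);
}
```

When both arguments have the same type, as in `add_pair(2, 3)`, deduction succeeds and the angle brackets can be omitted.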

Templates are a powerful and versatile feature. They can have multiple template parameters, and the function can still use regular non-templated types. For example:

In [148]:
template <class T, class U>
bool are_equal (T a, U b)
{
  return (a==b);
}

In [150]:
if (are_equal(10,10.0))
    cout << "x and y are equal\n";
else
    cout << "x and y are not equal\n";

x and y are equal


Note that this example uses automatic template parameter deduction in the call to are_equal:

```cpp
are_equal(10,10.0)
```

Is equivalent to:

```cpp
are_equal<int,double>(10,10.0)
```

There is no ambiguity possible because numerical literals always have a specific type: unless otherwise specified with a suffix, integer literals that fit in an int produce values of type int, and floating-point literals produce values of type double. Therefore 10 always has type int and 10.0 always has type double.

### Non-type template arguments
The template parameters can not only include types introduced by class or typename, but can also include expressions of a particular type:

In [152]:
template <class T, int N>
T fixed_multiply (T val)
{
  return val * N;
}

In [153]:
cout << fixed_multiply<int,2>(10);

20

In [154]:
cout << fixed_multiply<int,3>(10);

30

The second argument of the fixed_multiply function template is of type int. It just looks like a regular function parameter, and can actually be used just like one.

But there exists a major difference: the value of a template argument is determined at compile time to generate a different instantiation of the function fixed_multiply, and thus the value of that argument is never passed at runtime. The two calls to fixed_multiply above essentially call two versions of the function: one that always multiplies by two, and one that always multiplies by three. For that same reason, the second template argument needs to be a constant expression (it cannot be passed a variable).
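
The constant-expression requirement can be sketched like this. The names `scale_by`, `kFactor`, and `scaled_ten` are hypothetical, and the example assumes a compiler that supports `constexpr` (C++11 or later):

```cpp
// Hypothetical template, same shape as fixed_multiply above.
template <typename T, int N>
T scale_by(T val)
{
    return val * N;
}

// A constexpr value is a constant expression, so it is accepted
// as a template argument.
constexpr int kFactor = 4;

int scaled_ten()
{
    // int factor = 4;
    // scale_by<int, factor>(10);   // would not compile: not a constant
    return scale_by<int, kFactor>(10);
}
```

Calling `scaled_ten()` yields 40, computed by the instantiation of `scale_by` in which `N` is fixed to 4.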