# Introduction to Lambda Functions

Lambda functions, also known as anonymous functions, are a feature of Python and many other programming languages. They let you define small, throwaway functions inline, without a full `def` statement. In this notebook, we will explore the concept, syntax, examples, and common use cases of lambda functions in Python.

## Concept and Syntax

A lambda function is a small, unnamed function defined using the `lambda` keyword. This function can have any number of arguments but can only have one expression. The expression is evaluated and returned when the lambda function is called.

The general syntax of a lambda function is:
```python
lambda arguments: expression
```
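A lambda expression can be used wherever a function object is required, for example as an argument to another function. Because its body is a single expression, it cannot contain statements such as assignment statements, loops, or `return`.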
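To make the single-expression restriction concrete, the sketch below compares a lambda with an equivalent `def` function. The names `add_lambda`, `add_def`, and `sign` are illustrative and do not appear elsewhere in this notebook.

```python
# A lambda and an equivalent def-based function (names are illustrative).
add_lambda = lambda a, b: a + b

def add_def(a, b):
    # The lambda's single expression becomes the return value here.
    return a + b

# Both are ordinary function objects and behave the same when called.
result_lambda = add_lambda(2, 3)
result_def = add_def(2, 3)
print(result_lambda, result_def)

# A conditional *expression* is allowed inside a lambda,
# but an `if ...:` block, a loop, or an assignment statement is not.
sign = lambda n: "negative" if n < 0 else "non-negative"
print(sign(-4), sign(7))
```

The lambda form is handy when the function is passed straight to another call; the `def` form is usually easier to read once the logic needs a name or more than one step.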

In [None]:
# Example of a simple lambda function
square = lambda x: x**2
square(5)

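In the example above, we defined a lambda function that takes one argument `x` and returns its square. We then assigned this lambda function to the variable `square` and called it with an argument of 5. The result, 25, is the square of 5. The assignment is only for demonstration: PEP 8 recommends a `def` statement when a function needs a name, and lambdas are most useful when passed directly to another function.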

In [None]:
# Example of a lambda function with multiple arguments
multiply = lambda x, y: x * y
multiply(3, 4)

In the example above, we defined a lambda function that takes two arguments, `x` and `y`, and returns their product. We assigned this lambda function to the variable `multiply` and then called it with arguments 3 and 4. The result, 12, is the product of 3 and 4.

## Common Use Cases of Lambda Functions

Lambda functions are particularly useful in scenarios where you need a simple function for a short period and do not want to formally define it using the `def` keyword. Some common use cases include:

1. **Sorting and Filtering**: Lambda functions are often used with functions like `sorted()` and `filter()` to define custom sorting or filtering logic.
2. **Functional Programming**: They are used in functional programming paradigms where functions are passed as arguments to other functions.
3. **Short-lived Operations**: For operations that are used once or a limited number of times in the code.
4. **Callback Functions**: In scenarios where a small piece of code needs to be executed in response to some event.
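As a minimal sketch of the functional programming and callback cases in the list above, the example below passes lambdas to `map()`, `functools.reduce()`, and a small helper. The helper `run_task` and its `on_done` parameter are hypothetical names invented for this illustration.

```python
from functools import reduce

numbers = [1, 2, 3, 4]

# Functional programming: pass lambdas to map() and reduce().
doubled = list(map(lambda n: n * 2, numbers))
total = reduce(lambda acc, n: acc + n, numbers, 0)

# Callback: a hypothetical helper that hands its result to `on_done`.
def run_task(value, on_done):
    result = value + 1
    on_done(result)

received = []
run_task(41, on_done=lambda r: received.append(r))

print(doubled, total, received)
```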

In [None]:
# Example of using lambda function for sorting
students = [{'name': 'John', 'grade': 85},
            {'name': 'Jane', 'grade': 90},
            {'name': 'Doe', 'grade': 88}]

# Sorting students by grade using lambda
sorted_students = sorted(students, key=lambda x: x['grade'], reverse=True)
sorted_students

In the example above, we have a list of students with their names and grades. We want to sort this list based on the grades in descending order. To achieve this, we use the `sorted()` function with a lambda function as the `key` argument. The lambda function takes each student dictionary and returns the grade, which is then used as the key for sorting. The `reverse=True` argument ensures that the sorting is done in descending order.
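A key function can also return a tuple when ties need a secondary ordering. The sketch below assumes one extra, made-up record (`'Amy'`) so that two students share a grade; it sorts by grade in descending order and then by name in ascending order.

```python
students = [
    {'name': 'John', 'grade': 85},
    {'name': 'Jane', 'grade': 90},
    {'name': 'Doe', 'grade': 88},
    {'name': 'Amy', 'grade': 88},  # hypothetical record added to create a tie
]

# Sort by grade descending, then by name ascending to break ties.
# Negating the grade avoids reverse=True, which would also reverse the names.
ranked = sorted(students, key=lambda s: (-s['grade'], s['name']))
names_in_order = [s['name'] for s in ranked]
print(names_in_order)  # ['Jane', 'Amy', 'Doe', 'John']
```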

In [None]:
# Example of using lambda function with filter()
numbers = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

# Filtering even numbers using lambda
even_numbers = list(filter(lambda x: x % 2 == 0, numbers))
even_numbers

In the example above, we have a list of numbers from 1 to 10 and want to keep only the even ones. To achieve this, we pass a lambda function and the list to `filter()`. The lambda returns `True` when a number is even (i.e., divisible by 2), and `filter()` keeps exactly those elements for which it returns `True`. Because `filter()` returns a lazy iterator rather than a list, we convert the result with `list()`.
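For comparison, the same selection can be written as a list comprehension, which many Python programmers find easier to read. This is a small sketch; the variable names `evens_filter` and `evens_comprehension` are illustrative.

```python
numbers = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

# filter() returns a lazy iterator, so list() is needed to see the values.
evens_filter = list(filter(lambda x: x % 2 == 0, numbers))

# An equivalent list comprehension with the condition written inline.
evens_comprehension = [x for x in numbers if x % 2 == 0]

print(evens_filter == evens_comprehension)  # True
```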

## Exercises

1. **Lambda for Square**: Write a lambda function that takes a number and returns its square. Test it with a few numbers.
2. **Lambda for Sorting**: Given a list of tuples where each tuple contains a name and age, use a lambda function to sort the list by age in ascending order.
3. **Lambda for Filtering**: Given a list of strings, use a lambda function with the built-in `filter()` function to keep only the strings that have a length greater than 5.

In [None]:
# Answer for Exercise 1
square_lambda = lambda x: x**2
square_lambda(6)

In [None]:
# Answer for Exercise 2
people = [('John', 35), ('Doe', 25), ('Jane', 30)]
sorted_people = sorted(people, key=lambda x: x[1])
sorted_people

In [None]:
# Answer for Exercise 3
words = ['apple', 'banana', 'cherry', 'date', 'elderberry', 'fig', 'grapefruit']
long_words = list(filter(lambda x: len(x) > 5, words))
long_words

## Lambda Functions in Data Wrangling and Preprocessing

Lambda functions are often used in data wrangling and preprocessing tasks, especially in the context of data science, NLP, and machine learning. They provide a concise way to apply transformations or operations on data without the need for defining full-fledged functions. Here are some common use cases:

### 1. Text Cleaning in NLP

In Natural Language Processing (NLP), text data often needs to be cleaned and preprocessed before it can be used for modeling. Lambda functions can be used to quickly apply text cleaning operations such as converting text to lowercase, removing punctuation, or stripping whitespace.

In [None]:
import pandas as pd
import string

# Sample text data
text_data = ['Hello World!', 'This is an NLP task.', 'Lambda functions are useful.']
text_series = pd.Series(text_data)

# Using lambda to clean text
cleaned_text = text_series.apply(lambda x: x.lower().translate(str.maketrans('', '', string.punctuation)).strip())
cleaned_text

In the example above, we used a lambda function with the `apply()` method of a pandas Series to:

1. Convert the text to lowercase using `lower()`.
2. Remove punctuation using `translate()` with a translation table that `str.maketrans()` builds from `string.punctuation`.
3. Strip any leading or trailing whitespace using `strip()`.

This provides a quick way to clean text data in preparation for further NLP tasks.
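The same cleaning steps can be tried without pandas. The sketch below is one possible variation, not the notebook's method: it builds the translation table once, outside the lambda, so it is not rebuilt for every string. The names `remove_punctuation` and `clean` are illustrative.

```python
import string

# Build the translation table once instead of on every call.
remove_punctuation = str.maketrans('', '', string.punctuation)

# Lowercase, drop punctuation, then strip surrounding whitespace.
clean = lambda text: text.lower().translate(remove_punctuation).strip()

text_data = ['Hello World!', 'This is an NLP task.', 'Lambda functions are useful.']
cleaned = [clean(t) for t in text_data]
print(cleaned)
```

The `clean` callable can be passed to a pandas Series' `apply()` method in the same way as the inline lambda.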

### 2. Feature Engineering in Machine Learning

Lambda functions can be used for quick feature engineering tasks. For instance, you might want to create a new feature based on the values of an existing feature. Let's consider a dataset with a 'price' column, and we want to categorize items as 'cheap', 'medium', or 'expensive' based on their price.

In [None]:
# Sample data
data = {'item': ['apple', 'banana', 'cherry', 'date'], 'price': [0.5, 0.2, 1.5, 2.0]}
df = pd.DataFrame(data)

# Using lambda to categorize items based on price
df['price_category'] = df['price'].apply(lambda x: 'cheap' if x < 1 else ('medium' if x < 2 else 'expensive'))
df

In the example above, we used a lambda function to create a new column 'price_category' in our DataFrame. The lambda function checks the value of the 'price' column and assigns a category based on the price range:

- Items priced below $1 are categorized as 'cheap'.
- Items priced from $1 up to, but not including, $2 are categorized as 'medium'.
- Items priced at $2 or more are categorized as 'expensive'.

This is a simple example of how lambda functions can be used for feature engineering in machine learning tasks.
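Nested conditional expressions become hard to read quickly. As a hedged alternative, the sketch below expresses the same thresholds as a named function; `categorize_price` is a name chosen for this illustration. It also makes the boundary behaviour explicit: a price of exactly 2.0 falls into 'expensive'.

```python
def categorize_price(price):
    """Return a price category using the same thresholds as the lambda."""
    if price < 1:
        return 'cheap'
    if price < 2:
        return 'medium'
    return 'expensive'

prices = [0.5, 0.2, 1.5, 2.0]
categories = [categorize_price(p) for p in prices]
print(categories)  # ['cheap', 'cheap', 'medium', 'expensive']
```

A named function like this can be passed to `apply()` in place of the lambda.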

### 3. Data Transformation

Lambda functions can be used for various data transformation tasks. For instance, in machine learning, it's common to normalize or standardize features. Let's consider a dataset with a 'weight' column, and we want to normalize the weights to a range between 0 and 1.

In [None]:
# Sample data
data = {'name': ['A', 'B', 'C', 'D'], 'weight': [45, 60, 55, 68]}
weight_df = pd.DataFrame(data)

# Using lambda to normalize weights
min_weight = weight_df['weight'].min()
max_weight = weight_df['weight'].max()
weight_df['normalized_weight'] = weight_df['weight'].apply(lambda x: (x - min_weight) / (max_weight - min_weight))
weight_df

In the example above, we used a lambda function to normalize the 'weight' column in our DataFrame. The lambda function applies the normalization formula:

\[ \text{normalized value} = \frac{\text{value} - \text{min value}}{\text{max value} - \text{min value}} \]

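This formula, commonly called min-max normalization, maps the smallest weight to 0 and the largest to 1, with all other values falling in between. Such normalization is often used in machine learning to put features on a comparable scale, which can improve the performance of algorithms that are sensitive to feature magnitude.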
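To see the arithmetic, the sketch below applies the formula to the four weights from the example using plain Python. Returning 0.0 when all values are equal is an assumption made here to avoid division by zero; the notebook itself does not address that case.

```python
weights = [45, 60, 55, 68]
min_weight, max_weight = min(weights), max(weights)
weight_range = max_weight - min_weight  # 68 - 45 = 23

# Guard against a zero range (all values equal), which would divide by zero.
# Falling back to 0.0 is an assumption for this sketch.
normalize = lambda x: (x - min_weight) / weight_range if weight_range else 0.0

normalized = [round(normalize(w), 3) for w in weights]
print(normalized)  # [0.0, 0.652, 0.435, 1.0]
```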

### 4. Tokenization in NLP

Tokenization is a fundamental step in NLP where a text is split into individual tokens (usually words). Lambda functions can be used to quickly tokenize text data. Let's consider a dataset with sentences, and we want to tokenize each sentence into words.

In [None]:
# Sample text data
sentences_data = ['Hello world', 'Lambda functions are powerful', 'NLP is interesting']
sentences_series = pd.Series(sentences_data)

# Using lambda to tokenize sentences
tokenized_sentences = sentences_series.apply(lambda x: x.split())
tokenized_sentences

In the example above, we used a lambda function with the `split()` method to tokenize each sentence in our Series. By default, `split()` splits a string on runs of whitespace, which breaks the sentence into individual words. Note that any punctuation stays attached to the neighbouring word, so this is only a basic form of tokenization for NLP tasks.
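The difference matters once sentences contain punctuation. The sketch below contrasts whitespace splitting with a simple regex-based tokenizer; the sample sentence and the `tokenize` name are made up for this illustration, and the regex is one simple option rather than a full NLP tokenizer.

```python
import re

sentence = 'Hello, world! Lambda functions are powerful.'

# Whitespace splitting keeps punctuation attached to the words.
whitespace_tokens = sentence.split()

# A simple alternative: lowercase, then keep only runs of word characters.
tokenize = lambda text: re.findall(r'\w+', text.lower())
regex_tokens = tokenize(sentence)

print(whitespace_tokens)
print(regex_tokens)
```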