## Ch 1 - What is Deep Learning?

### 1.1 Artificial Intelligence, Machine Learning, and Deep Learning


![AI](Images/01_01.jpg)


#### 1.1.1 Artificial Intelligence


AI is the effort to automate intellectual tasks normally performed by humans. 

AI is a general field that encompasses machine learning and deep learning, but that also includes other approaches that don't involve any learning.

AI was born in the 1950s. Early chess programs involved only hardcoded rules crafted by programmers and did not qualify as machine learning. For a long time, many experts believed that human-level artificial intelligence could be achieved by having programmers handcraft a sufficiently large set of explicit rules. This approach is called **symbolic AI**, and it was the dominant paradigm in AI from the 1950s to the late 1980s.

Although symbolic AI proved suitable for solving well-defined, logical problems, it turned out to be intractable to figure out explicit rules for more complex, fuzzy problems such as image classification, speech recognition, and language translation.

#### 1.1.2 Machine Learning

A machine learning system is trained rather than explicitly programmed. It is presented with many examples relevant to a task, and it finds statistical structure in these examples that eventually allows the system to come up with rules for automating the task.

For example, if you wanted to automate the task of tagging your vacation pictures, you could present a machine learning system with many examples of pictures already tagged by humans, and the system would learn statistical rules for associating specific pictures to specific tags.
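The idea of "trained rather than explicitly programmed" can be sketched with a toy version of the picture-tagging task. Everything here is an assumption made for illustration: each picture is reduced to a single made-up feature (average brightness), the tags `beach` and `night` are hypothetical, and the "learning" is just a search for the threshold that best matches the human-provided tags. Real systems use far richer features and models.

```python
# Hypothetical toy data: each "picture" is reduced to one made-up feature
# (average brightness in [0, 1]) and carries a tag assigned by a human.
examples = [
    (0.91, "beach"), (0.84, "beach"), (0.77, "beach"), (0.69, "beach"),
    (0.35, "night"), (0.22, "night"), (0.18, "night"), (0.41, "night"),
]

def accuracy(threshold, data):
    """Fraction of examples tagged correctly by the rule
    'brightness > threshold means beach, otherwise night'."""
    correct = sum(
        1 for brightness, tag in data
        if ("beach" if brightness > threshold else "night") == tag
    )
    return correct / len(data)

def learn_threshold(data):
    """Pick the candidate threshold that tags the most examples correctly."""
    values = sorted(brightness for brightness, _ in data)
    # Candidate thresholds: midpoints between neighbouring feature values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]
    return max(candidates, key=lambda t: accuracy(t, data))

# The rule is found from the examples, not written by the programmer.
threshold = learn_threshold(examples)

def tag(brightness):
    """Apply the learned rule to a new picture's brightness."""
    return "beach" if brightness > threshold else "night"
```

The programmer supplies the examples and the search procedure, but the specific rule (the threshold value) comes from the data.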

Machine learning is tightly related to statistics, but unlike statistics, machine learning tends to deal with large, complex datasets for which classical statistical analysis such as Bayesian analysis would be impractical.

In machine learning, ideas are proven empirically more often than theoretically.

#### 1.1.3 Learning Representations from Data

To do machine learning, we need three things:

- Input data points: if the task is speech recognition, these data points could be sound files of people speaking. If the task is image tagging, they could be pictures.

- Examples of the expected output: in speech recognition, these could be human-generated transcripts of sound files. In image tagging, expected outputs could be tags such as 'dog,' 'cat,' and so on.

- A way to measure whether the algorithm is doing a good job: this is necessary in order to determine the distance between the algorithm's current output and its expected output. The measurement is used as a feedback signal to adjust the way the algorithm works. This adjustment is *learning*.
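The three ingredients above can be sketched as a tiny learning loop. This is a minimal illustration under stated assumptions, not a method described in these notes: the data is made up (targets follow `y = 2x`), the model is a single weight `w`, the measure is mean squared error, and the adjustment rule is plain gradient descent with a hypothetical `learning_rate`.

```python
# 1. Input data points (hypothetical).
inputs = [1.0, 2.0, 3.0, 4.0]
# 2. Examples of the expected output (hypothetical targets, y = 2x).
expected = [2.0, 4.0, 6.0, 8.0]

# 3. A way to measure how far the current outputs are from the expected ones.
def loss(w):
    """Mean squared distance between the model's outputs (w * x) and the targets."""
    return sum((w * x - y) ** 2 for x, y in zip(inputs, expected)) / len(inputs)

def gradient(w):
    """Derivative of the loss with respect to w: the feedback signal."""
    return sum(2 * (w * x - y) * x for x, y in zip(inputs, expected)) / len(inputs)

w = 0.0                 # initial guess for the single model parameter
learning_rate = 0.05    # assumed step size
history = [loss(w)]     # loss before any adjustment

for _ in range(50):
    # Adjust the parameter in the direction that reduces the loss.
    w -= learning_rate * gradient(w)
    history.append(loss(w))
```

Each pass uses the measurement as feedback to nudge `w`, so the recorded loss shrinks and `w` approaches the value that reproduces the expected outputs.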


A machine-learning model transforms its input data into meaningful outputs, a process that is learned from exposure to known examples of inputs and outputs. The central problem in machine learning and deep learning is to meaningfully transform data: to learn useful representations of the input data at hand that get us closer to the expected output.

A representation is a different way to look at data: a way to represent or encode it.
- A color image can be encoded in the RGB format or in the HSV format; these are two different representations of the same data. Some tasks that are difficult with one representation may be easier with the other.
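As a small illustration of two representations of the same data, the sketch below converts a single pixel between RGB and HSV with Python's standard `colorsys` module. The pixel value and the "halve the saturation" task are assumptions chosen for the example; the point is only that reducing saturation is a one-line change in HSV, whereas it is less direct in RGB.

```python
import colorsys

# One pure-red pixel, with channels scaled to [0, 1] as colorsys expects.
rgb = (1.0, 0.0, 0.0)

# Same color, different representation: hue, saturation, value.
h, s, v = colorsys.rgb_to_hsv(*rgb)

# Making the color less saturated is trivial in HSV: scale one number.
desaturated_rgb = colorsys.hsv_to_rgb(h, s * 0.5, v)
```

Here the desaturated pixel comes back as a paler red in RGB, obtained without reasoning about how the three RGB channels interact.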

Machine learning models are all about finding appropriate representations for their input data.





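Consider a set of points in a plane, some white and some black, as in the figure below, and the task of predicting a point's color from its coordinates. In this case, we need a representation of our data that cleanly separates the white points from the black points. One possible transformation is a coordinate change. In the new coordinate system, the classification problem becomes nearly trivial.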

![Representation](Images/01_02.jpg)

- The inputs are the coordinates of the points

- The expected outputs are the colors of our points

- A way to measure whether our algorithm is doing a good job could be the percentage of points that are correctly classified. 
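The coordinate-change idea can be sketched as follows. The points, the 45-degree rotation, and the rule "black if the new x is positive" are all assumptions chosen so the example works; in machine learning the transformation would be searched for automatically using the accuracy measure as feedback.

```python
import math

# Hypothetical points (x, y, color). White points lie above the line y = x,
# black points below it.
points = [
    (1.0, 3.0, "white"), (0.0, 2.0, "white"), (-2.0, 1.0, "white"), (-1.0, 0.5, "white"),
    (3.0, 1.0, "black"), (2.0, 0.0, "black"), (1.0, -2.0, "black"), (0.5, -1.0, "black"),
]

def change_coordinates(x, y, angle=math.pi / 4):
    """Rotate the point counterclockwise by `angle` around the origin
    (equivalently, rotate the axes by -angle)."""
    new_x = x * math.cos(angle) - y * math.sin(angle)
    new_y = x * math.sin(angle) + y * math.cos(angle)
    return new_x, new_y

def classify_raw(x, y):
    """Simple rule applied to the original coordinates."""
    return "black" if x > 0 else "white"

def classify_transformed(x, y):
    """The same simple rule applied after the coordinate change."""
    new_x, _ = change_coordinates(x, y)
    return "black" if new_x > 0 else "white"

def accuracy(classifier):
    """Fraction of points whose predicted color matches the expected color."""
    correct = sum(1 for x, y, color in points if classifier(x, y) == color)
    return correct / len(points)

raw_accuracy = accuracy(classify_raw)
transformed_accuracy = accuracy(classify_transformed)
```

On this made-up data the rule misclassifies a point in the original coordinates but separates the two colors completely after the change of coordinates.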

#### 1.1.4 The "Deep" in Deep Learning

Deep learning is a specific subfield of machine learning: a new take on learning representations from data that puts an emphasis on learning successive layers of increasingly meaningful representations.

Modern deep learning often involves tens or even hundreds of successive layers of representations, and they’re all learned automatically from exposure to training data. Meanwhile, other approaches to machine learning tend to focus on learning only one or two layers of representations of the data; they’re sometimes called *shallow learning*.

The number of layers that contribute to a model of the data is called the *depth* of the model.

In deep learning, these layered representations are (almost always) learned via models called neural networks, structured in layers stacked on top of each other.
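A rough sketch of "layers stacked on top of each other" is shown below in plain Python. The weights are hand-picked and hypothetical (in deep learning they would be learned from training data), and the choice of a weighted sum followed by a ReLU for each layer is an assumption made for illustration rather than something specified in these notes.

```python
def dense(weights, biases):
    """Build one layer: a weighted sum per output unit, followed by ReLU."""
    def layer(inputs):
        return [
            max(0.0, sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)
        ]
    return layer

# Hypothetical hand-picked weights; a real network would learn these.
layers = [
    dense([[1.0, -1.0], [0.5, 0.5], [-1.0, 1.0]], [0.0, 0.0, 0.0]),  # 2 values -> 3
    dense([[1.0, 1.0, 0.0], [0.0, 1.0, 1.0]], [0.0, -0.5]),          # 3 values -> 2
    dense([[1.0, -1.0]], [0.0]),                                     # 2 values -> 1
]

def forward(x):
    """Pass the input through every layer, keeping each intermediate representation."""
    representations = [x]
    for layer in layers:
        representations.append(layer(representations[-1]))
    return representations

depth = len(layers)          # the depth of the model is its number of layers
reps = forward([2.0, 1.0])   # input plus one representation per layer
```

Each entry in `reps` is a successive representation of the same input, and the last one is the model's output.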


*Recognizing Digits:* 

A network several layers deep transforms an image of a digit in order to recognize which digit it is:

![Digits1](Images/01_03.jpg)


As the figure below shows, the network transforms the digit image into representations that are increasingly different from the original image and increasingly informative about the final result. You can think of a deep network as a multistage information-distillation operation, where information goes through successive filters and comes out increasingly purified (that is, useful with regard to some task).


![Digits2](Images/01_04.jpg)





#### 1.1.5 Understanding How Deep Learning Works, In 3 Figures

#### 1.1.6 What Deep Learning Has Achieved So Far

#### 1.1.7 Don't Believe the Short-Term Hype

#### 1.1.8 The Promise of AI

### 1.2 Before Deep Learning: A Brief History of Machine Learning

#### 1.2.1 Probabilistic Thinking

#### 1.2.2 Early Neural Networks

#### 1.2.3 Kernel Methods

#### 1.2.4 Decision Trees, Random Forests, and Gradient Boosting Machines

#### 1.2.5 Back to Neural Networks

#### 1.2.6 What Makes Deep Learning Different

#### 1.2.7 The Modern Machine-Learning Landscape

### 1.3 Why Deep Learning? Why Now?

#### 1.3.1 Hardware

#### 1.3.2 Data

#### 1.3.3 Algorithms

#### 1.3.4 A New Wave of Investment

#### 1.3.5 The Democratization of Deep Learning

#### 1.3.6 Will It Last?