diff --git a/lesson1.html b/lesson1.html
index cc39644..58bc57d 100644
--- a/lesson1.html
+++ b/lesson1.html
@@ -32,84 +32,63 @@
- What is Big Data?
+ Lesson 1: Building a Big Data Infrastructure Part 1
-
- Big data is the combination of infrastructure, algorithms, and visualizations around making sense of user and machine generated data.
-
+ Unstructured Storage & Hadoop
-
- Big data does not necessarily mean: more data than you can effectively work with on a single computer.
-
-
-
-
-
- Big data is about gaining insight from data regardless of the size of the data set.
-
-
-
-
- Questions Big Data can Answer
+ Unstructured Data
-
-
What are my users doing on my site?
-
- -
-
Is something spam?
+ Log Files
-
-
What items or users are like each other?
+ Text
-
-
What items might a user like?
+ Unknown Formats
- Types of Data
+ Hadoop
+
+ - Open source
+ - HDFS: Distributed file system modeled after the Google File System (GFS)
+ - MapReduce: Distributed batch processing modeled after Google's MapReduce
+
+
+
+
+ Hadoop's Wider Ecosystem
+
+ - HBase
+ - ZooKeeper
+ - Hive
+ - Cascading
+ - Pig
+ - Flume
+
+
+
+
+ Batch Processing
-
-
User Generated
+ Like cron
-
-
Machine Generated
-
- -
-
Structured
+ Run once or frequently
-
-
Unstructured
+ Ship code to data
-
-
- Goals of a Big Data Infrastructure
-
- -
-
Scalability
-
- -
-
Experimentation
-
- -
-
Mining business intelligence
-
- -
-
Making recommendations
-
- -
-
Monitoring performance
-
-
-
-
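The Hadoop and Batch Processing slides added in this diff name the MapReduce model without showing its shape. As a rough, single-process sketch (not Hadoop's actual API), a job can be thought of as a map step that emits key/value pairs, a shuffle that groups values by key, and a reduce step that folds each group. The names `map_log_line`, `reduce_counts`, `run_job`, and the sample log lines are all hypothetical, chosen only to illustrate counting log levels in unstructured log files.

```python
from collections import defaultdict
from typing import Callable, Iterable, Iterator


def map_log_line(line: str) -> Iterator[tuple[str, int]]:
    """Map step: emit (log_level, 1) for each non-empty log line.

    Assumes (for illustration only) that the first token is the log level.
    """
    parts = line.split()
    if parts:
        yield parts[0], 1


def reduce_counts(key: str, values: Iterable[int]) -> tuple[str, int]:
    """Reduce step: sum all counts emitted for one key."""
    return key, sum(values)


def run_job(
    records: Iterable[str],
    mapper: Callable[[str], Iterator[tuple[str, int]]],
    reducer: Callable[[str, Iterable[int]], tuple[str, int]],
) -> dict[str, int]:
    """Run map, shuffle (group by key), and reduce in one process.

    A real cluster would run the mapper on the nodes that hold the data
    ("ship code to data"); here everything runs locally.
    """
    groups: dict[str, list[int]] = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)  # shuffle: collect values per key
    return dict(reducer(key, values) for key, values in groups.items())


# Hypothetical sample input standing in for an unstructured log file.
SAMPLE_LOG = [
    "INFO user=1 action=login",
    "ERROR user=2 action=checkout",
    "INFO user=3 action=view",
    "",
]

LEVEL_COUNTS = run_job(SAMPLE_LOG, map_log_line, reduce_counts)
print(LEVEL_COUNTS)
```

Because the mapper and reducer are plain functions passed into `run_job`, the same job definition could in principle be handed to a distributed runner, which is the idea behind shipping code to the data rather than moving the data to the code.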