Skip to content
Hands-On-Big-Data-Modeling, Published by Packt
Jupyter Notebook Other
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.
CH02 Chapter 2 code snippets Oct 12, 2018
CH03 Push something in chapter 3 Oct 12, 2018
CH04/graph Chapter 1 to 4 Oct 11, 2018
CH05 cleanup Oct 12, 2018
CH06 rename the readme file correctly Oct 12, 2018
CH07 chapter 7 instructions set. Oct 12, 2018
CH08/json Fix the correct chapter Oct 19, 2018
CH10 Chapter 10 Oct 12, 2018
CH12 update Oct 11, 2018
CH13 Chapter 13 Oct 11, 2018
CH14 Fix chapter 14 error Oct 29, 2018
LICENSE Initial commit Sep 10, 2018 Update Dec 6, 2018

Hands-On Big Data Modeling

Book Name

This is the code repository for [Hands-On Big Data Modeling](Packt UTM URL of the Book), published by Packt.

Effective database design techniques for data architects and business intelligence professionals

What is this book about?

Modeling and managing data is a central focus of all big data projects. In fact, a database is considered to be effective only if you have a logical and sophisticated data model. This book will help you develop practical skills in modeling your own big data projects and improve the performance of analytical queries for your specific business requirements.

This book covers the following exciting features:

  • Get insights into big data and discover various data models
  • Explore conceptual, logical, and big data models
  • Understand how to model data containing different file types
  • Run through data modeling with examples of Twitter, Bitcoin, IMDB and weather data modeling
  • Create data models such as Graph Data and Vector Space

If you feel this book is for you, get your copy today!

Instructions and Navigations

All of the code is organized into folders. For example, Chapter02.

The code will look like the following:

wordcloud = WordCloud(background_color="white",width=1000, height=860, margin=2).generate(text)
import matplotlib.pyplot as plt

Following is what you need for this book: This book is great for programmers, geologists, biologists, and every professional who deals with spatial data. If you want to learn how to handle GIS, GPS, and remote sensing data, then this book is for you. Basic knowledge of R and QGIS would be helpful.

With the following software and hardware list you can run all code files present in the book (Chapter 1-15).

Software and Hardware List

Chapter Software required OS required
1-15 Python, R, Jupyter Notebook Windows, Mac OS X, and Linux (Any)

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.

Related products

Get to Know the Authors

James Lee is a passionate software wizard working at one of the top Silicon Valley-based start-ups specializing in big data analysis. In the past, he has worked at big companies such as Google and Amazon. In his day job, he works with big data technologies, including Cassandra and Elasticsearch, and is an absolute Docker technology geek and IntelliJ IDEA lover with a strong focus on efficiency and simplicity. Apart from his career as a software engineer, he is keen on sharing his knowledge with others and guiding them, especially in relation to start-ups and programming. He has been teaching courses and conducting workshops on Java programming / IntelliJ IDEA since he was 21. James holds an MS degree in computer science from McGill University and has many years' experience as a teaching assistant in a variety of computer science classes. He also enjoys skiing and swimming, and is a passionate traveler.

Tao Wei is a passionate software engineer who works in a leading Silicon Valley-based big data analysis company. Previously, Tao worked in big IT companies, such as IBM and Cisco. He has intensive experience in designing and building distributed, large-scale systems with proven high availability and reliability. Tao has an MS degree in computer science from McGill University and many years of experience as a teaching assistant in various computer science classes. When not working, he enjoys reading and swimming, and is a passionate photographer.

Suresh Kumar Mukhiya is a PhD candidate currently associated with Western Norway University of Applied Sciences (HVL). He is also a web application developer and big data enthusiast specializing in information systems, model-driven software engineering, big data analysis, and artificial intelligence. He has completed a masters in information systems from the Norwegian University of Science and Technology, along with a thesis in processing mining. He also holds a bachelor's degree in computer science and information technology (BSc.CSIT).

Other books by the authors

Suggestions and Feedback

Click here if you have any feedback or suggestions.

You can’t perform that action at this time.