This is the code repository for Big Data Architect’s Handbook, published by Packt.
A guide to building proficiency in tools and systems used by leading big data experts
The big data architects are the “masters” of data, and hold high value in today’s market. Handling big data, be it of good or bad quality, is not an easy task. The prime job for any big data architect is to build an end-to-end big data solution that integrates data from different sources and analyzes it to find useful, hidden insights.
This book covers the following exciting features: <First 5 What you'll learn points>
- Learn Hadoop Ecosystem and Apache projects
- Understand, compare NoSQL database and essential software architecture
- Cloud infrastructure design considerations for big data
- Explore application scenario of big data tools for daily activities
- Learn to analyze and visualize results to uncover valuable insights
- Build and run a big data application with sample code from end to end
- Apply Machine Learning and AI to perform big data intelligence
- Practice the daily activities performed by big data architects
If you feel this book is for you, get your copy today!
All of the code is organized into folders. For example, Chapter16.
The code will look like the following:
agent.sources=netcatSource
agent.sinks=hdfsWriter
agent.channels=memoryChannel
Following is what you need for this book: Big Data Architect’s Handbook is for you if you are an aspiring data professional, developer, or IT enthusiast who aims to be an all-round architect in big data. This book is your one-stop solution to enhance your knowledge and carry out easy to complex activities required to become a big data architect.
With the following software and hardware list you can run all code files present in the book (Chapter 1-19).
Chapter | Software required | Hardware required |
---|---|---|
1-19 | ubuntu-14.04.5-desktop-amd64 | System with 4 GB or more or VirtualBox |
We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.
Syed Muhammad Fahad Akhtar Syed Muhammad Fahad Akhtar has 12+ years of industry experience in analysis, designing, developing, integrating, and managing large applications in different industries. He has vast exposure of working in UAE, Pakistan, and Malaysia and is currently working in ASIT Solutions as a solution architect. He received his master’s from Torrens University, Australia, and bachelor of science in computer engineering from National University of Computer and Emerging Sciences (FAST), Pakistan.
Click here if you have any feedback or suggestions.
If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.