Moving Hadoop to the Cloud: Harnessing Cloud Features and Flexibility for Hadoop Clusters

Moving Hadoop to the Cloud: Harnessing Cloud Features and Flexibility for Hadoop Clusters
Author: Bill Havanki
Pub Date: 2017
ISBN: 978-1-491-95961-9
Pages: 300
Language: English
Format: PDF/EPUB/MOBI (Early Release)
Size: 10 Mb

Download

Up until recently, Hadoop deployments have existed on hardware owned and run by organizations, often alongside legacy “big-iron” hardware. Today, cloud service providers allow customers to effectively rent hardware and associated network connectivity, along with a variety of other features like databases and bulk storage.
But installing a Hadoop cluster on a public cloud service is not as straightforward as it may appear. This practical book shows you how to install these clusters in a way that harmonizes with public cloud service features, and examine ways to use and manage them efficiently.
You’ll learn how to architect clusters in a way that works with the features of the provider, not only to avoid potential pitfalls, but also to take full advantage of what the services can do. A cluster installed in a suboptimal fashion will run slower and cost more than expected, which can defeat the goals of moving to the service in the first place.

+

Table of Contents

1. Why Hadoop in the Cloud?
2. Instances
3. Networking and Security
4. Setting up in AWS
5. Standing up a Cluster