We recently illustrated how some of our customers were using Liquid Web’s extensive resources to leverage big data with Hadoop. While many users could benefit from the data management capabilities Hadoop delivers, finding the right software to launch it and installing Hadoop stacks may give some pause. Using Cloudera, this can be accomplished in just a few simple steps we’ll outline below. [note: There are a number of software options available to run Hadoop, however we have chosen to highlight Cloudera as it offers a great deal of simplicity, ease, and automation for those new to running Hadoop.]
Install Cloudera Manager. Liquid Web cannot install Cloudera on your servers for you due to Cloudera’s Terms of Service. However, this simple installation gives you access to automatic installation of Hadoop stacks, other related software stacks, and nodes (also known as Hadoop clusters) for free.
Choose from Cloudera’s free edition or its fully supported enterprise version if you plan on running a cluster larger than 50 nodes or if you would like additional support from Cloudera. All of the projects you launch will remain open source and can be built from source. As the free edition offers more than enough functionality and ease of use for the average user, we will proceed from here as if you have chosen this option.
Select ‘Use Parcels’, other options for parcels, and ‘Matched release for this Cloudera Manager server’
Provide SSH login credentials by selecting root and allowing all hosts to accept the same password (this can be changed in the future as needed). Once this is confirmed, Cloudera will begin installing your host(s) as well as your clusters. This may take a moment depending on the amount of data you’re importing. Once it has completed, check the list of hosts it generates and make sure everything has completed with no errors. You may run this installation again or go back and check for mistakes if it did not complete properly.
Step 6:Cloudera Manager will now allow you to select which services to install. Based on your needs, you can choose anything from “Core Hadoop” services (if you are concerned with space) to “All Services” (to ensure the functionality of any future endeavor) to your own custom options.
Once you choose the services you want to install on your cluster, the automatic installation of your Hadoop stack will complete. From here you will be taken to your main Cloudera Manager menu where you can begin using Hadoop.
Liquid Web’s Heroic Support is always available to assist customers with this or any other issue. If you need our assistance please contact us:
Toll Free 1.800.580.4985