Skip to content

raggy-rs/hhsetup

Repository files navigation

How to setup a Mapreduce Hbase Cluster

1) create instance https://openstack.risc-software.at
Image Name: Ubuntu Server 12.04 LTS 
Flavor: m1.largediskdrive
Security Groups : <GRP>

Associate Floating IP: <IP>

2) Set passwords install git and clone repository

ssh -i path/to/<GRP> root@<IP>
passwd
passwd cloud
apt-get update
apt-get install git
cd /home/cloud/
mkdir chd5setup
cd chd5setup
git clone <GITREPO>

3) on Master
download ./oracle-java-7-update-45.tar.gz
python 0_newid.py

4) everywhere (including Master):
run install scripts

sudo python 1_ssh.py
sudo python 2_java.py
sudo python 3_network_conf.py
sudo python 4_chd5.py

5) on Master
sudo cp -r /etc/hadoop/conf.empty /etc/hadoop/conf.my_cluster
sudo cp -r /etc/hbase/conf.dist/ /etc/hbase/conf.benchmark_cluster

configure everything according to http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH5-Installation-Guide/CDH5-Installation-Guide.html

6) everywhere(including Master):
run configuration distribution script

sudo python 5_hadoop_conf.py

Format HDFS with:
hdfs namenode -format

start services
for x in `cd /etc/init.d ; ls hadoop-hdfs-*` ; do sudo service $x start ; done

About

Hadoop + Hbase Cluster setup

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published