Automatic deployment, configuration, benchmark execution, and performance reporting.

This project mainly targets hadoop-3.0.0-beta1. We welcome contributions on efficient cluster scheduling and approaches that improve it.
- [optional] Customize setting.yaml according to your needs.
- [optional] Create a soft/hard link in a folder within $PATH.

# sudo ln -s <absolute path>/hbe <folder in $PATH>/<name you like>
#
# example: from inside the hadoop-build-env folder
$ sudo ln -s `pwd`/hbe /usr/bin/hbe

- Run hbe <stage(s)>.
# prepare environment in control-proxy and cluster for all actions
$ hbe init
# install necessary libs in control-proxy for compiling...
$ hbe initcontrolp
# initially compile source code, configure site, distribute binary libs into cluster,
# prepare runtime environment for cluster
$ hbe initdeploy
# prepare runtime environment for cluster
$ hbe initcluster
# initially compile source code in control-proxy.
# This stage will resolve maven dependencies and download necessary jars.
$ hbe initcompile
# compile source code, configure site, distribute binary libs into cluster
$ hbe deploy
# configure site.xml, worker, hadoop-env.sh and sync into cluster ...
$ hbe config
# compile source code in control-proxy. defaults to hadoop-main.
# params: yapi, yclient, ycommon, yscommon, ysrm, ysnm
$ hbe compile
# add permissions for stage-sync, and also create hdfs dirs ...
$ hbe syncp
# distribute binary libs into cluster. defaults to hadoop-main.
# params: yapi, yclient, ycommon, yscommon, ysrm, ysnm
$ hbe sync
# clean cluster files.
# params: log
$ hbe clean
# runs start-all.sh by default.
# params: yarn, hdfs
$ hbe start
# runs stop-all.sh by default.
# params: yarn, hdfs
$ hbe stop
# submit applications to the cluster.
# The base path of execution is the cluster binary libs path.
$ hbe submit <ins1> <ins2>
# ============================== EXAMPLES ============================== #
$ hbe initcompile # first compile
$ hbe initdeploy # first compile and deploy
$ hbe deploy
$ hbe compile && hbe stop && hbe sync && hbe config && hbe start
$ hbe compile ysrm ysnm
$ hbe sync ysrm ysnm
$ hbe clean log
$ hbe submit "./bin/hadoop jar ./share/hadoop/mapreduce/hadoop-mapreduce-examples-3.0.0-beta1.jar pi -Dmapreduce.job.num-opportunistic-maps-percent='100' 50 50" "./bin/hadoop jar ./share/hadoop/mapreduce/hadoop-mapreduce-examples-3.0.0-beta1.jar pi -Dmapreduce.job.num-opportunistic-maps-percent='50' 100 100"
PSEUDO_DIS_MODE
run, compile, benchmark, and all other actions happen only on your dev PC.
run|compile|bench|report
user ------> |__|
control-proxy-pc
FULLY_DIS_MODE
compile and report viewing happen only on your dev PC.
run, benchmark, and performance logging happen on the cluster PCs.
run jobs/benchmark
compile|report deploy |_|_|_|_|_|
user ------> |__| -----------------> |_|_|_|_|_|
control-proxy-pc |_|_|_|_|_|
*.jar cluster
Rules:
- Put *.py into ./scripts/ and *.sh into ./utilities/.
- Customized python files need to inherit basis.py and override its action() method.
- Define a trigger function to support automatic execution.
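The rules above can be sketched as a minimal custom stage script. This is a hypothetical example: the actual base class exported by basis.py, its action() signature, and how hbe invokes the trigger function are assumptions here and may differ in the real code, so the Basis class below is only a stand-in.

```python
# Hypothetical sketch of a custom stage script following the rules above.
# Assumption: basis.py provides a base class (called Basis here) whose
# action() method each custom stage overrides; the real API may differ.

class Basis:
    """Stand-in for the base class assumed to live in basis.py."""

    def run(self):
        # Delegate to the subclass-provided action() implementation.
        return self.action()

    def action(self):
        raise NotImplementedError("subclasses must override action()")


class CleanLogsStage(Basis):
    """Example custom stage: pretend to clean cluster log files."""

    def action(self):
        # A real implementation would remove logs on the cluster nodes.
        return "cleaned logs"


def trigger():
    """Entry point that hbe would call to run this stage automatically."""
    return CleanLogsStage().run()


print(trigger())  # -> cleaned logs
```

Dropping such a file into ./scripts/ (with the matching shell helpers in ./utilities/) is what lets hbe pick the stage up and run it without extra wiring.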