Name	Name	Last commit message	Last commit date
Latest commit History 19 Commits
config	config
Dockerfile	Dockerfile
LICENSE	LICENSE
README.md	README.md

Name

Last commit message

Last commit date

config

Dockerfile

LICENSE

README.md

Supported tags and respective `Dockerfile` links

What is Hadoop?

The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing.

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

http://hadoop.apache.org/

What is Docker?

Docker is an open platform for developers and sysadmins to build, ship, and run distributed applications. Consisting of Docker Engine, a portable, lightweight runtime and packaging tool, and Docker Hub, a cloud service for sharing applications and automating workflows, Docker enables apps to be quickly assembled from components and eliminates the friction between development, QA, and production environments. As a result, IT can ship faster and run the same app, unchanged, on laptops, data center VMs, and any cloud.

https://www.docker.com/whatisdocker/

What is a Docker Image?

Docker images are the basis of containers. Images are read-only, while containers are writeable. Only the containers can be executed by the operating system.

https://docs.docker.com/terms/image/

Dependencies

Install Docker

Base Docker image

gelog/java:openjdk7

How to use this image?

Starting the namenode

docker run -ti --name namenode -h namenode -v /data:/data -p 50070:50070 gelog/hadoop:2.3.0
hdfs namenode -format
mkdir -p /usr/local/hadoop-2.3.0/logs
/usr/local/hadoop/sbin/hadoop-daemon.sh start namenode

Starting a secondary namenode

docker run -ti --name secnamenode -h secnamenode -v /data:/data -p 50090:50090 gelog/hadoop:2.3.0
mkdir -p /usr/local/hadoop-2.3.0/logs
/usr/local/hadoop/sbin/hadoop-daemon.sh start secondarynamenode

Starting a datanode

docker run -ti --name datanode -h datanode -v /data:/data -p 50080:50080 gelog/hadoop:2.3.0
mkdir -p /usr/local/hadoop-2.3.0/logs
/usr/local/hadoop/sbin/hadoop-daemon.sh start datanode

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Supported tags and respective `Dockerfile` links

What is Hadoop?

What is Docker?

What is a Docker Image?

Dependencies

Base Docker image

How to use this image?

Starting the namenode

Starting a secondary namenode

Starting a datanode

About

Uh oh!

Releases

Packages

Contributors 8

Uh oh!

Languages

License

bigdatafoundation/docker-hadoop

Folders and files

Latest commit

History

Repository files navigation

Supported tags and respective Dockerfile links

What is Hadoop?

What is Docker?

What is a Docker Image?

Dependencies

Base Docker image

How to use this image?

Starting the namenode

Starting a secondary namenode

Starting a datanode

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 8

Uh oh!

Languages

Supported tags and respective `Dockerfile` links

Packages