The document outlines a practical exercise involving the execution of a Hadoop MapReduce job using a WordCount program. It details the steps taken to set up the environment, create input files, run the job, and retrieve output results. Additionally, it includes the Java code for the WordCount program, which processes text input to count word occurrences.
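The map → shuffle → reduce pipeline this exercise demonstrates can be sketched in plain Java with no Hadoop dependency; the two sample lines below are hypothetical, purely for illustration:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// A local, single-process sketch of the MapReduce word-count dataflow
// (map -> shuffle/group -> reduce). Illustration only, no Hadoop involved.
public class WordCountSketch {
    public static void main(String[] args) {
        String[] lines = { "Hello world", "Hello Hadoop" }; // hypothetical input

        // Map phase: emit a (word, 1) pair for every token, uppercased.
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines)
            for (String w : line.trim().split("\\s+"))
                pairs.add(Map.entry(w.toUpperCase(), 1));

        // Shuffle + reduce phase: group the pairs by key and sum the values.
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, Integer> p : pairs)
            counts.merge(p.getKey(), p.getValue(), Integer::sum);

        System.out.println(counts); // {HADOOP=1, HELLO=2, WORLD=1}
    }
}
```

On a cluster, the "group by key and sum" step is what Hadoop performs between the Mapper and Reducer tasks shown in the WordCount.java source later in this document.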
DSBDA Group B 1
PRACTICAL-11
Name: Vishal Dattatraya Doke
Roll No: 16 Batch: T1
Microsoft Windows [Version 10.0.19045.5608]
(c) Microsoft Corporation. All rights reserved.

C:\WINDOWS\system32>start-all.cmd
This script is Deprecated. Instead use start-dfs.cmd and start-yarn.cmd
starting yarn daemons

C:\WINDOWS\system32>jps
2656 NodeManager
7216 ResourceManager
6724 NameNode
6836 DataNode
10952 Jps

C:\WINDOWS\system32>hadoop fs -mkdir /input

C:\WINDOWS\system32>hadoop fs -put C:\Users\Vishal\Documents\FILES\input1.txt /input

C:\WINDOWS\system32>hadoop fs -ls /input
Found 1 items
-rw-r--r--   1 VISHAL DOKE supergroup         80 2025-04-09 03:45 /input/input1.txt

C:\WINDOWS\system32>hadoop jar C:\Users\Vishal\Documents\JARFILE\MapReduceWordCount.jar com.mapreduce.wc.WordCount /input/input1.txt /output
2025-04-07 13:45:33,092 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
2025-04-07 13:45:34,309 INFO mapreduce.JobResourceUploader: Disabling Erasure Coding for path: /tmp/hadoop-yarn/staging/Admin/.staging/job_1744008556181_0001
2025-04-07 13:45:34,924 INFO input.FileInputFormat: Total input files to process : 1
2025-04-07 13:45:35,462 INFO mapreduce.JobSubmitter: number of splits:1
2025-04-07 13:45:35,965 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1744008556181_0001
2025-04-07 13:45:35,967 INFO mapreduce.JobSubmitter: Executing with tokens: []
2025-04-07 13:45:36,183 INFO conf.Configuration: resource-types.xml not found
2025-04-07 13:45:36,183 INFO resource.ResourceUtils: Unable to find 'resource-types.xml'.
2025-04-07 13:45:36,626 INFO impl.YarnClientImpl: Submitted application application_1744008556181_0001
2025-04-07 13:45:36,673 INFO mapreduce.Job: The url to track the job: http://DESKTOP-0729C31:8088/proxy/application_1744008556181_0001/
2025-04-07 13:45:36,675 INFO mapreduce.Job: Running job: job_1744008556181_0001
2025-04-07 13:45:48,957 INFO mapreduce.Job: Job job_1744008556181_0001 running in uber mode : false
2025-04-07 13:45:48,962 INFO mapreduce.Job:  map 0% reduce 0%
2025-04-07 13:45:54,080 INFO mapreduce.Job:  map 100% reduce 0%
2025-04-07 13:46:08,266 INFO mapreduce.Job:  map 100% reduce 100%
2025-04-07 13:46:09,292 INFO mapreduce.Job: Job job_1744008556181_0001 completed successfully
2025-04-07 13:46:09,391 INFO mapreduce.Job: Counters: 54
	File System Counters
		FILE: Number of bytes read=129
		FILE: Number of bytes written=478023
		FILE: Number of read operations=0
		FILE: Number of large read operations=0
		FILE: Number of write operations=0
		HDFS: Number of bytes read=183
		HDFS: Number of bytes written=68
		HDFS: Number of read operations=8
		HDFS: Number of large read operations=0
		HDFS: Number of write operations=2
		HDFS: Number of bytes read erasure-coded=0
	Job Counters
		Launched map tasks=1
		Launched reduce tasks=1
		Data-local map tasks=1
		Total time spent by all maps in occupied slots (ms)=3174
		Total time spent by all reduces in occupied slots (ms)=9861
		Total time spent by all map tasks (ms)=3174
		Total time spent by all reduce tasks (ms)=9861
		Total vcore-milliseconds taken by all map tasks=3174
		Total vcore-milliseconds taken by all reduce tasks=9861
		Total megabyte-milliseconds taken by all map tasks=3250176
		Total megabyte-milliseconds taken by all reduce tasks=10097664
	Map-Reduce Framework
		Map input records=7
		Map output records=8
		Map output bytes=107
		Map output materialized bytes=129
		Input split bytes=103
		Combine input records=0
		Combine output records=0
		Reduce input groups=6
		Reduce shuffle bytes=129
		Reduce input records=8
		Reduce output records=6
		Spilled Records=16
		Shuffled Maps =1
		Failed Shuffles=0
		Merged Map outputs=1
		GC time elapsed (ms)=70
		CPU time spent (ms)=996
		Physical memory (bytes) snapshot=508809216
		Virtual memory (bytes) snapshot=749785088
		Total committed heap usage (bytes)=362283008
		Peak Map Physical memory (bytes)=304926720
		Peak Map Virtual memory (bytes)=426901504
		Peak Reduce Physical memory (bytes)=203882496
		Peak Reduce Virtual memory (bytes)=322883584
	Shuffle Errors
		BAD_ID=0
		CONNECTION=0
		IO_ERROR=0
		WRONG_LENGTH=0
		WRONG_MAP=0
		WRONG_REDUCE=0
	File Input Format Counters
		Bytes Read=80
	File Output Format Counters
		Bytes Written=68

C:\Windows\system32>hadoop dfs -cat /output/*
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
LAPTOP	1
MAHARASHTRA	2
SUBSCRIBERS	1
TECHNICAL	1
VISHAL	2
WINDOWS	1

C:\Windows\system32>hadoop dfs -get /output/part-r-00000 C:\Users\Admin\Documents\FILES\textfile.txt
DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.

C:\Windows\system32>hadoop fs -rm -r /input/input1.txt
Deleted /input/input1.txt

C:\Windows\system32>hadoop fs -rm -r /output
Deleted /output

C:\Windows\system32>stop-all.cmd
This script is Deprecated. Instead use stop-dfs.cmd and stop-yarn.cmd
SUCCESS: Sent termination signal to the process with PID 696.
SUCCESS: Sent termination signal to the process with PID 14080.
stopping yarn daemons
SUCCESS: Sent termination signal to the process with PID 7240.
SUCCESS: Sent termination signal to the process with PID 10956.
INFO: No tasks running with the specified criteria.
C:\Windows\system32>

**********************************************************************************
input1.txt

Technical Windows Vishal Subscribers Maharashtra laptop Vishal Maharashtra

**********************************************************************************
WordCount.java

package com.mapreduce.wc;

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.util.GenericOptionsParser;

public class WordCount {

    public static void main(String[] args) throws Exception {
        Configuration c = new Configuration();
        String[] files = new GenericOptionsParser(c, args).getRemainingArgs();

        // Ensure correct input arguments
        if (files.length < 2) {
            System.err.println("Usage: WordCount <input path> <output path>");
            System.exit(-1);
        }

        Path input = new Path(files[0]);
        Path output = new Path(files[1]);

        Job j = Job.getInstance(c, "wordcount");
        j.setJarByClass(WordCount.class);
        j.setMapperClass(MapForWordCount.class);
        j.setReducerClass(ReduceForWordCount.class);
        j.setOutputKeyClass(Text.class);
        j.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(j, input);
        FileOutputFormat.setOutputPath(j, output);

        System.exit(j.waitForCompletion(true) ? 0 : 1);
    }

    // Mapper Class
    public static class MapForWordCount extends Mapper<LongWritable, Text, Text, IntWritable> {
        private final static IntWritable one = new IntWritable(1);
        private Text wordText = new Text();

        @Override
        public void map(LongWritable key, Text value, Context con)
                throws IOException, InterruptedException {
            String line = value.toString().trim();
            String[] words = line.split("\\s+"); // Handles multiple spaces
            for (String word : words) {
                if (!word.isEmpty()) { // Avoid empty strings
                    wordText.set(word.trim().toUpperCase());
                    con.write(wordText, one);
                }
            }
        }
    }

    // Reducer Class
    public static class ReduceForWordCount extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        public void reduce(Text word, Iterable<IntWritable> values, Context con)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable value : values) {
                sum += value.get();
            }
            con.write(word, new IntWritable(sum));
        }
    }
}

**********************************************************************************
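As a sanity check, the mapper's tokenization (trim, split on whitespace, uppercase) can be replayed on the words of input1.txt in plain Java, with no Hadoop installation. The result agrees with the `hadoop fs -cat /output/*` listing above and with the job counters (8 map output records, 6 reduce output records):

```java
import java.util.Map;
import java.util.TreeMap;

// Local re-check of the job's result: apply the same tokenization rules as
// MapForWordCount to the words of input1.txt and print the summed counts.
public class VerifyWordCount {
    public static void main(String[] args) {
        // The 8 words of input1.txt as listed above; the file's exact line
        // layout is not shown, but line breaks do not change the counts.
        String input = "Technical Windows Vishal Subscribers Maharashtra laptop Vishal Maharashtra";

        Map<String, Integer> counts = new TreeMap<>(); // sorted, like part-r-00000
        for (String w : input.trim().split("\\s+"))
            counts.merge(w.toUpperCase(), 1, Integer::sum);

        // Prints LAPTOP 1, MAHARASHTRA 2, SUBSCRIBERS 1, TECHNICAL 1,
        // VISHAL 2, WINDOWS 1 -- matching the HDFS output above.
        counts.forEach((w, n) -> System.out.println(w + "\t" + n));
    }
}
```

Reduce output keys arrive sorted because the MapReduce framework sorts keys during the shuffle, which is why a TreeMap reproduces the order of part-r-00000 here.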