Skip to content

harshitjain007/duedil

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Duedil

This repo is part of a duedil data engineer assignment, whose problem statement can be found in the repo itself.

Prerequisites

Following software should be installed on the system and their locations should be inlcuded in the system PATH -

  1. Oracle Java 1.8
  2. Apache Spark
  3. Apache Maven

Build instructions

Goto the directory containing pom.xml and run the following command-

mvn clean install

This will create a new directory target that will contain the built jar.

Run instructions

Run the following command to start program execution-

spark-submit --class "app.Driver" --master local[4] path/to/jar path/to/input path/to/output1 path/to/output2 path/to/output3

Test data is present in the resource directory under src/main. Run the following command to run unit tests -

spark-submit --class "testing.TestDriver" --master local[4] path/to/jar path/to/test_input path/to/expected_output1 path/to/expected_output2 path/to/expected_output3

Authors

Harshit Jain

Acknowledgement

End

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages