
Abhilash G

Big Data Engineer


E: [email protected]
Ph: 913-802-2191
LinkedIn: https://www.linkedin.com/in/abhilash-g-a7027b18b/

Summary:
 Over 9 years of IT experience in the analysis, design, development, and implementation of applications running on various platforms.
 Good hands-on experience developing big data projects using Hadoop, Hive, Spark, and MapReduce open-source tools/technologies.
 Hands-on experience with Spark 3.0.2.
 Hands-on experience in Python/PySpark programming on Cloudera, Hortonworks, and Azure Databricks Hadoop clusters, AWS EMR clusters, AWS Lambda functions, and CloudFormation templates (CFTs).
 Wrote Airflow DAGs in Python to schedule and orchestrate jobs.
 Created clusters in Azure Databricks to run Python notebooks.
 Implemented AWS solutions using EC2, S3, RDS, EBS, Elastic Load Balancing, Auto Scaling groups, and the AWS CLI.
 Managed scalable Hadoop clusters, including cluster design, provisioning, custom configuration, monitoring, and maintenance, across different distributions: Cloudera CDH, Hortonworks HDP, and Databricks.
 Good working experience using Python to develop a custom framework for generating rules (similar to a rules engine). Developed Hadoop Streaming jobs in Python to integrate applications with Python API support.
 Good experience with AWS Elastic Block Store (EBS), its different volume types, and choosing the appropriate volume type based on requirements.
 Configured Databricks clusters to auto-terminate after a set period of inactivity.
 Created Python scripts to start and stop clusters based on cluster usage (see the sketch after this list).
 Implemented a variety of AWS compute and networking services to meet application needs.
 Excellent working knowledge of object-oriented programming (OOP) principles, design, and development, with a good understanding of concepts such as data abstraction, concurrency, synchronization, multithreading and thread communication, networking, and security.
 Developed ETL processes with slowly changing dimensions (SCDs), caches, and complex joins using optimized SQL queries.
 Knowledge of relational databases (DB2, MS SQL Server, Teradata, Oracle 8i/9i/10g/11i).
 Experience working in Agile environments.
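Below is a minimal sketch of the kind of cluster start/stop script mentioned above, written against the Databricks Clusters REST API 2.0; the workspace URL, token environment variables, and cluster ID are illustrative placeholders rather than values from any client environment.

# Sketch: start or terminate a Databricks cluster through the Clusters REST API 2.0.
# DATABRICKS_HOST and DATABRICKS_TOKEN are assumed environment variable names.
import os
import requests

HOST = os.environ["DATABRICKS_HOST"]      # e.g. https://<workspace>.azuredatabricks.net
TOKEN = os.environ["DATABRICKS_TOKEN"]    # personal access token
HEADERS = {"Authorization": f"Bearer {TOKEN}"}

def start_cluster(cluster_id):
    # POST /api/2.0/clusters/start brings a terminated cluster back up.
    resp = requests.post(f"{HOST}/api/2.0/clusters/start",
                         headers=HEADERS, json={"cluster_id": cluster_id})
    resp.raise_for_status()

def stop_cluster(cluster_id):
    # POST /api/2.0/clusters/delete terminates the cluster (it is not permanently deleted).
    resp = requests.post(f"{HOST}/api/2.0/clusters/delete",
                         headers=HEADERS, json={"cluster_id": cluster_id})
    resp.raise_for_status()

if __name__ == "__main__":
    stop_cluster("0123-456789-abcd123")   # hypothetical cluster ID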

Technical Skills:

Programming Technologies: C, C++, Java 8, Scala

Frameworks: Java, Spring, Jersey, JavaScript, CSS, LESS, HTML5, jQuery, Apache CXF, AngularJS, Jasmine

Big Data Technologies: Apache Spark 1.6/2.2, HDFS, Amazon S3, YARN, Apache Oozie, Apache Hive, Cloudera Impala, Apache Cassandra

Markups: HTML, CSS, XML, XSL

Storage Technologies: SQL, PL/SQL, Stored Procedures, Triggers, CQL, HiveQL, Parquet

Operating Systems: Microsoft Windows 2000/XP/Vista/7, Unix, Linux, OS X

Professional Experience
Client: Amgen, Thousand Oaks, CA January 2021 – Present
Lead Big Data Engineer
Responsibilities:

 Developed and executed PySpark-based data ingestion framework jobs covering diverse source systems.
 Orchestrated data pipeline executions through Databricks Scheduler.
 Engaged in business requirement collection, analysis, and the conceptualization of data products.
 Crafted, validated, and managed data pipelines, integrating sources such as Smartsheet, Excel, and databases to produce end data products.
 Leveraged Spark with Python, employing DataFrames, Datasets, and Spark SQL API for expedited data
processing.
 Authored scripts for secure password management of service accounts via AWS Secrets Manager.
 Streamlined real-time data ingestion from various systems utilizing AWS Data Migration Service and
Kinesis.
 Implemented file-based and batch job ingestion scripts.
 Managed cloud operations on Azure platforms, including Data Lake, Databricks, and Blob storage.
 Executed Spark SQL queries for application data validation.
 Created and maintained Databricks Notebooks with PySpark for data cleansing post-ingestion.
 Ran VACUUM jobs to remove data files no longer referenced by Delta tables.
 Automated cluster termination scripts triggered by inactivity.
 Devised data ingestion scripts that pull files from Box locations, orchestrated with Airflow DAGs.
 Ingested, transformed, and stored batch files and database tables in Parquet and Delta formats (see the sketch after this list).
 Integrated CI/CD pipelines for deploying Python code from Git repositories to Databricks.
 Retrieved data from third-party websites through API calls.
 Led multiple project deliveries as a Team Lead.
 Automated Databricks notebooks using Spark SQL and Python for pipeline executions.
 Configured Spark clusters and optimized high concurrency clusters in Azure Databricks for efficient
data preparation.
 Utilized SQL, Python/PySpark, and relational databases for data querying and management across
various database systems.
 Integrated Python code with Plotly Dash apps for hosting in the Tableau environment.
 Composed Python scripts for Plotly Dash applications.
 Validated application data with Spark SQL queries.
 Wrote Spark code in PySpark to improve data processing speeds.
 Operated Azure cloud systems, handling data ingestion, transformation, and storage in Azure Data
Lake.
 Processed data from AWS S3 into Databricks Notebooks.
 Developed utility code for AWS S3 data ingestion using boto3 functions.
 Employed AWS EC2 and S3 services for handling smaller datasets.
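A minimal sketch of the batch ingestion pattern referenced above (an S3 landing file cleansed with PySpark and stored in Delta format); the bucket names, paths, and columns are hypothetical placeholders, not client systems.

# Sketch: ingest a batch CSV drop from S3, apply light cleansing, and store it as Delta.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("batch_ingest_sketch").getOrCreate()

raw = (spark.read
       .option("header", "true")
       .option("inferSchema", "true")
       .csv("s3a://example-landing-bucket/sales/2021-06-01/"))   # placeholder path

cleansed = (raw.dropDuplicates()
               .withColumn("ingest_ts", F.current_timestamp())
               .withColumn("ingest_date", F.current_date()))

# Append to a Delta location partitioned by load date (Delta Lake is available on Databricks).
(cleansed.write
 .format("delta")
 .mode("append")
 .partitionBy("ingest_date")
 .save("s3a://example-curated-bucket/delta/sales"))               # placeholder path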

Client: Evergy, Kansas City, MO October 2017 – December 2020


Hadoop Developer

Responsibilities:

 Migrated complex MapReduce programs and Hive scripts into Spark RDD transformations and actions.
 Used Spark API over Cloudera Hadoop Yarn to perform analytics on data in Hive.
 Exported batch files into AWS S3 using MapReduce jobs.
 Developed Spark applications using the Spark RDD, Spark SQL, and DataFrame APIs.
 Analyzed HBase data in Hive by creating external partitioned and bucketed tables.
 Used Apache Hue web interface to monitor the Hadoop cluster and run the jobs.
 Used Oozie scheduler to submit workflows.
 Integrated Oozie with the rest of the Hadoop stack, supporting several types of Hadoop jobs out of the box (such as MapReduce, Spark, Hive, and Sqoop) as well as system-specific jobs (such as Python programs and shell scripts).
 Performance-tuned and debugged existing ETL processes.
 Wrote Python scripts to process semi-structured data in formats such as JSON.
 Worked with UNIX shell scripting for enhancing the job performance.
 Spun up different AWS instances, including EC2-Classic and EC2-VPC, using CloudFormation templates.
 Created stored procedures and packages in Oracle as part of the pre- and post-ETL process.
 Exported and imported data into HDFS and Hive using Sqoop.
 Designed and developed ETL code using Informatica mappings to load data from heterogeneous source systems (flat files, XML, MS Access files, Oracle) into an Oracle staging area, then into the data warehouse, and finally into data mart tables for reporting.
 Imported data from AWS S3 into Spark RDDs and performed transformations and actions on them (see the sketch after this list).
 Worked on data processing, transformations, and actions in Spark using Python (PySpark).
 Loaded data into Hive partitioned tables.
 Created reports for the BI team by using Sqoop to export data into HDFS and Hive.
 Managed and reviewed Hadoop log files.
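A minimal sketch of the S3-to-Spark-RDD flow described above, ending with a load into a Hive partitioned table; the bucket, record layout, and table names are assumed for illustration only.

# Sketch: read raw text files from S3 into an RDD, apply transformations and an action,
# then load the parsed data into a partitioned table registered in the Hive metastore.
from pyspark.sql import SparkSession, Row

spark = (SparkSession.builder
         .appName("s3_rdd_sketch")
         .enableHiveSupport()
         .getOrCreate())
sc = spark.sparkContext

lines = sc.textFile("s3a://example-bucket/meter-readings/2019/*.txt")   # placeholder path

# Transformations: split pipe-delimited records and keep only well-formed rows.
parsed = (lines.map(lambda line: line.split("|"))
               .filter(lambda fields: len(fields) == 3)
               .map(lambda f: Row(meter_id=f[0], reading=float(f[1]), read_date=f[2])))

print("valid records:", parsed.count())   # action that triggers the computation

# Convert the RDD to a DataFrame and append it to a table partitioned by read_date.
df = spark.createDataFrame(parsed)
df.write.mode("append").partitionBy("read_date").saveAsTable("staging.meter_readings")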

Environment: Apache Spark 1.6.0/2.2.0, Apache Hive, Cloudera Impala, Amazon S3, AWS, Aurora, REST, MySQL 5.6, JUnit, Mockito, Linux, Cloudera 5.x, HBase, Apache Kafka 0.9.x/0.10.x, Swagger, Parquet, Git, IntelliJ IDEA, Apache Oozie, Agile/Scrum, Beeline

Client: CNA, Chicago, IL October 2015 – September 2017
Hadoop Developer

Responsibilities:

 Loaded data into Spark RDDs and performed in-memory computation to generate the output response.
 Developed Spark code in Python to analyze data received from different sources.
 Wrote Spark SQL scripts to optimize query performance.
 Contributed towards building Apache Spark applications using Python.
 Wrote UDFs and MapReduce jobs depending on the specific requirement.
 Created Hive schemas using performance techniques such as partitioning and bucketing (see the sketch after this list).
 Enhanced and optimized production Spark code to aggregate, group, and run data mining tasks using the Spark framework.
 Worked on migrating MapReduce programs into Spark transformations using Spark and PySpark.
 Used PL/SQL to write scripts that performed batch updates to the database and generated reports.
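A minimal sketch of the partitioning and bucketing approach mentioned above, using the DataFrameWriter bucketing API available in Spark 2.x rather than raw HiveQL; the table, column, and path names are illustrative only.

# Sketch: persist claims data as a partitioned, bucketed table to speed up pruning and joins.
from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("partition_bucket_sketch")
         .enableHiveSupport()
         .getOrCreate())

claims = spark.read.parquet("/data/staging/claims")   # placeholder input path

# Partitioning by claim_year enables directory-level pruning; bucketing by policy_id
# co-locates rows that share a key, which helps joins and aggregations on that column.
(claims.write
 .mode("overwrite")
 .partitionBy("claim_year")
 .bucketBy(16, "policy_id")
 .sortBy("policy_id")
 .saveAsTable("analytics.claims_bucketed"))
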
Environment: Hadoop, HDFS, Hive, Oozie, Sqoop, HBase, Flume, Spark, Scala, SQL Server, Eclipse, PyCharm, Maven, JIRA, GitHub, JUnit, Mockito, Linux, Cloudera 5.x, Tomcat, Jenkins, XSLT, XML, JMS, Swagger, Parquet, Git, IntelliJ IDEA

Client: Legacy Health, Portland, OR January 2015 – September 2015


SQL Developer

Responsibilities:

 Involved in Business requirement gathering, Technical Design Documents, Business use cases and Data
mapping.
 Scheduled the SQL jobs and SSIS Packages using Tidal (Enterprise Scheduler).
 Designed a complex SSIS package to transfer data from three different source systems into a single destination, SQL Server 2008.
 Created packages using transformations such as Pivot, Conditional Split, Fuzzy Lookup, and Aggregate, along with Execute SQL and Data Flow tasks, to extract data from different databases and flat files.
 Used SSIS to create ETL packages (.dtsx files) to validate, extract, transform, and load data into data warehouse databases.
 Wrote SQL scripts, executed via Script tasks, to insert, update, and delete data in the SQL database, and created configuration packages using C# scripting.
 Deployed SSIS packages from test to production servers using package configurations.
 Automated SSIS jobs as SQL Server Agent jobs for daily, weekly, and monthly loads.
 Generated a report which identifies the performance efficiency of every component within the modules of
payroll.
 Deployed code into the PROD environment through JIRA (an Atlassian product) based on assigned tasks (business requests).
 Designed and developed applications based on .NET and MS SQL Server.

Environment: MS SQL Server 2005/2008, MS SQL Server Integration Services (SSIS), Visual Studio 2008/2010,
SQL Server Reporting Services (SSRS).

