Choosing a data profession
This document has 2 sections:
On the next page there is an example case where I explain what each
data role would be responsible for in a given project.
In the pages after, there is a more in-depth explanation of each role's
responsibilities.
The data engineer would set up a system so that data analysts and data scientists can reach
and use this data.
The data analyst would show us important insights that we should take into consideration for our
next marketing campaign, for example, the age group of our audience and which social media
channels they hang out in.
The data scientist would create a predictive system that takes as input a client's behaviour and
outputs the probability of them buying a product in a given session. You can read this article if
you want to learn about a data scientist's daily tasks in more detail.
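To make the data scientist's part of the example more concrete, here is a minimal sketch of such a predictive system. It is only an illustration: the feature names (pages_viewed, minutes_on_site, items_in_cart), the weights and the bias are all made up for this example, and a real system would learn them from historical session data rather than have them written by hand.

```python
import math

# Hypothetical weights, chosen by hand for illustration only.
# A real model would learn these from historical session data.
WEIGHTS = {
    "pages_viewed": 0.15,
    "minutes_on_site": 0.05,
    "items_in_cart": 0.8,
}
BIAS = -3.0


def purchase_probability(session: dict) -> float:
    """Return an estimated probability (between 0 and 1) that a session ends in a purchase."""
    # Weighted sum of the behaviour features; missing features count as 0.
    score = BIAS + sum(WEIGHTS[name] * session.get(name, 0.0) for name in WEIGHTS)
    # The logistic function squeezes the score into the range 0 to 1.
    return 1.0 / (1.0 + math.exp(-score))


# Two made-up sessions: one casual visitor and one visitor with a full cart.
browsing = {"pages_viewed": 2, "minutes_on_site": 1, "items_in_cart": 0}
ready_to_buy = {"pages_viewed": 12, "minutes_on_site": 15, "items_in_cart": 3}

print(f"browsing: {purchase_probability(browsing):.2f}")
print(f"ready to buy: {purchase_probability(ready_to_buy):.2f}")
```

The input is the client's behaviour in a session and the output is a single probability, which is the shape of the system described in the example case. Everything in between (which features to use, which model to train) is what the data scientist would decide.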
Even though it is less talked about, one of the most important skills
of data scientists is data manipulation. They know how to alter
and augment the data to achieve the desired outcome. They
know how to deal with problematic data and use statistics for
their purposes.

Data scientists are data experts in the company. It is their
responsibility to decide on the approach to be taken to solve a
problem or address an issue in the most effective way possible.
The approach could be as simple as a graph or as complex as a
multi-year project.

Data scientists' deliverables vary a lot from company to
company and even project to project. The most typical
deliverable is an API developed with the help of a software
engineer that serves predictions or classifications for an
application.

A similar role to data scientist is machine learning engineer.
Depending on the company, an ML engineer might do the same
job as a data scientist, might focus primarily on optimising ML
models or might deal with the deployment of machine learning solutions.
After the senior level, you can choose to go into management. A management position would give you a chance to manage many
projects and data scientists but would be less hands-on. You would also be responsible for the financial parts of a project. Alternatively,
you can choose to become a technical lead in a company, such as a distinguished engineer. This way you would probably still be doing
hands-on work on projects but have less of a say when it comes to financial matters.
Data Engineer
What do they do? What do they deliver? Where can you go from there?
A data engineer is someone who focuses on the engineering
of a data pipeline. A data pipeline is the process of creating
and/or collecting data, accessing it, analysing it, creating
results from it and putting those results to use. Data engineers
focus on the parts of the pipeline other than the analysis.

Their responsibilities are setting up a reliable data
infrastructure and maintaining it so that
whoever needs data can reach it quickly and effectively.
Depending on the company, the challenges might also include
storing unstructured data, moving large amounts of data and
making it accessible for analysis, setting up systems for an
undisturbed flow of data, making sure clean and usable data
is collected from sources, and merging external data sources into
the company's data infrastructure.

Specialised data engineers are currently in demand, so by
specialising in a certain technology you can advance in your
career. You can also choose to switch to development roles
using your knowledge in this area.

Another natural path is to grow into architectural or
management roles. This is because data engineering issues are
important to big companies, and they want people with solid
knowledge of dealing with data at the decision-making levels of the
company.
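The pipeline stages described under "What do they do?" can be sketched in a few lines of code. This is only a toy illustration under stated assumptions: the records, the field names (customer_id, age, channel, region) and the function names are all hypothetical, and an in-memory dictionary stands in for what would be a real database or data warehouse.

```python
# Hypothetical external data source, keyed by customer_id.
EXTERNAL_REGIONS = {1: "EU"}


def collect():
    """Stand-in for pulling raw records from a source system (made-up data)."""
    return [
        {"customer_id": 1, "age": "34", "channel": " Instagram"},
        {"customer_id": 1, "age": "34", "channel": " Instagram"},  # duplicate row
        {"customer_id": 2, "age": "", "channel": "facebook"},      # missing age
        {"customer_id": 3, "age": "27", "channel": "YouTube "},
    ]


def clean(records):
    """Keep only usable rows: drop duplicates and rows without a valid age."""
    seen_ids = set()
    cleaned = []
    for row in records:
        age_text = row["age"].strip()
        if row["customer_id"] in seen_ids or not age_text.isdigit():
            continue
        seen_ids.add(row["customer_id"])
        cleaned.append({
            "customer_id": row["customer_id"],
            "age": int(age_text),
            "channel": row["channel"].strip().lower(),
        })
    return cleaned


def merge_external(records, external_regions):
    """Attach the region from the external source, or "unknown" if it is absent."""
    return [
        {**row, "region": external_regions.get(row["customer_id"], "unknown")}
        for row in records
    ]


def store(records):
    """Stand-in for loading into the data infrastructure: index rows by customer_id."""
    return {row["customer_id"]: row for row in records}


# Collect -> clean -> merge external data -> store.
warehouse = store(merge_external(clean(collect()), EXTERNAL_REGIONS))

# From here on, analysts and data scientists can reach the data.
for customer_id, row in warehouse.items():
    print(customer_id, row)
```

The analysis itself is deliberately left out, since that is the part of the pipeline the data engineer hands over to analysts and data scientists.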