0% found this document useful (0 votes)

16 views13 pages

MGNM801 Ca2 Final

Uploaded by

astitvaawasthi33

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views13 pages

MGNM801 Ca2 Final

Uploaded by

astitvaawasthi33

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Course Code: MGNM801 Course Title: Business Analytics 1

Course Instructor: Vikas Budhani Academic Task No.:2

Academic Task Title: Online Assignment Date of submission: 25th December,2023

Student Name: Astitva Awasthi Section: Q2345

Student’s Roll No: RQ2345A27 Student’s Reg. No: 12319511

Evaluation Parameters: (Parameters on which student is to be evaluated- To be

mentioned by students as specified at the time of assigning the task by the instructor)

Declaration:
I declare that this Assignment is my individual work. I have not copied it
from any other student’s work or from any other source except where due
acknowledgement is made explicitly in the text, nor has any part been written
for me by any other person.
Evaluator’s comments (For Instructor’s use only)

General observation Scope of improvement Best part

Evaluator’s Signature and Date:

Marks Obtained: _________ Max. Marks: ________
Unit: -4
Part1: Pandas
1. List at least three real-world scenarios where Pandas can be used for
data analysis. Explain the specific use cases in each scenario.
Ans)

Scenario 1: Unmasking Your Customer Personas

Use case: Pandas helps uncover hidden patterns and segment your customer
base into distinct groups with shared characteristics. This lets you craft targeted
campaigns, personalize messaging, and deliver experiences that resonate,
ultimately driving engagement and sales.
Example: -

Scenario 2: Campaign Performance Under the Microscope

Use case: Pandas empowers you to dissect the effectiveness of your marketing
campaigns across different channels. Analyse key metrics like impressions,
clicks, and conversion rates to identify top performers, optimize budget
allocation, and maximize ROI.

Example:
Scenario 3: Predicting Customer Churn Before It's Too Late
Use case: Pandas helps you identify customers at risk of churning (leaving your
brand) based on their past behaviour and purchase patterns. This allows you to
take proactive measures like offering personalized incentives or resolving
potential issues, ultimately decreasing customer loss and boosting lifetime
value.
Example:
2. Describe the primary data structures in Pandas, namely Series and Data
Frame. Explain the differences and use cases for each.
Ans)
Series:
Structure:
▪ One-dimensional array like a list or column in a spreadsheet.
▪ Holds an array of data values and an associated array of labels,
called an index.
Key characteristics:
▪ Can hold any data type, including numbers, strings, dates, and
Booleans.
▪ Data must be homogeneous (all elements of the same type).
▪ Labelled with an index that can be used for selection and
alignment.
Use cases:
▪ Representing a single column of data in a dataset.
▪ Storing time series data or sequences of values. ▪ Creating
simple statistical summaries of data.
Data Frame:
Structure:
▪ Two-dimensional labelled data structure with rows and
columns, resembling a spreadsheet or SQL table.
▪ Can be thought of as a collection of Series objects, each
representing a column.
Key characteristics:
▪ Columns can hold different data types.
▪ Labelled with both row and column indices for flexible
access and manipulation.
Use cases:
▪ Representing tabular datasets with multiple columns and
rows.
▪ Loading and storing data from various file formats (CSV,
Excel, databases).
Performing complex data cleaning, transformation, and analysis tasks.
Feature Series Data Frame
Dimensionality One-dimensional Two-dimensional
Data types Homogeneous Heterogeneous (different types per
column)
Structure Array of values + index Collection of Series objects
(columns)
Use cases Single-column data, Tabular datasets, multiple columns,
sequences and rows

Example:

Part2: NumPy

1. Write a brief description of what NumPy is and why it is important for

scientific computing and data analysis in Python.
Ans)

NumPy, short for Numerical Python, is a powerful open-source library in Python

designed for numerical computing. It provides support for large,
multidimensional arrays and matrices, along with a collection of high-level
mathematical functions to operate on these arrays. NumPy is a fundamental
building block for scientific computing and data analysis in Python, and its
importance stems from several key factors:
Efficient multidimensional arrays:
▪ It introduces the ndarray object, a fundamental data structure
for representing and manipulating large arrays of numbers in
Python. These arrays are more efficient in terms of memory
usage and operations compared to Python's built-in lists.

Foundation for scientific computing:

▪ It serves as the cornerstone for many other scientific and data
analysis libraries in Python, including Pandas, SciPy,
Matplotlib, and scikit-learn.

Comprehensive mathematical functions:

▪ It offers a vast collection of mathematical functions for linear
algebra, Fourier transforms, random number generation, and
more.

 Key reasons for its importance in scientific computing and data analysis:
Performance:
▪ Vectorized operations: NumPy arrays enable you to perform
operations on entire arrays at once, rather than element-by-
element, leading to significant speed gains.
▪ Optimized for numerical computations: NumPy's arrays are
optimized for numerical operations, making them much faster
than Python lists for large datasets.

Foundation for other libraries:

▪ Interoperability: NumPy arrays seamlessly integrate with
other scientific Python libraries, providing a cohesive
ecosystem for data analysis and scientific computing.

Mathematical capabilities:
▪ Comprehensive toolkit: NumPy offers a rich set of
mathematical functions for common tasks in scientific
computing, eliminating the need to write custom code for
many operations.
In essence, NumPy's efficient array structures, fast computations, and extensive
mathematical functions make it an indispensable tool for anyone working with
numerical data in Python, especially in the fields of scientific computing, data
analysis, machine learning, and engineering.
2.Explain the significance of NumPy in terms of performance and
efficiency when working with large datasets and numerical computations.
Ans)
NumPy (Numerical Python) is a powerful library in the Python programming
language that provides support for large, multi-dimensional arrays and matrices,
along with a collection of mathematical functions to operate on these elements.
It is a fundamental package for scientific computing in Python and is widely
used in various domains such as data science, machine learning, signal
processing, and more. The significance of NumPy, particularly in terms of
performance and efficiency when working with large datasets and numerical
computations, can be explained through several key aspects:

1. Array Representation:
• NumPy introduces the ndarray (N-dimensional array) data
structure, which allows for efficient representation of large
datasets. This array is a contiguous block of memory containing
elements of the same type, enabling fast and memory-efficient
operations.
2. Vectorized Operations:
• NumPy provides a set of highly optimized functions that
operate on entire arrays at once, eliminating the need for
explicit looping in Python. This vectorized approach takes
advantage of low-level optimizations in the underlying C and
Fortran code, resulting in significantly faster computations.
3. Broadcasting:
• NumPy allows for implicit element-wise operations on arrays of
different shapes and sizes through a feature called broadcasting.
This enables more concise and readable code, without the need
to explicitly reshape or replicate arrays.
4. Memory Efficiency:
• NumPy arrays are more memory-efficient compared to Python
lists, especially for large datasets. The array's homogeneous
data type ensures that memory is allocated in a contiguous
block, reducing memory overhead, and allowing for better
cache utilization.

5. Integration with low level Languages:

• NumPy is built on top of efficient, low-level libraries such as
BLAS (Basic Linear Algebra Subprograms) and LAPACK
(Linear Algebra Package). These libraries are written in
languages like Fortran and C and are highly optimized for
numerical computations. NumPy seamlessly integrates with
these libraries, providing a high-level interface for users.
6. Parallelization and Multithreading:
• NumPy operations can take advantage of parallelization and
multithreading on supported hardware, which can lead to
significant performance improvements, especially on modern
multicore processors.
7. Extensive Mathematical Functions:
• NumPy includes a comprehensive set of mathematical functions
for linear algebra, Fourier analysis, random number generation,
and more. These functions are implemented in highly efficient
C and Fortran code, contributing to the overall performance of
numerical computations.
8. Interoperability:
• NumPy provides seamless interoperability with other libraries
and tools in the scientific computing ecosystem, such as SciPy,
pandas, and scikit-learn. This interoperability allows users to
leverage the strengths of each library for different aspects of
their work.
Unit: -5 Data Visualization:
1. Create a Matplotlib bar plot showing the sales of products in a store for a
given month. Label the axes, add a title, and customize the appearance
(e.g., colour, width).

Output:-
2.Provide at least three examples of data visualization scenarios where
Seaborn is the preferred library over Matplotlib. Describe the type of plots
or charts involved and why Seaborn is a better choice.
Ans)
1. Statistical Relationships
 Plot Type: lmplot, joint plot, pair plot
 Scenario: When exploring relationships between variables or performing
regression analysis, Seaborn's specialized functions make it simpler to
create visualizations that include regression lines, scatter plots with trend
lines, and distribution plots. Seaborn's lmplot and joint plot provide built-
in functionalities for visualizing linear relationships between variables,
along with additional features like adding regression lines, confidence
intervals, and kernel density estimation.
 Why Seaborn: Seaborn streamlines the process of creating complex
statistical visualizations by providing convenient high-level functions that
directly handle these tasks, making it easier to visualize relationships in
data without the need for extensive customization.
2.Categorical Data Analysis
 Plot Type: cat plot, boxplot, violin plot
 Scenario: Analysing categorical variables involves visualizing distributions,
relationships, or comparisons across categories. Seaborn's cat plot, boxplot,
and violin plot functions offer a concise way to display categorical data
distributions, especially when dealing with multiple categories or
comparing distributions across different groups.
 Why Seaborn: Seaborn provides specialized functions specifically designed
for categorical data visualization, offering better aesthetics, flexibility, and
ease of use compared to manually customizing Matplotlib plots for
categorical data analysis.
3.Distribution Visualization
 Plot Type: distplot, kdeplot, rug plot
 Scenario: Visualizing distributions of variables is crucial in understanding
the underlying data patterns. Seaborn's distplot, kdeplot, and rug plot allow
easy plotting of univariate distributions, kernel density estimations, and rug
plots to represent individual data points on a distribution axis.
 Why Seaborn: Seaborn simplifies the creation of distribution plots by
providing intuitive functions that handle both the creation of the histogram-
like representation and the estimation of the underlying probability density
function (PDF) simultaneously, offering a more streamlined approach
compared to Matplotlib.
Additional advantages of Seaborn:
 Aesthetically pleasing defaults: Seaborn's default styles and colour palettes
create visually appealing and informative plots.
 Close integration with Pandas: Seaborn works effortlessly with Pandas Data
Frames, making it convenient for data analysis workflows.
Focus on statistical visualization: Seaborn is designed to create informative
statistical graphics, making it a valuable tool for data exploration and
communication.

Unit: -6
Describe the three key structures in Plotly:
1.Figure, Data, and Layout. Explain the purpose of each structure in creating
visualizations.
Ans)
- The key structures in Plotly and their purposes in creating visualizations:
Figure -
The overall container, which houses all the visualization's components, including
the data and layout.
Serves as a canvas: This is where the visual components are put together and
coordinated.
Crucial to interaction: it makes functions like panning, zooming, and hovering
over data points possible.
Data –
The major component of the visual aid: It contains the real data that you wish to
visualize.
Several traces: Multiple traces (data sets) can be included in a figure, and each
one can be seen as a separate visual entity (e.g., lines, bars, scatter points).
Trace-specific properties: A trace's look can be defined by its own attributes, such
as type, name, mode, marker style, line style, etc.
Layout –
Manages visual presentation: It oversees the visualization's non-data components,
including titles, labels, and annotations.
Gridlines and axes
Legend and colour bar o Margins and spacing
Colour and style of the background Collaborating Together:
Figure orchestrates: It combines layout and data to provide the entire
representation.
Information offers content: The visual elements are formed from this raw
material.
Context is created by layout: It sets the general look and feel, provides labels and
annotations, and arranges the visual elements.

2.Load a sales dataset with columns 'Sales,' create a Plotly line chart to
visualize the total sales trend. Include axis labels, a title, and customize the
appearance.

Aesthetic and Regenative Gynecology
No ratings yet
Aesthetic and Regenative Gynecology
12 pages
Data Analysis With Python & Pandas
100% (2)
Data Analysis With Python & Pandas
378 pages
Net Cafe Project
No ratings yet
Net Cafe Project
20 pages
Python Ca22
No ratings yet
Python Ca22
14 pages
Python CA2
No ratings yet
Python CA2
11 pages
Saurabh mgnm801 Ca2
No ratings yet
Saurabh mgnm801 Ca2
13 pages
Unit 5
No ratings yet
Unit 5
27 pages
Exploring The Power of Data Manipulation and Analysis - A Comprehensive Study of NumPy, SciPy, and Pandas
No ratings yet
Exploring The Power of Data Manipulation and Analysis - A Comprehensive Study of NumPy, SciPy, and Pandas
23 pages
tool and lib in Data Science
No ratings yet
tool and lib in Data Science
32 pages
Data Science Tools
No ratings yet
Data Science Tools
2 pages
Report
No ratings yet
Report
18 pages
fds_merged (3) (1)
No ratings yet
fds_merged (3) (1)
102 pages
UNIT-6(Data Analytics and Visualization With Python)
No ratings yet
UNIT-6(Data Analytics and Visualization With Python)
41 pages
Final Fds Manual
No ratings yet
Final Fds Manual
77 pages
Introduction to NumPy & Pandas
No ratings yet
Introduction to NumPy & Pandas
12 pages
Python Abstract
No ratings yet
Python Abstract
7 pages
ML File Updated
No ratings yet
ML File Updated
60 pages
Programming For Data Science
No ratings yet
Programming For Data Science
48 pages
Final Fds Manual Print
No ratings yet
Final Fds Manual Print
55 pages
FINAL FDS MANUAL print
No ratings yet
FINAL FDS MANUAL print
55 pages
unit 5
No ratings yet
unit 5
28 pages
Unit 1
100% (1)
Unit 1
69 pages
Data Science 1
No ratings yet
Data Science 1
3 pages
data science
No ratings yet
data science
10 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
EXP1-siddhant gupta (23_SE_148)
No ratings yet
EXP1-siddhant gupta (23_SE_148)
17 pages
chapter 3 numpy data analysis
No ratings yet
chapter 3 numpy data analysis
21 pages
DATA ANALYSIS USING PYTHON2
No ratings yet
DATA ANALYSIS USING PYTHON2
27 pages
Data Science I: Charles C.N. Wang
No ratings yet
Data Science I: Charles C.N. Wang
68 pages
suraj report file
No ratings yet
suraj report file
17 pages
NUMPY - INTRODUCTION
No ratings yet
NUMPY - INTRODUCTION
118 pages
22mbada303 Module 4
No ratings yet
22mbada303 Module 4
32 pages
fods_final_done
No ratings yet
fods_final_done
67 pages
Data Science using Python_ Introduction
No ratings yet
Data Science using Python_ Introduction
6 pages
FDS LAB
No ratings yet
FDS LAB
43 pages
lab2report
No ratings yet
lab2report
6 pages
PPS - Unit 5 (Imp Topics)
No ratings yet
PPS - Unit 5 (Imp Topics)
7 pages
Python For Data Analysis
No ratings yet
Python For Data Analysis
49 pages
UNIT 2
No ratings yet
UNIT 2
38 pages
Data Science lecture 5 6th semster
No ratings yet
Data Science lecture 5 6th semster
3 pages
Chapter 04 Advanced Use of Python Libraries for AI and Data Science
No ratings yet
Chapter 04 Advanced Use of Python Libraries for AI and Data Science
179 pages
jjkjk
No ratings yet
jjkjk
10 pages
Data Science Notes
No ratings yet
Data Science Notes
13 pages
Machine Learning Lecture2
No ratings yet
Machine Learning Lecture2
38 pages
Python For Data Science
No ratings yet
Python For Data Science
22 pages
NumPy Essentials - Sample Chapter
50% (2)
NumPy Essentials - Sample Chapter
16 pages
DAL EXT 1 and 2
No ratings yet
DAL EXT 1 and 2
125 pages
Ip Project Class Xii
No ratings yet
Ip Project Class Xii
31 pages
FDSA LAB MANUAL
No ratings yet
FDSA LAB MANUAL
53 pages
DS1
No ratings yet
DS1
20 pages
FDS_LAB_MANUAL (1)
No ratings yet
FDS_LAB_MANUAL (1)
62 pages
lab manual fds
No ratings yet
lab manual fds
44 pages
Usage of NumPy for Numerical Data in Detail
No ratings yet
Usage of NumPy for Numerical Data in Detail
52 pages
Fds Record
No ratings yet
Fds Record
69 pages
Ass1 DSBDA Writeup
No ratings yet
Ass1 DSBDA Writeup
8 pages
fdsa lab manual final
No ratings yet
fdsa lab manual final
70 pages
Fundamentals of Data Science Students
No ratings yet
Fundamentals of Data Science Students
52 pages
Numpy User
No ratings yet
Numpy User
659 pages
Ty B Tech - Bda - Ai315 - Lab Manual
No ratings yet
Ty B Tech - Bda - Ai315 - Lab Manual
52 pages
The Numpy Pocketbook: Essentials on the Go
From Everand
The Numpy Pocketbook: Essentials on the Go
Silas Meadowlark
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
Python Data Structures Explained: A Practical Guide with Examples
From Everand
Python Data Structures Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
Anjali Singh
No ratings yet
Anjali Singh
1 page
Archana Kumari Shaw
No ratings yet
Archana Kumari Shaw
1 page
Oprm639 Syllabus
No ratings yet
Oprm639 Syllabus
2 pages
Ecom525 Syllabus
No ratings yet
Ecom525 Syllabus
2 pages
Accm507 Syllabus
No ratings yet
Accm507 Syllabus
2 pages
Techcheck Daily: Emkay Global Financial Services LTD
No ratings yet
Techcheck Daily: Emkay Global Financial Services LTD
9 pages
r48 3000e3 Datasheet
No ratings yet
r48 3000e3 Datasheet
2 pages
Pneumatic Instrumentation Principles - The Force Balance System
No ratings yet
Pneumatic Instrumentation Principles - The Force Balance System
2 pages
PCI UG Framework
No ratings yet
PCI UG Framework
191 pages
Harley Davidson-Total
0% (1)
Harley Davidson-Total
36 pages
Air Conditioning Laboratory Unit A660: P.A.Hilton LTD
No ratings yet
Air Conditioning Laboratory Unit A660: P.A.Hilton LTD
6 pages
Questionnaire: Client Experience Towards Herbal Products
No ratings yet
Questionnaire: Client Experience Towards Herbal Products
5 pages
Pump Head Calculations
100% (2)
Pump Head Calculations
4 pages
AUDIOSCRIPT
No ratings yet
AUDIOSCRIPT
5 pages
The Evolution of Egyptian Pyramid Construction
No ratings yet
The Evolution of Egyptian Pyramid Construction
24 pages
Journal of Environmental Chemical Engineering: Sciencedirect
No ratings yet
Journal of Environmental Chemical Engineering: Sciencedirect
8 pages
Re-Ksl Bill 137
No ratings yet
Re-Ksl Bill 137
1 page
TEST 1 G9
No ratings yet
TEST 1 G9
3 pages
Synopsis Report Python Project
No ratings yet
Synopsis Report Python Project
4 pages
Biodiesel Extraction From Cotton Seed Oil
No ratings yet
Biodiesel Extraction From Cotton Seed Oil
12 pages
How Infinite Series Reveal The Unity of Mathematics - Quanta Magazine
No ratings yet
How Infinite Series Reveal The Unity of Mathematics - Quanta Magazine
5 pages
CS3492-DBMS Questions and Answers
No ratings yet
CS3492-DBMS Questions and Answers
5 pages
P5 Science - From Parents To Young
No ratings yet
P5 Science - From Parents To Young
38 pages
Teaching and Learning Resources For Grade IX Biology: Recommended Key Textbook
No ratings yet
Teaching and Learning Resources For Grade IX Biology: Recommended Key Textbook
7 pages
Total Amount $ 362,337.21: Payment Terms
No ratings yet
Total Amount $ 362,337.21: Payment Terms
34 pages
CHAPTER_-1[1]
No ratings yet
CHAPTER_-1[1]
30 pages
Base Si Units
No ratings yet
Base Si Units
10 pages
Algebra Assignment 2
No ratings yet
Algebra Assignment 2
7 pages
Introduction and Basic Concepts of Chemical Engineering Thermodynamics PDF
No ratings yet
Introduction and Basic Concepts of Chemical Engineering Thermodynamics PDF
22 pages
Lesson 2: Understanding The Basic Concepts in ICT What To Expect?
No ratings yet
Lesson 2: Understanding The Basic Concepts in ICT What To Expect?
10 pages
CIPS L5M4 ACFM Key Definitions, Formulas & L5M4 Learning Outcomes
100% (1)
CIPS L5M4 ACFM Key Definitions, Formulas & L5M4 Learning Outcomes
25 pages
Inventario Actualizado Chemical Guys
No ratings yet
Inventario Actualizado Chemical Guys
8 pages
Switching Basics and Intermediate Routing: Case Study
No ratings yet
Switching Basics and Intermediate Routing: Case Study
20 pages

MGNM801 Ca2 Final

Uploaded by

MGNM801 Ca2 Final

Uploaded by

Course Code: MGNM801 Course Title: Business Analytics 1

Course Instructor: Vikas Budhani Academic Task No.:2

Academic Task Title: Online Assignment Date of submission: 25th December,2023

Student Name: Astitva Awasthi Section: Q2345

Student’s Roll No: RQ2345A27 Student’s Reg. No: 12319511

Evaluation Parameters: (Parameters on which student is to be evaluated- To be

General observation Scope of improvement Best part

Evaluator’s Signature and Date:

Scenario 1: Unmasking Your Customer Personas

Scenario 2: Campaign Performance Under the Microscope

1. Write a brief description of what NumPy is and why it is important for

NumPy, short for Numerical Python, is a powerful open-source library in Python

Foundation for scientific computing:

Comprehensive mathematical functions:

Foundation for other libraries:

5. Integration with low level Languages:

You might also like