Shopee Python-Pandas Test (45 Mins)

1. The dataset contains listing information from over 464,000 rows with 12 columns of data on items, shops, sales metrics and more. 2. Analysis found over 26,000 unique shops, over 1,000 preferred or cross-border shops, and over 100,000 products with zero sales. 3. The top categories and shops by unique product count and estimated revenue were identified. Duplicated listings within shops were also flagged and further analyzed.

Uploaded by

Gyan Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1K views2 pages

Shopee Python-Pandas Test (45 Mins)

Uploaded by

Gyan Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Shopee Python-Pandas Test (45 mins)

In this task, you'll be analysing listings data from our Shopee Platform.

You may use the Pycharm IDE installed, Sublime or other windows native text editors. Please save
your python source code on the desktop. You may use the internet for help.

The dataset is stored in the Test_Pandas.xlsx file. It contains listing information posted on Shopee.
One single listing corresponds to one row in the dataset.

The dataset has 12 columns, and 464433 rows.

Here are the brief descriptions of each column:

Itemid - a unique ID of the product

Shopid - a unique ID of the shop
item_name - product title
item_description - detailed product description
item_variation - stores variations of a product (e.g. different colours or sizes, in the format like
{variation 1 name: variation 1 price, variation 2 name: variation 2 price})
price - how much does the item sold
stock - how many stocks left
category - which category does the product belongs to
cb_option - 1 indicates the product is sold by a cross border shop
is_preferred - 1 indicates the product is sold by a preferred shop
sold_count - how many products have been sold
item_creation_date - when are the product uploaded by the seller
1. Use pandas function to read the Test_Pandas.xlsx file in:
a. Assign the result to a variable named “data”
b. Assign all column names to a variable named “columns”

2. Use pandas function to find:

a. How many unique shops are in the dataset?
b. How many unique preferred and cross border shops are in the dataset?
c. How many products have zero sold count?
d. How many products were created in the year 2018?

3. Use pandas function to find:

a. Top 3 Preferred shops’ shopid that have the largest number of unique products
b. Top 3 Categories that have the largest number of unique cross-border products

4. Find Top 3 shopid with the highest revenue (Assumption: the product price has not been
changed.)

5. Find number of products that have more than 3 variations (do not include products with 3 or
fewer variations)

6. Use pandas function to identify duplicated listings within each shop (If listing A and B in shop
S have the exactly same product title, product detailed description, and price, both listing A
and B are considered as duplicated listings)
a. Mark those duplicated listings with True otherwise False and store the marking
result in a new column named “is_duplicated”
b. Find duplicate listings that has less than 2 sold count and store the result in a new
excel file named “duplicated_listings.xlsx”
c. Find the preferred shop shopid that have the most number of duplicated listings

Shopee Python-Pandas Test (45 Mins)
100% (3)
Shopee Python-Pandas Test (45 Mins)
3 pages
Shopee ID - SQL Test
0% (2)
Shopee ID - SQL Test
929 pages
Shopee Python-Pandas Test (45 Mins)
No ratings yet
Shopee Python-Pandas Test (45 Mins)
2 pages
Practice Problems - CPM
No ratings yet
Practice Problems - CPM
8 pages
Latest Algorithm Design Using Pseudocode
No ratings yet
Latest Algorithm Design Using Pseudocode
28 pages
Filmora Keyboard Shortcuts
100% (1)
Filmora Keyboard Shortcuts
1 page
Data Science & Analytics Test For Examinee Latest Docx 2
No ratings yet
Data Science & Analytics Test For Examinee Latest Docx 2
3 pages
Ip Practice Test (14in)
No ratings yet
Ip Practice Test (14in)
9 pages
Supermarket Sales Data analysis
No ratings yet
Supermarket Sales Data analysis
6 pages
Lab 1 ML
No ratings yet
Lab 1 ML
2 pages
Worksheet on Pandas Dataframe
No ratings yet
Worksheet on Pandas Dataframe
5 pages
Supermarket Sales Analysis Project
No ratings yet
Supermarket Sales Analysis Project
8 pages
T1 IP qp
No ratings yet
T1 IP qp
8 pages
ESE Ques Pattern
No ratings yet
ESE Ques Pattern
3 pages
Standardqp
No ratings yet
Standardqp
4 pages
E201-Aakah Jathore - Lab - Ass - No - 04
No ratings yet
E201-Aakah Jathore - Lab - Ass - No - 04
3 pages
Ip Questions
No ratings yet
Ip Questions
5 pages
BigMart Sales Data Analysis
No ratings yet
BigMart Sales Data Analysis
16 pages
DATAFRAME
No ratings yet
DATAFRAME
6 pages
practice_questions2
No ratings yet
practice_questions2
2 pages
Reading An Entire File at Once: Generating Current Date
No ratings yet
Reading An Entire File at Once: Generating Current Date
2 pages
Dataframe
No ratings yet
Dataframe
19 pages
CODING & OUTPUT Bike Data Analysis
No ratings yet
CODING & OUTPUT Bike Data Analysis
25 pages
Masterclass Data Analysis.ipynb - Colab
No ratings yet
Masterclass Data Analysis.ipynb - Colab
4 pages
IP Project Final
No ratings yet
IP Project Final
9 pages
Data Understanding and Preparation
No ratings yet
Data Understanding and Preparation
48 pages
Divyanshi 05401172023 Ds Practical
No ratings yet
Divyanshi 05401172023 Ds Practical
18 pages
Python For Business Decision Making Asm2
No ratings yet
Python For Business Decision Making Asm2
21 pages
question paper
No ratings yet
question paper
5 pages
IP (12) Proj File Pandas&Matplotlib
No ratings yet
IP (12) Proj File Pandas&Matplotlib
12 pages
Ip Project
No ratings yet
Ip Project
16 pages
Build Your Empire With Amazon: Laptop Lifestyle
From Everand
Build Your Empire With Amazon: Laptop Lifestyle
Nebula Press
No ratings yet
prac1
No ratings yet
prac1
5 pages
Acknowledgement
No ratings yet
Acknowledgement
25 pages
Data Aggregation and Group Operations
No ratings yet
Data Aggregation and Group Operations
34 pages
Supermarket Sales Analysis 1
No ratings yet
Supermarket Sales Analysis 1
13 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Question Bank Class XII IP 065 Long Question Answer
No ratings yet
Question Bank Class XII IP 065 Long Question Answer
35 pages
Question Bank-BDA (Module 1&2) 2
No ratings yet
Question Bank-BDA (Module 1&2) 2
5 pages
List of Practicals Python 2024 - 25
No ratings yet
List of Practicals Python 2024 - 25
13 pages
AIYA DATA EXPLORATION
No ratings yet
AIYA DATA EXPLORATION
4 pages
Nikita Prasad - Exploratory Data Analysis (EDA)
No ratings yet
Nikita Prasad - Exploratory Data Analysis (EDA)
18 pages
vertopal.com_12_Pandas
No ratings yet
vertopal.com_12_Pandas
14 pages
Minimum Level Pandas Skill Based Questions
No ratings yet
Minimum Level Pandas Skill Based Questions
8 pages
pandas_notes
No ratings yet
pandas_notes
8 pages
Class Xii (Informatics Practices) Half Yearly QP & Ms Ernakulam Region
No ratings yet
Class Xii (Informatics Practices) Half Yearly QP & Ms Ernakulam Region
5 pages
L-3 (Data Frame Part 2).Ipynb - Colab
No ratings yet
L-3 (Data Frame Part 2).Ipynb - Colab
5 pages
Stationary Shop Management System ( Ip Class Xii )
No ratings yet
Stationary Shop Management System ( Ip Class Xii )
23 pages
Important Questions With Solutions IP
No ratings yet
Important Questions With Solutions IP
5 pages
Task 6
No ratings yet
Task 6
14 pages
Python MCQs
No ratings yet
Python MCQs
21 pages
STD XII-TEE- IP
No ratings yet
STD XII-TEE- IP
9 pages
STATIONARY MANAGEMENT SYSTEM IP CLASS XII (2024-25)
No ratings yet
STATIONARY MANAGEMENT SYSTEM IP CLASS XII (2024-25)
26 pages
IP Practical PRGM
No ratings yet
IP Practical PRGM
41 pages
Commands SQL, Python (BASICS)
No ratings yet
Commands SQL, Python (BASICS)
7 pages
SalesMgmtSystem XII IP Projectreport 2022 23
No ratings yet
SalesMgmtSystem XII IP Projectreport 2022 23
18 pages
Project Sale Analysis
No ratings yet
Project Sale Analysis
8 pages
Diwali Sales Analysis EDA 1696347982
No ratings yet
Diwali Sales Analysis EDA 1696347982
8 pages
ST Joseph'S Convent Senior Secondary School: Name:-Shatakshi Gaur Class:-Xii Sec:-A Board Roll No.
No ratings yet
ST Joseph'S Convent Senior Secondary School: Name:-Shatakshi Gaur Class:-Xii Sec:-A Board Roll No.
65 pages
IP PROJECT (23-24) Jerin
No ratings yet
IP PROJECT (23-24) Jerin
28 pages
Pandas 1
No ratings yet
Pandas 1
32 pages
Ip Practical Notes
No ratings yet
Ip Practical Notes
6 pages
Customer Segmentation PDF
No ratings yet
Customer Segmentation PDF
18 pages
Dejene Chala Stat606 Screening Quiz Programming Part
No ratings yet
Dejene Chala Stat606 Screening Quiz Programming Part
12 pages
Oisd STD-225 PDF
No ratings yet
Oisd STD-225 PDF
42 pages
Divya Desam - Wikipedia PDF
100% (2)
Divya Desam - Wikipedia PDF
48 pages
Scanned by Tapscanner
No ratings yet
Scanned by Tapscanner
3 pages
NET201 Lab Experiment # 4 - Configuring IPv4 Static and Default Routes
No ratings yet
NET201 Lab Experiment # 4 - Configuring IPv4 Static and Default Routes
15 pages
Fusion TallyAPI Documentation
No ratings yet
Fusion TallyAPI Documentation
18 pages
701 101 Eci Epi - 05122022
No ratings yet
701 101 Eci Epi - 05122022
2 pages
chapter 4
No ratings yet
chapter 4
10 pages
Embedded Syllabus
No ratings yet
Embedded Syllabus
2 pages
Control CL Commands With Command Exit Programs - Part 1
No ratings yet
Control CL Commands With Command Exit Programs - Part 1
8 pages
Cs Paper 2nd Year
No ratings yet
Cs Paper 2nd Year
1 page
BIOS and DOS Interrupts
83% (6)
BIOS and DOS Interrupts
42 pages
DLL Injection
No ratings yet
DLL Injection
1 page
User Guide: Important: For Mapping Root Access Required With The Latest Supersu & Busybox
No ratings yet
User Guide: Important: For Mapping Root Access Required With The Latest Supersu & Busybox
8 pages
Forensics Investigations Case Studies and Tools
No ratings yet
Forensics Investigations Case Studies and Tools
9 pages
C Programming Lab
No ratings yet
C Programming Lab
79 pages
Mikrotik DAN Topologi Jaringan: Ibnu Prastowo Haryono Putro Raditya Aji Habsoro
No ratings yet
Mikrotik DAN Topologi Jaringan: Ibnu Prastowo Haryono Putro Raditya Aji Habsoro
76 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
110 pages
1 Principles of Compiler Design
No ratings yet
1 Principles of Compiler Design
89 pages
External Data Sheet
No ratings yet
External Data Sheet
14 pages
PROJECT REPORT ON ART GALLER1.docx2222
No ratings yet
PROJECT REPORT ON ART GALLER1.docx2222
19 pages
Switching Circuits & Logic Design: 18 Circuits For Arithmetic Operations
No ratings yet
Switching Circuits & Logic Design: 18 Circuits For Arithmetic Operations
8 pages
A To Z Preparation Guide For Code With Cisco by Vikram
No ratings yet
A To Z Preparation Guide For Code With Cisco by Vikram
17 pages
E72-2G4M05S1A Usermanual EN v1.1
No ratings yet
E72-2G4M05S1A Usermanual EN v1.1
12 pages
RK3566 Tablet Ref V11 20210601
No ratings yet
RK3566 Tablet Ref V11 20210601
56 pages
Javascript 2
No ratings yet
Javascript 2
14 pages
MT6580 Android Scatter
100% (1)
MT6580 Android Scatter
7 pages
X360glitchip v2 Installation (En)
No ratings yet
X360glitchip v2 Installation (En)
14 pages
Paperless e Cash Management System by Using An I Button Technology
No ratings yet
Paperless e Cash Management System by Using An I Button Technology
35 pages
17.PZ1000 Testing Tool, CSPC (With ARC Resistance)
No ratings yet
17.PZ1000 Testing Tool, CSPC (With ARC Resistance)
15 pages
31010794-VRP1.5 Command Reference Volume 1
No ratings yet
31010794-VRP1.5 Command Reference Volume 1
283 pages

Shopee Python-Pandas Test (45 Mins)

Uploaded by

Shopee Python-Pandas Test (45 Mins)

Uploaded by

Shopee Python-Pandas Test (45 mins)

The dataset has 12 columns, and 464433 rows.

Here are the brief descriptions of each column:

Itemid - a unique ID of the product

2. Use pandas function to find:

3. Use pandas function to find:

You might also like