Categorical Analysis
In [ ]:
import pandas as pd
In [2]:
df = pd.read_csv("googleplaystore.csv").dropna()
In [3]:
df.head(5)
Out[3]:
| | App | Category | Rating | Reviews | Size | Installs | Type | Price | Content Rating | Genres | Last Updated | Current Ver | Android Ver |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | Photo Editor & Candy Camera & Grid & ScrapBook | ART_AND_DESIGN | 4.1 | 159 | 19M | 10,000+ | Free | 0 | Everyone | Art & Design | January 7, 2018 | 1.0.0 | 4.0.3 and up |
1 | Coloring book moana | ART_AND_DESIGN | 3.9 | 967 | 14M | 500,000+ | Free | 0 | Everyone | Art & Design;Pretend Play | January 15, 2018 | 2.0.0 | 4.0.3 and up |
2 | U Launcher Lite – FREE Live Cool Themes, Hide ... | ART_AND_DESIGN | 4.7 | 87510 | 8.7M | 5,000,000+ | Free | 0 | Everyone | Art & Design | August 1, 2018 | 1.2.4 | 4.0.3 and up |
3 | Sketch - Draw & Paint | ART_AND_DESIGN | 4.5 | 215644 | 25M | 50,000,000+ | Free | 0 | Teen | Art & Design | June 8, 2018 | Varies with device | 4.2 and up |
4 | Pixel Draw - Number Art Coloring Book | ART_AND_DESIGN | 4.3 | 967 | 2.8M | 100,000+ | Free | 0 | Everyone | Art & Design;Creativity | June 20, 2018 | 1.1 | 4.4 and up |
Q1. Total number of categories
In [4]:
print("There are", len(df['Category'].unique()), "categories in total")
There are 33 categories in total
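As a side note, pandas also offers `Series.nunique()`, which counts distinct values directly. The sketch below uses a small hypothetical frame (`toy`, not the real `googleplaystore.csv`) to show one difference: `nunique()` skips missing values by default, while `len(... .unique())` counts them. In this notebook `dropna()` has already removed missing rows, so both approaches should agree here.

```python
import pandas as pd

# Hypothetical toy data standing in for the real dataset (assumption, not from the CSV)
toy = pd.DataFrame(
    {"Category": ["ART_AND_DESIGN", "GAME", "GAME", "TOOLS", None]}
)

# nunique() ignores missing values by default (dropna=True)
n_categories = toy["Category"].nunique()

# unique() keeps the missing value as its own entry
n_with_missing = len(toy["Category"].unique())

print("Distinct categories:", n_categories)
print("Distinct entries including missing:", n_with_missing)
```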
Q2. Total apps in ART_AND_DESIGN
In [5]:
# Count the rows whose Category is ART_AND_DESIGN
c = 0
for i in df['Category']:
    if i == 'ART_AND_DESIGN':
        c += 1
print("There are", c, "applications in ART_AND_DESIGN")
There are 61 applications in ART_AND_DESIGN
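The same count can be obtained without an explicit loop by comparing the column to the label and summing the resulting booleans. This is a minimal sketch on a hypothetical frame (`toy` and its rows are made up for illustration), not a replacement for the cell above.

```python
import pandas as pd

# Hypothetical toy data (assumption, not taken from googleplaystore.csv)
toy = pd.DataFrame(
    {"Category": ["ART_AND_DESIGN", "GAME", "ART_AND_DESIGN", "TOOLS"]}
)

# The comparison yields a boolean Series; True counts as 1 when summed
art_count = int((toy["Category"] == "ART_AND_DESIGN").sum())

print("There are", art_count, "applications in ART_AND_DESIGN")
```

On the real data this would be `(df['Category'] == 'ART_AND_DESIGN').sum()`.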
Q3. Total number of App-Types
In [7]:
# df already holds the cleaned data, so the CSV does not need to be read again
df['Type'].unique()
Out[7]:
array(['Free', 'Paid'], dtype=object)
Q4. Total number of Paid and Free Apps
In [ ]:
# Count free apps
f = 0
for i in df['Type']:
    if i == 'Free':
        f += 1
# Count paid apps
p = 0
for i in df['Type']:
    if i == 'Paid':
        p += 1
print("There are", f, "free and", p, "paid applications in total")
There are 8715 free and 645 paid applications in total
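Both counts can also be read from a single `value_counts()` call, which tallies every distinct value in a column. The frame below is hypothetical (`toy` and its four rows are invented for the example). `Series.get` with a default of 0 guards against a label that never occurs.

```python
import pandas as pd

# Hypothetical toy data (assumption, not taken from googleplaystore.csv)
toy = pd.DataFrame({"Type": ["Free", "Free", "Paid", "Free"]})

# One pass over the column gives the count for every distinct Type
type_counts = toy["Type"].value_counts()

free_count = int(type_counts.get("Free", 0))
paid_count = int(type_counts.get("Paid", 0))

print("There are", free_count, "free and", paid_count, "paid applications")
```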
Q5. Percentage of free apps in the dataset
In [ ]:
print(int(f / (f + p) * 100), "% of applications are free")
93 % of applications are free
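Note that `int()` truncates toward zero rather than rounding, so the figure above drops the fractional part. A small sketch of the same arithmetic with the two counts reported earlier (8715 free, 645 paid), formatted to two decimals instead. The variable names are chosen for illustration only.

```python
# Counts taken from the notebook output above
free_count, paid_count = 8715, 645

# Share of free apps as a percentage of all apps
free_share = free_count / (free_count + paid_count) * 100

truncated = int(free_share)       # what the cell above prints
formatted = f"{free_share:.2f}"   # rounded to two decimal places

print(truncated, "% (truncated)")
print(formatted, "% (two decimals)")
```

On a DataFrame, `df['Type'].value_counts(normalize=True)` returns the same shares as fractions of 1.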
Q6. Name all the Content Ratings
In [11]:
for i in df['Content Rating'].unique():
    print(i, end=' ; ')
Everyone ; Teen ; Everyone 10+ ; Mature 17+ ; Adults only 18+ ; Unrated ;