

-----

### Dataset Description

This dataset contains **41,355** rows and **6** columns, with no missing values. It provides information about various songs, including their name, artist, duration, and language.

| Column Name | Data Type | Description | Example Values |
| :--- | :--- | :--- | :--- |
| `name` | object (string) | The title of the song. | `Dil - E - Nadan Tujhe`, `Hazaron Khwahishen Aisi` |
| `singer` | object (string) | The name(s) of the singer(s). Multiple singers are separated by a pipe (`\|`). | `Chitra Singh\|Jagjit Singh`, `Jagjit Singh` |
| `singer_id` | object (string) | An identifier or path component for the singer(s), likely used to construct a URL. Multiple IDs are separated by a pipe (`\|`). | `/artist/chitra-singh\|/artist/jagjeet-singh-1` |
| `duration` | object (string) | The duration of the song, formatted as minute:second (MM:SS). | `05:00`, `06:26` |
| `link` | object (string) | A unique path component for the song, likely used to construct a URL. | `/dil-e-nadan-tujhe-3`, `/agar-hum-kahen-aur-woh-muskura-den` |
| `language` | object (string) | The language of the song. | `Urdu`, `Bengali` |
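
The pipe-separated `singer` and `singer_id` fields can be split into aligned pairs. The following is a minimal sketch on one row copied from the example values above. The helper name `split_singers` is hypothetical, and it assumes both fields always list the same number of entries in the same order, which the source does not confirm.

```python
def split_singers(singer: str, singer_id: str) -> list[tuple[str, str]]:
    """Pair each singer name with its ID (hypothetical helper)."""
    names = singer.split("|")
    ids = singer_id.split("|")
    # Assumption: names and IDs appear in the same order and count.
    if len(names) != len(ids):
        raise ValueError("singer and singer_id have different lengths")
    return list(zip(names, ids))


pairs = split_singers(
    "Chitra Singh|Jagjit Singh",
    "/artist/chitra-singh|/artist/jagjeet-singh-1",
)
print(pairs)
```

Raising on a length mismatch is a design choice for this sketch: it surfaces rows where names and IDs cannot be paired instead of silently truncating them.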

# 1) Overall Analysis

## 1.1) Import Data

In [None]:
import pandas as pd
import numpy as np

In [None]:
!git clone "https://github.com/HarshvardhanSingh-13/Datasets"

Cloning into 'Datasets'...
remote: Enumerating objects: 303, done.
remote: Counting objects: 100% (61/61), done.
remote: Compressing objects: 100% (55/55), done.
remote: Total 303 (delta 14), reused 0 (delta 0), pack-reused 242 (from 1)
Receiving objects: 100% (303/303), 301.35 MiB | 21.09 MiB/s, done.
Resolving deltas: 100% (128/128), done.
Updating files: 100% (235/235), done.


## 1.2) Loading the Dataset

In [None]:
df=pd.read_csv("/content/Datasets/Gaana/songs.csv")

In [None]:
df.head(2)

Unnamed: 0,name,singer,singer_id,duration,link,language
0,Dil - E - Nadan Tujhe,Chitra Singh|Jagjit Singh,/artist/chitra-singh|/artist/jagjeet-singh-1,05:00,/dil-e-nadan-tujhe-3,Urdu
1,Agar Hum Kahen Aur Woh Muskara De,Chitra Singh|Jagjit Singh,/artist/chitra-singh|/artist/jagjeet-singh-1,06:26,/agar-hum-kahen-aur-woh-muskura-den,Urdu


In [None]:
df.tail(2)

Unnamed: 0,name,singer,singer_id,duration,link,language
41353,Jatone Hridaye Rekho,Ajoy Chakrabarty,/artist/ajoy-chakrabarty,04:29,/jatone-hridaye-rekho,Bengali
41354,Tomar Je Oi Haater Mala,Asha Bhosle,/artist/asha-bhosle-1,03:22,/tomar-je-oi-haater-mala,Bengali


## 1.3) Reading of Dataset

In [None]:
df.describe()

Unnamed: 0,name,singer,singer_id,duration,link,language
count,41355,41355,41355,41355,41355,41355
unique,31534,6196,6580,1309,36361,16
top,O Mere Dil Ke Chain,Lata Mangeshkar,/artist/lata-mangeshkar,03:02,/ek-baar-zara-phir-keh-do,Telugu
freq,26,1410,1410,447,2,4996


In [None]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 41355 entries, 0 to 41354
Data columns (total 6 columns):
 #   Column     Non-Null Count  Dtype 
---  ------     --------------  ----- 
 0   name       41355 non-null  object
 1   singer     41355 non-null  object
 2   singer_id  41355 non-null  object
 3   duration   41355 non-null  object
 4   link       41355 non-null  object
 5   language   41355 non-null  object
dtypes: object(6)
memory usage: 1.9+ MB


## 1.4) Checking for Null Values

In [None]:
df.isnull().sum()

Unnamed: 0,0
name,0
singer,0
singer_id,0
duration,0
link,0
language,0


In [None]:
df.head()

Unnamed: 0,name,singer,singer_id,duration,link,language
0,Dil - E - Nadan Tujhe,Chitra Singh|Jagjit Singh,/artist/chitra-singh|/artist/jagjeet-singh-1,05:00,/dil-e-nadan-tujhe-3,Urdu
1,Agar Hum Kahen Aur Woh Muskara De,Chitra Singh|Jagjit Singh,/artist/chitra-singh|/artist/jagjeet-singh-1,06:26,/agar-hum-kahen-aur-woh-muskura-den,Urdu
2,Unke Dekhe Se,Jagjit Singh,/artist/jagjeet-singh-1,03:41,/unke-dekhe-se,Urdu
3,Yeh Na Thi Hamari Qismat - Chitra Singh,Chitra Singh,/artist/chitra-singh,04:26,/yeh-na-thi-hamari-qismat-2,Urdu
4,Hazaron Khwahishen Aisi,Jagjit Singh,/artist/jagjeet-singh-1,05:39,/hazaron-khwahishen-aisi-1,Urdu


In [None]:
df[df["language"]=="Hindi"].count()

Unnamed: 0,0
name,4993
singer,4993
singer_id,4993
duration,4993
link,4993
language,4993


In [None]:
df[df["language"]=="Old"].count()

Unnamed: 0,0
name,4993
singer,4993
singer_id,4993
duration,4993
link,4993
language,4993


## 1.5) Removing Duplicates

In [None]:
df.drop_duplicates(subset="link",inplace=True)

In [None]:
df.describe()

Unnamed: 0,name,singer,singer_id,duration,link,language
count,36361,36361,36361,36361,36361,36361
unique,31534,6196,6580,1309,36361,15
top,Padhyams,S. P. Balasubrahmanyam,/artist/s-p-balasubrahamanyam,03:02,/tomar-je-oi-haater-mala,Telugu
freq,20,1151,1155,417,1,4995


In [None]:
df.drop_duplicates(subset=["name","singer","singer_id","duration"], inplace=True)

In [None]:
df.describe()

Unnamed: 0,name,singer,singer_id,duration,link,language
count,32949,32949,32949,32949,32949,32949
unique,31534,6196,6580,1309,32949,15
top,Padhyams,P. Susheela,/artist/p-suseela-2,04:00,/tomar-je-oi-haater-mala,Telugu
freq,19,938,938,377,1,4635
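
The two deduplication passes above behave differently: the first treats rows with the same `link` as duplicates, while the second treats rows that match on `name`, `singer`, `singer_id`, and `duration` as duplicates even when their links differ. A small sketch on made-up rows (the values below are illustrative, not taken from the dataset; `toy`, `by_link`, and `by_content` are hypothetical names):

```python
import pandas as pd

# Illustrative rows only; not taken from songs.csv.
toy = pd.DataFrame(
    {
        "name": ["Song A", "Song A", "Song A", "Song B"],
        "singer": ["X", "X", "X", "Y"],
        "singer_id": ["/artist/x", "/artist/x", "/artist/x", "/artist/y"],
        "duration": ["03:00", "03:00", "03:00", "04:00"],
        "link": ["/song-a", "/song-a", "/song-a-1", "/song-b"],
    }
)

# Pass 1: rows sharing a link collapse to the first occurrence.
by_link = toy.drop_duplicates(subset="link")

# Pass 2: rows identical in name, singer, singer_id and duration collapse
# too, even though their links differ.
by_content = by_link.drop_duplicates(
    subset=["name", "singer", "singer_id", "duration"]
)

print(len(toy), len(by_link), len(by_content))
```

The sketch returns new frames instead of using `inplace=True`, which keeps each intermediate result available for comparison.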


In [None]:
df["language"].value_counts()

Unnamed: 0_level_0,count
language,Unnamed: 1_level_1
Telugu,4635
Marathi,4240
Tamil,4227
Hindi,4078
Punjabi,3580
Urdu,3108
Kannada,2851
Gujarati,2007
Odia,915
Bengali,874


# 2) Column Wise Analysis (Univariate Analysis)

## 2.1) Name

In [None]:
df.describe()

Unnamed: 0,name,singer,singer_id,duration,link,language
count,32949,32949,32949,32949,32949,32949
unique,31534,6196,6580,1309,32949,15
top,Padhyams,P. Susheela,/artist/p-suseela-2,04:00,/tomar-je-oi-haater-mala,Telugu
freq,19,938,938,377,1,4635


In [None]:
df["name"].value_counts()

Unnamed: 0_level_0,count
name,Unnamed: 1_level_1
Padhyams,19
Ahe Ramahari,12
Tuma Gaan Mahu Machhi,11
Music,10
Film Story,8
...,...
Idhu Shrungara Hennina,1
Oh Rasika Subhanalla,1
Naa Donkadarenu,1
Jayathu Jaya/Ee Nanna/Kivivarege,1


In [None]:
df[df["name"]=="Padhyams"]

Unnamed: 0,name,singer,singer_id,duration,link,language
34123,Padhyams,Ghanatasala,/artist/ghanatasala,01:21,/poem-1-1,Telugu
34236,Padhyams,P. Susheela|Madhavapeddi Satyam,/artist/p-suseela-2|/artist/madhavapeddi-satyam,08:00,/padyams1,Telugu
34340,Padhyams,Ghanatasala|P. Susheela|P Leela|S. Varalakshmi,/artist/ghanatasala|/artist/p-suseela-2|/artis...,19:56,/padhyams-2,Telugu
34441,Padhyams,Ghanatasala,/artist/ghanatasala,04:47,/padyams-narthanasala,Telugu
34512,Padhyams,Ghanatasala|Madhavapeddi Satyam,/artist/ghanatasala|/artist/madhavapeddi-satyam,05:06,/padyamssri-krishnarjuna-yuddham,Telugu
34855,Padhyams,Ghanatasala|S. Janaki,/artist/ghanatasala|/artist/s-janaki,01:48,/dhanyosmi-matha-sukravarapu-poddu,Telugu
34869,Padhyams,Ghanatasala,/artist/ghanatasala,56:19,/padyams-seetharama-kalyanam,Telugu
34922,Padhyams,Ghanatasala,/artist/ghanatasala,50:37,/padyams-gudi-gantalu,Telugu
35153,Padhyams,Ghanatasala|Madhavapeddi Satyam,/artist/ghanatasala|/artist/madhavapeddi-satyam,03:23,/padyams-sri-venketswara-mahathymam,Telugu
35281,Padhyams,Ghanatasala|Madhavapeddi Satyam,/artist/ghanatasala|/artist/madhavapeddi-satyam,04:15,/padyams-with-dialogues,Telugu


## 2.2) Singers

In [None]:
df["singer"].value_counts()

Unnamed: 0_level_0,count
singer,Unnamed: 1_level_1
P. Susheela,938
S. P. Balasubrahmanyam,931
Asha Bhosle,880
Lata Mangeshkar,725
S. Janaki,616
...,...
Calcutta Youth Choir,1
Sudhin Mukherjee|Sushil Das|Vivekananda Roy|Bhanu Bandopadhyay|Sudhir Bhattacharya|Shaila Devi,1
Sanjay Chakraborty,1
Pamela Mukherjee,1


In [None]:
df[df["singer"]=="P. Susheela"]

Unnamed: 0,name,singer,singer_id,duration,link,language
5419,Nalandhana,P. Susheela,/artist/p-suseela-2,05:18,/nalandhana,Tamil
5429,Nenjam Marappathillai (Female),P. Susheela,/artist/p-suseela-2,04:23,/nenjam-marappathillai,Tamil
5464,Paarttha Gnaabagam (Happy),P. Susheela,/artist/p-suseela-2,04:12,/partha-nyaabagam-happy,Tamil
5493,Athai Madi,P. Susheela,/artist/p-suseela-2,03:26,/athai-madi,Tamil
5497,Thedinen Vanthathu,P. Susheela,/artist/p-suseela-2,03:43,/thedinen-vanthathu,Tamil
...,...,...,...,...,...,...
38187,Vasthade Vasthade,P. Susheela,/artist/p-suseela-2,02:07,/vasthade-vasthade,Telugu
38471,Nabaghana Barana,P. Susheela,/artist/p-suseela-2,04:20,/nabaghana-barana-1,Odia
38692,Nabaghana Barana,P. Susheela,/artist/p-suseela-2,03:08,/nabaghana-barana,Odia
38769,Asa Pura Kamini,P. Susheela,/artist/p-suseela-2,03:05,/asa-pura-kamini,Odia


## 2.3) Singer_Id

In [None]:
df["singer_id"].value_counts()

Unnamed: 0_level_0,count
singer_id,Unnamed: 1_level_1
/artist/p-suseela-2,938
/artist/s-p-balasubrahamanyam,935
/artist/asha-bhosle-1,880
/artist/lata-mangeshkar,725
/artist/s-janaki,616
...,...
/artist/lata-mangeshkar|/artist/jolly-mukherjee-1,1
/artist/suresh-wadkar|/artist/iiaiyaraaja,1
/artist/kamaal-khan,1
/artist/bhupinder-singh|/artist/runa-laila,1


In [None]:
df[df["singer_id"]=="/artist/p-suseela-2"]

Unnamed: 0,name,singer,singer_id,duration,link,language
5419,Nalandhana,P. Susheela,/artist/p-suseela-2,05:18,/nalandhana,Tamil
5429,Nenjam Marappathillai (Female),P. Susheela,/artist/p-suseela-2,04:23,/nenjam-marappathillai,Tamil
5464,Paarttha Gnaabagam (Happy),P. Susheela,/artist/p-suseela-2,04:12,/partha-nyaabagam-happy,Tamil
5493,Athai Madi,P. Susheela,/artist/p-suseela-2,03:26,/athai-madi,Tamil
5497,Thedinen Vanthathu,P. Susheela,/artist/p-suseela-2,03:43,/thedinen-vanthathu,Tamil
...,...,...,...,...,...,...
38187,Vasthade Vasthade,P. Susheela,/artist/p-suseela-2,02:07,/vasthade-vasthade,Telugu
38471,Nabaghana Barana,P. Susheela,/artist/p-suseela-2,04:20,/nabaghana-barana-1,Odia
38692,Nabaghana Barana,P. Susheela,/artist/p-suseela-2,03:08,/nabaghana-barana,Odia
38769,Asa Pura Kamini,P. Susheela,/artist/p-suseela-2,03:05,/asa-pura-kamini,Odia


## 2.4) Duration

In [None]:
df["duration"].value_counts()

Unnamed: 0_level_0,count
duration,Unnamed: 1_level_1
04:00,377
03:02,365
05:00,348
03:21,329
03:23,307
...,...
35:48,1
23:31,1
41:01,1
26:09,1


## 2.5) Link

In [None]:
df["link"].value_counts()

Unnamed: 0_level_0,count
link,Unnamed: 1_level_1
/tomar-je-oi-haater-mala,1
/dil-e-nadan-tujhe-3,1
/agar-hum-kahen-aur-woh-muskura-den,1
/unke-dekhe-se,1
/yeh-na-thi-hamari-qismat-2,1
...,...
/dialogues-82,1
/tum-apna-ranjhogham-apni-1,1
/tum-itna-jo-muskura-rahe-ho-12,1
/bahut-pahele-se-un-kadmo-ki-aahat-jaan-lete-hain,1


## 2.6) Language

In [None]:
df["language"].value_counts()

Unnamed: 0_level_0,count
language,Unnamed: 1_level_1
Telugu,4635
Marathi,4240
Tamil,4227
Hindi,4078
Punjabi,3580
Urdu,3108
Kannada,2851
Gujarati,2007
Odia,915
Bengali,874


# 3) Data Cleaning

## 3.1) Duration Cleaning

In [None]:
df["duration"][:10]

Unnamed: 0,duration
0,05:00
1,06:26
2,03:41
3,04:26
4,05:39
5,07:59
6,04:48
7,05:21
8,03:09
9,04:24


In [None]:
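duration_in_sec = []
for value in df["duration"]:
    parts = value.split(":")

    if len(parts) == 2:
        # MM:SS
        mins, sec = int(parts[0]), int(parts[1])
        dur = mins * 60 + sec
    elif len(parts) == 3:
        # HH:MM:SS
        hrs, mins, sec = int(parts[0]), int(parts[1]), int(parts[2])
        dur = hrs * 60 * 60 + mins * 60 + sec
    else:
        # Fail loudly rather than silently skipping the row, which would
        # leave the list shorter than the DataFrame.
        raise ValueError(f"Unexpected duration format: {value!r}")

    duration_in_sec.append(dur)

duration_in_sec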



[300,
 386,
 221,
 266,
 339,
 479,
 288,
 321,
 189,
 264,
 303,
 240,
 283,
 449,
 228,
 204,
 219,
 322,
 359,
 381,
 271,
 263,
 156,
 289,
 280,
 458,
 403,
 442,
 357,
 217,
 301,
 216,
 228,
 291,
 196,
 302,
 343,
 199,
 264,
 155,
 288,
 354,
 196,
 212,
 186,
 155,
 241,
 307,
 197,
 295,
 1106,
 84,
 329,
 276,
 199,
 391,
 302,
 225,
 314,
 256,
 309,
 439,
 238,
 341,
 245,
 442,
 219,
 268,
 334,
 279,
 363,
 261,
 196,
 326,
 226,
 292,
 300,
 249,
 257,
 284,
 206,
 1582,
 218,
 281,
 248,
 382,
 372,
 192,
 249,
 289,
 246,
 464,
 301,
 207,
 240,
 196,
 289,
 195,
 225,
 198,
 392,
 321,
 375,
 381,
 219,
 200,
 345,
 1733,
 384,
 182,
 336,
 379,
 354,
 265,
 196,
 358,
 189,
 266,
 276,
 333,
 186,
 241,
 189,
 195,
 261,
 277,
 198,
 212,
 178,
 339,
 395,
 231,
 216,
 1249,
 324,
 405,
 1143,
 258,
 398,
 258,
 1641,
 199,
 262,
 272,
 348,
 421,
 424,
 192,
 791,
 352,
 1109,
 719,
 366,
 355,
 196,
 2285,
 388,
 326,
 758,
 273,
 247,
 199,
 361,
 1760,
 1653,
 

In [None]:
df["duration_in_secs"]=duration_in_sec

In [None]:
df.head(2)

Unnamed: 0,name,singer,singer_id,duration,link,language,duration_in_secs
0,Dil - E - Nadan Tujhe,Chitra Singh|Jagjit Singh,/artist/chitra-singh|/artist/jagjeet-singh-1,05:00,/dil-e-nadan-tujhe-3,Urdu,300
1,Agar Hum Kahen Aur Woh Muskara De,Chitra Singh|Jagjit Singh,/artist/chitra-singh|/artist/jagjeet-singh-1,06:26,/agar-hum-kahen-aur-woh-muskura-den,Urdu,386
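
The conversion loop above can also be expressed as a small function applied with `Series.map`. This is a minimal sketch, not the notebook's own method: `to_seconds` is a hypothetical helper, and the sample values are durations shown elsewhere in this notebook.

```python
import pandas as pd


def to_seconds(value: str) -> int:
    """Convert 'MM:SS' or 'HH:MM:SS' to seconds (hypothetical helper)."""
    parts = value.split(":")
    if len(parts) not in (2, 3):
        raise ValueError(f"Unexpected duration format: {value!r}")
    seconds = 0
    # Each step shifts the running total up one unit (x60) and adds the
    # next field, so the same loop handles both two- and three-part values.
    for part in parts:
        seconds = seconds * 60 + int(part)
    return seconds


durations = pd.Series(["05:00", "06:26", "07:08:00"])
duration_in_secs = durations.map(to_seconds)
print(duration_in_secs.tolist())
```

Because `map` returns a Series aligned to the original index, the result can be assigned to a new column without building an intermediate list.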


In [None]:
df["duration_in_secs"].describe()

Unnamed: 0,duration_in_secs
count,32949.0
mean,313.098273
std,531.554234
min,17.0
25%,202.0
50%,256.0
75%,321.0
max,25680.0


In [None]:
df.sort_values(by= 'duration_in_secs', ascending=False).head(20)

Unnamed: 0,name,singer,singer_id,duration,link,language,duration_in_secs
16093,Haye Tumba,Dilshad Akhtar,/artist/dilshad-akhtar,07:08:00,/haye-tumba,Punjabi,25680
15000,Bum Bum Lahiri,Dilshad Akhtar,/artist/dilshad-akhtar,06:52:00,/bum-bum-lahiri-1,Punjabi,24720
15025,Kanu Atharu Bahoundi,Dilshad Akhtar,/artist/dilshad-akhtar,06:04:00,/kanu-atharu-bahoundi,Punjabi,21840
14624,Desi Bandri Vilayati Cheekan,Dilshad Akhtar,/artist/dilshad-akhtar,06:03:00,/desi-bandri-vilayati-cheekan-1,Punjabi,21780
15259,Kunda Khol Basantariye,Dilshad Akhtar,/artist/dilshad-akhtar,06:02:00,/kunda-khol-basantariye,Punjabi,21720
14655,Ghund Kad Lai,Dilshad Akhtar,/artist/dilshad-akhtar,05:48:00,/ghund-kad-lai-1,Punjabi,20880
14360,Man Wich Vasnai,Dilshad Akhtar,/artist/dilshad-akhtar,05:43:00,/man-wich-vasnai-1,Punjabi,20580
15493,Addi Todewali Jhanjhar,Dilshad Akhtar,/artist/dilshad-akhtar,05:33:00,/addi-todewali-jhanjhar-1,Punjabi,19980
15098,Dil Chori Ho Gaya,Dilshad Akhtar,/artist/dilshad-akhtar,05:25:00,/dil-chori-ho-gaya,Punjabi,19500
14904,Nachi Saadde Naal,Dilshad Akhtar,/artist/dilshad-akhtar,05:17:00,/nachi-saadde-naal,Punjabi,19020
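
Every row shown above has a three-part duration ending in `:00`, so the conversion treated it as hours, minutes, and seconds. Whether these are genuine multi-hour tracks or `MM:SS` values stored with a trailing `:00` is not something the source confirms. A quick way to pull such rows aside for manual review is sketched below on a toy frame; the values are copied from rows shown in this notebook, and `n_parts` and `suspicious` are hypothetical names.

```python
import pandas as pd

toy = pd.DataFrame(
    {
        "name": ["Haye Tumba", "Unke Dekhe Se", "Bum Bum Lahiri"],
        "duration": ["07:08:00", "03:41", "06:52:00"],
    }
)

# Count the colon-separated parts; three parts means the value was
# read as HH:MM:SS by the conversion step.
n_parts = toy["duration"].str.count(":") + 1
suspicious = toy[n_parts == 3]

print(suspicious)
```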


# KPIs

## 1) Most Popular Song

In [None]:
df["name"].value_counts()

Unnamed: 0_level_0,count
name,Unnamed: 1_level_1
Padhyams,19
Ahe Ramahari,12
Tuma Gaan Mahu Machhi,11
Music,10
Film Story,8
...,...
Idhu Shrungara Hennina,1
Oh Rasika Subhanalla,1
Naa Donkadarenu,1
Jayathu Jaya/Ee Nanna/Kivivarege,1


In [None]:
df.groupby("name")["name"].count().sort_values(ascending=False)

Unnamed: 0_level_0,name
name,Unnamed: 1_level_1
Padhyams,19
Ahe Ramahari,12
Tuma Gaan Mahu Machhi,11
Music,10
Film Story,8
...,...
Hun Ghunghat Maan Gherani,1
Hun Gamre Jhulasan Ni Chhori,1
Hun Fatkelo Phaganiyo,1
Hun Evi Vahune Gotva,1


## 2) Most Popular Singer

In [None]:
lst=[]
for i in df["singer"]:
  lst.extend(i.split("|"))  # extend in place; lst=lst+... copies the whole list on every row
lst

['Chitra Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Jagjit Kaur',
 'Chitra Singh',
 'Naseeruddin Shah',
 'Jagjit Singh',
 'Jagjit Singh',
 'Naseeruddin Shah',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Naseeruddin Shah',
 'Chitra Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Naseeruddin Shah',
 'Jagjit Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Asha Bhosle',
 'Ghulam Ali',
 'Chitra Singh',
 'Gulzar',
 'Naseeruddin Shah',
 'Vinod Sehgal',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Jagjit Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Mohammed Rafi',
 'Suman Kalyanpur',
 'Jagjit Singh',
 'Chitra Singh',
 'Chitra Singh',
 'Jagjit Singh',
 'Jagjit Singh',


In [None]:
pd.Series(lst).value_counts()

Unnamed: 0,count
S. P. Balasubrahmanyam,3057
P. Susheela,2893
S. Janaki,1847
Asha Bhosle,1713
Lata Mangeshkar,1434
...,...
Amzad Khan,1
Masood Rana,1
Mehboob Pervaiz,1
Krishna Mukherjee,1
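
The same per-singer tally can be written without an explicit loop. A minimal sketch, assuming the `singer` column holds pipe-separated names as described in the dataset table; the toy values are the `singer` entries of the first five rows shown by `df.head()`, and `per_singer` is a hypothetical name.

```python
import pandas as pd

singers = pd.Series(
    [
        "Chitra Singh|Jagjit Singh",
        "Chitra Singh|Jagjit Singh",
        "Jagjit Singh",
        "Chitra Singh",
        "Jagjit Singh",
    ],
    name="singer",
)

# Split each entry on "|" and give every singer their own row, then count.
per_singer = singers.str.split("|").explode().value_counts()

print(per_singer)
```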


## 3) Cumulative Song Duration

Duration in hours, minutes, and seconds:

In [None]:
df["duration_in_secs"].sum()

np.int64(10316275)

In [None]:
10316275//3600

2865

In [None]:
10316275%3600

2275

In [None]:
2275//60

37

In [None]:
2275%60

55

Duration in days, hours, minutes, and seconds:

In [None]:
10316275//86400


119

In [None]:
10316275%86400

34675

In [None]:
34675//3600

9

In [None]:
34675%3600

2275

In [None]:
2275//60

37
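
The chained `//` and `%` steps above can be collapsed into one helper built on `divmod`. A minimal sketch: `split_duration` is a hypothetical name, and the input is the total printed by `df["duration_in_secs"].sum()` above.

```python
def split_duration(total_seconds: int) -> tuple[int, int, int, int]:
    """Break a number of seconds into (days, hours, minutes, seconds)."""
    days, rem = divmod(total_seconds, 86_400)  # 86,400 seconds per day
    hours, rem = divmod(rem, 3_600)            # 3,600 seconds per hour
    minutes, seconds = divmod(rem, 60)
    return days, hours, minutes, seconds


days, hours, minutes, seconds = split_duration(10316275)
print(f"{days} days, {hours} hours, {minutes} minutes, {seconds} seconds")
# prints: 119 days, 9 hours, 37 minutes, 55 seconds
```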

## 4) Total Number of Languages

In [None]:
df["language"].nunique()

15

## 5) Total Number of Songs

In [None]:
df["name"].count()

np.int64(32949)

## 6) Total Number of Singers

In [None]:
lst=[]
for i in df["singer"]:
  lst.extend(i.split("|"))  # extend in place; lst=lst+... copies the whole list on every row
lst


In [None]:
pd.Series(lst).describe()

Unnamed: 0,0
count,47920
unique,3940
top,S. P. Balasubrahmanyam
freq,3057


## 7) Minimum and Maximum Duration

In [None]:
df["duration_in_secs"].describe()

Unnamed: 0,duration_in_secs
count,32949.0
mean,313.098273
std,531.554234
min,17.0
25%,202.0
50%,256.0
75%,321.0
max,25680.0
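
`describe()` gives the extreme values but not the songs they belong to. A minimal sketch of fetching those rows with `idxmin` and `idxmax`, using a toy frame whose names and second counts are taken from rows shown in this notebook (`toy`, `shortest`, and `longest` are hypothetical names):

```python
import pandas as pd

toy = pd.DataFrame(
    {
        "name": ["Unke Dekhe Se", "Haye Tumba", "Dil - E - Nadan Tujhe"],
        "duration_in_secs": [221, 25680, 300],
    }
)

# idxmin/idxmax return the index label of the extreme value,
# which .loc then uses to fetch the full row.
shortest = toy.loc[toy["duration_in_secs"].idxmin()]
longest = toy.loc[toy["duration_in_secs"].idxmax()]

print(shortest["name"], shortest["duration_in_secs"])
print(longest["name"], longest["duration_in_secs"])
```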


## 8) Total Number of Unique Singers

In [None]:
df["singer"].describe()

Unnamed: 0,singer
count,32949
unique,6196
top,P. Susheela
freq,938


In [None]:
pd.Series(lst).nunique()

3940
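
The two counts above measure different things: `df["singer"].describe()` reports 6196 distinct combinations of singers as they appear in the column, while `pd.Series(lst).nunique()` reports 3940 distinct individual names after splitting on the pipe. A toy illustration of the difference (the values are singer entries shown earlier; `n_combinations` and `n_individuals` are hypothetical names):

```python
import pandas as pd

singers = pd.Series(
    ["Chitra Singh|Jagjit Singh", "Jagjit Singh", "Chitra Singh"]
)

# Distinct raw strings: each combination counts separately.
n_combinations = singers.nunique()

# Distinct individual names after splitting on the pipe.
n_individuals = singers.str.split("|").explode().nunique()

print(n_combinations, n_individuals)
```

Here three distinct strings reduce to two individual singers, which is the same effect that takes the full dataset from 6196 to 3940.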


In [None]:
4394/3600

1.2205555555555556

In [None]:
4394%3600

794

## 9) Total Number of Unique Songs

In [None]:
df["name"].nunique()

31534

In [None]:
df.to_csv("gaana_cleand_data.csv" , index=False)

In [None]:
from google.colab import files
files.download("gaana_cleand_data.csv")
