Gathering comprehensive data on paint stores across different states in India is crucial for various stakeholders, including consumers, businesses, and policymakers. The availability and accessibility of paint stores play a significant role in meeting the demands of the construction, renovation, and home improvement sectors.

In this project, our objective is to collect detailed information about paint stores operating in each state of India. By systematically compiling data on the locations, contact information, product offerings, and services provided by these stores, we aim to create a comprehensive database that serves as a valuable resource for individuals and organizations in need of paint-related products and services.

The data collection process involves conducting extensive research, reaching out to paint store owners and industry associations, and leveraging online resources and directories. We will prioritize accuracy and reliability, ensuring that the collected data is up-to-date and verified.

This initiative holds significant benefits for various stakeholders. For consumers, having access to a centralized repository of paint store details will facilitate informed decision-making and streamline their search for specific products or services. Paint store owners can leverage this database to increase their visibility, reach potential customers, and gain insights into market trends. Policymakers can utilize this data to analyze the paint industry's economic impact, identify growth opportunities, and formulate effective policies to support the sector's development.

Join us in this endeavor to collect and compile accurate, comprehensive data on paint store details across all states in India. Together, we can build a valuable resource that promotes transparency, facilitates business growth, and enhances the overall paint retail landscape in the country.

In [180]:
# Install the required libraries.
!pip install requests
!pip install bs4



In [181]:
# Import the required libraries: requests, pandas, os, BeautifulSoup, csv, and numpy.
import requests
import pandas as pd
import os
from bs4 import BeautifulSoup as bs
import csv
import numpy as np

#**Andhra Pradesh**

After the data has been collected from various cities in Andhra Pradesh, we will merge it all into one comprehensive dataset.

In [182]:
# City Name: Anantapur

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/andhra-pradesh/anantapur/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
ap1=pd.DataFrame(df)

# Show the first 10 rows.
ap1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,K K Paints,"K K Paints, 15/496/1 SURYA NAGAR 80 FEET ROAD ...",Anantapur,Andhra Pradesh,India,093966 87672,14.678225,77.604274,
1,Partha paints and hardware,"Partha paints and hardware, Rajamma Nagar, Bha...",Anantapur,Andhra Pradesh,India,,14.681712,77.611705,
2,Venkatasai Paintings Hardwares,"Venkatasai Paintings Hardwares, D No.12-5-402,...",Anantapur,Andhra Pradesh,India,099852 79850,14.665382,77.601675,
3,Elahi Enterprises,"Elahi Enterprises, opp:kamma bhavan, Main Rd, ...",Anantapur,Andhra Pradesh,India,070137 52806,14.668793,77.58282,
4,Sudhir paints,"Sudhir paints, Ganesha Nagar, Pilligundla, Ana...",Anantapur,Andhra Pradesh,India,,14.683689,77.576337,
5,Hanuman Paints And Hardwares,"Hanuman Paints And Hardwares, Rangaswamy Nagar...",Anantapur,Andhra Pradesh,India,,14.690774,77.595991,
6,Asian Paints,"Asian Paints, Kamalanagar, Anantapur, Andhra P...",Anantapur,Andhra Pradesh,India,,14.68248,77.596748,https://www.asianpaints.com
7,Brundhavan traders,"Brundhavan traders, Tirupati Rd, Sangameswar C...",Anantapur,Andhra Pradesh,India,,14.67355,77.606613,
8,Asian Paints,"Asian Paints, Maruthi Nagar, Anantapur, Andhra...",Anantapur,Andhra Pradesh,India,,14.669713,77.588092,https://www.asianpaints.com
9,S.K. Paints,"6, S.K. Paints, 71, Bellary - Uravakonda - Ana...",Anantapur,Andhra Pradesh,India,099666 81848,14.687115,77.582064,
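
Every city cell in this notebook repeats the same four steps (headers, URL, request, `pd.read_html`). As a minimal sketch, and not the way the notebook itself is written, those steps could be wrapped in helpers. The names `build_url`, `parse_first_table`, and `fetch_city_table` are hypothetical, and the `timeout` value and the `raise_for_status()` check are assumptions added for robustness.

```python
from io import StringIO

import pandas as pd
import requests

# Base path shared by all report URLs used in this notebook.
BASE_URL = "https://rentechdigital.com/smartscraper/business-report-details/india"
HEADERS = {"User-Agent": "mozilla/5.0"}


def build_url(state, city=None):
    """Build a report URL; with no city this is the state-level page."""
    parts = [BASE_URL, state]
    if city:
        parts.append(city)
    parts.append("paint-stores")
    return "/".join(parts)


def parse_first_table(html):
    """Return the first HTML table found in the page text as a dataframe."""
    # Wrapping the text in StringIO avoids the warning that recent pandas
    # versions raise when read_html is given a literal HTML string.
    return pd.read_html(StringIO(html))[0]


def fetch_city_table(state, city=None, timeout=30):
    """Download one report page and parse its first table (needs network access)."""
    response = requests.get(build_url(state, city), headers=HEADERS, timeout=timeout)
    response.raise_for_status()  # fail loudly on HTTP errors instead of parsing an error page
    return parse_first_table(response.text)
```

With helpers like these, a city cell would reduce to something like `ap1 = fetch_city_table("andhra-pradesh", "anantapur")`.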


In [183]:
# City Name: Guntur

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/andhra-pradesh/Guntur/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
ap2=pd.DataFrame(df)

# Show the first 10 rows.
ap2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sri Rama Paints And Hardware Stores,"Sri Rama Paints And Hardware Stores, 1, 3rd Ln...",Guntur,Andhra Pradesh,India,094402 64770,16.301709,80.439706,https://sriramapaintsandhardwrestores.business...
1,Sri Vara Siddhi Vinayaka Paints,"Sri Vara Siddhi Vinayaka Paints, D. No. 24-12-...",Guntur,Andhra Pradesh,India,099482 15955,16.287984,80.441449,
2,Sri Rama Paints,"Sri Rama Paints, Guntur, Andhra Pradesh 522003",Guntur,Andhra Pradesh,India,095022 22781,16.290087,80.444724,
3,MAHALAKSHMI PAINTS,"MAHALAKSHMI PAINTS, 522001 12-27, 114, Seelam ...",Guntur,Andhra Pradesh,India,097032 00055,16.295631,80.447835,http://business.google.com
4,REVANTH PAINTS,"REVANTH PAINTS, BOMMALA CENTER, 3rd lane,MENAK...",Guntur,Andhra Pradesh,India,,16.27876,80.438963,https://revanth-paints.business.site
5,Dorka Paints,"# 13, Dorka Paints, 2, Arundelpet, Guntur, And...",Guntur,Andhra Pradesh,India,,16.306244,80.439117,
6,Hanuman Paints & Sanitary Stores,"Hanuman Paints & Sanitary Stores, Door No 23-6...",Guntur,Andhra Pradesh,India,0863 222 2957,16.298434,80.432532,
7,Dharani Enterprises,"Dharani Enterprises, 8 Th Lane, Arundetpet, Gu...",Guntur,Andhra Pradesh,India,0863 223 2167,16.303743,80.438477,
8,Dharani Agencies,"Dharani Agencies, # 18-6-47 , Stadium, Road, G...",Guntur,Andhra Pradesh,India,,16.290769,80.446901,https://dharani-agencies.business.site
9,Sri Karthikeya Enterprises,"Sri Karthikeya Enterprises, State Highway 2, N...",Guntur,Andhra Pradesh,India,,16.299905,80.389653,


In [184]:
# City Name: Vijayawada

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/andhra-pradesh/Vijayawada/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
ap3=pd.DataFrame(df)

# Show the first 10 rows.
ap3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Basha Paints,"Basha Paints, No. 5-6-7, KT Rd, Opp. Durga Tem...",Vijayawada,Andhra Pradesh,India,,16.531224,80.613125,
1,Sri Bhagyalakshmi Agencies & Paints,"Sri Bhagyalakshmi Agencies & Paints, #22/84/1,...",Vijayawada,Andhra Pradesh,India,,16.539826,80.576661,
2,k.g.n.paints&generals,"k.g.n.paints&generals, #29-1-12, Sheshadri Sas...",Vijayawada,Andhra Pradesh,India,099896 64499,16.512222,80.629493,
3,Surya Paints,"Surya Paints, #11-33-16,Tatakulavaristreet,Sam...",Vijayawada,Andhra Pradesh,India,098664 22055,16.515399,80.61109,
4,Sri Lakshmi Paints,"Sri Lakshmi Paints, Door No 55-3-25, 1st Floor...",Vijayawada,Andhra Pradesh,India,0866 254 4852,16.506179,80.648023,
5,Capital Colours - Paint Shop,"Capital Colours - Paint Shop, #43-78-12, Krish...",Vijayawada,Andhra Pradesh,India,089199 21654,16.540346,80.642328,https://getcapitalcolours.com
6,Maheshwari Paints,"Shop No. 28, Maheshwari Paints, 15-13, Dasu Va...",Vijayawada,Andhra Pradesh,India,0866 669 9866,16.514713,80.629838,
7,Vaishnavi Agencies,"Vaishnavi Agencies, #43-107-167,Shop No : 2, K...",Vijayawada,Andhra Pradesh,India,094944 96739,16.540993,80.637321,
8,Sadhana Paints Agencies,"D.No. 28-7, Sadhana Paints Agencies, 1/4, Shai...",Vijayawada,Andhra Pradesh,India,0866 257 9075,16.514142,80.628369,
9,Shree Venkata Satya Paints Inds,"Shree Venkata Satya Paints Inds, 1-4-204, Bhav...",Vijayawada,Andhra Pradesh,India,0866 241 5219,16.506194,80.648061,


In [185]:
# City Name: Nellore

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/andhra-pradesh/Nellore/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
ap4=pd.DataFrame(df)

# Show the first 10 rows.
ap4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sri Sapthagiri Paints & Hardwares,"Sri Sapthagiri Paints & Hardwares, D.No. 15/61...",Nellore,Andhra Pradesh,India,092999 99429,14.456981,79.987221,
1,HS Paints & Hardwares,"HS Paints & Hardwares, Mini Bypass Rd, beside ...",Nellore,Andhra Pradesh,India,088855 87855,14.452345,79.989306,
2,Vijayalakshmi Paints & Hardware,"Vijayalakshmi Paints & Hardware, Mini Bypass R...",Nellore,Andhra Pradesh,India,088016 06676,14.424146,79.971608,
3,Tirumala Paints & Hardwares,"Tirumala Paints & Hardwares, Podalakur Road, N...",Nellore,Andhra Pradesh,India,094908 92252,14.415962,79.952658,
4,Sree Srinivasa paints and hardwares,"Sree Srinivasa paints and hardwares, Nawabpet ...",Nellore,Andhra Pradesh,India,094928 02134,14.459837,79.99902,
5,MALLI PAINTS & HARDWARE,"d.no, MALLI PAINTS & HARDWARE, 16-189A, Mini B...",Nellore,Andhra Pradesh,India,,14.450358,79.988795,
6,Sri Padmavathi Paints,"Sri Padmavathi Paints, Opp: Rtc Bus Stand, Ram...",Nellore,Andhra Pradesh,India,097012 68850,14.443015,79.977699,
7,Sri Venkatasyamala h/w & paints,"Sri Venkatasyamala h/w & paints, Padmavathi Ce...",Nellore,Andhra Pradesh,India,,14.422816,79.957322,https://sri-venkatasyamala-hw-paints.business....
8,KVS Paints and Infrastructures,"KVS Paints and Infrastructures, Dycus road cen...",Nellore,Andhra Pradesh,India,,14.414308,79.948302,https://kvs-paints-and-infrastructures.busines...
9,Balu Paints,"Balu Paints, Plot No. 219, Phase- 1, Auto Naga...",Nellore,Andhra Pradesh,India,098485 69474,14.404415,79.948947,


In [186]:
# By using 'pd.concat', it is possible to concatenate all the dataframes and store the result in a variable.
# Prepare a list containing the dataframes, and when using 'pd.concat', specify 'ignore_index=True' to reset the index.
ap_data = [ap1,ap2,ap3,ap4]
ap_data = pd.concat(ap_data,ignore_index=True)
ap_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,K K Paints,"K K Paints, 15/496/1 SURYA NAGAR 80 FEET ROAD ...",Anantapur,Andhra Pradesh,India,093966 87672,14.678225,77.604274,
1,Partha paints and hardware,"Partha paints and hardware, Rajamma Nagar, Bha...",Anantapur,Andhra Pradesh,India,,14.681712,77.611705,
2,Venkatasai Paintings Hardwares,"Venkatasai Paintings Hardwares, D No.12-5-402,...",Anantapur,Andhra Pradesh,India,099852 79850,14.665382,77.601675,
3,Elahi Enterprises,"Elahi Enterprises, opp:kamma bhavan, Main Rd, ...",Anantapur,Andhra Pradesh,India,070137 52806,14.668793,77.58282,
4,Sudhir paints,"Sudhir paints, Ganesha Nagar, Pilligundla, Ana...",Anantapur,Andhra Pradesh,India,,14.683689,77.576337,
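
The effect of `ignore_index=True` is easiest to see on small stand-in frames. The sketch below uses two hypothetical frames (`ap1_demo`, `ap2_demo`) whose values are copied from the outputs above. It only illustrates the index behaviour and is not the full dataset.

```python
import pandas as pd

# Small stand-in frames; each starts with its own 0-based index,
# just like the per-city frames returned by pd.read_html.
ap1_demo = pd.DataFrame({
    "name": ["K K Paints", "Partha paints and hardware"],
    "city": ["Anantapur", "Anantapur"],
})
ap2_demo = pd.DataFrame({
    "name": ["Sri Rama Paints"],
    "city": ["Guntur"],
})

frames = [ap1_demo, ap2_demo]

# Without ignore_index the original labels are kept, so 0 appears twice.
kept_index = pd.concat(frames)

# With ignore_index=True the merged frame gets a fresh 0..n-1 index.
reset_index = pd.concat(frames, ignore_index=True)

print(list(kept_index.index))
print(list(reset_index.index))
```

A repeated index label makes lookups such as `.loc[0]` return more than one row, which is why the merged state-level frames reset the index.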


#**Assam**

The cells below gather data from cities in Assam and then merge it into a single dataset.

In [187]:
# City Name: Dispur

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/assam/Dispur/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
as1=pd.DataFrame(df)

# Show the first 10 rows.
as1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Maa Enterprise,"Maa Enterprise, Nalapara, Sarusajai, Guwahati,...",Dispur,Assam,India,,26.119638,91.76043,
1,D.A ENTERPRISE,"D.A ENTERPRISE, Sonapur - Rewa Rd, KUSHAL NAGA...",Dispur,Assam,India,,26.121048,91.968882,
2,Usha trade agencies,"Usha trade agencies, near SFS SCHOOL, Kalitaku...",Dispur,Assam,India,,26.165103,91.831996,
3,Balaji Enterprise,"Balaji Enterprise, NH 37, Lokhra Chariali, Nea...",Dispur,Assam,India,073992 38325,26.111926,91.74994,
4,city steel,"city steel, Bhetapara hatigaon road central ba...",Dispur,Assam,India,094355 47362,26.121469,91.78725,
5,H.M. Refinish,"H.M. Refinish, GS Rd, Ganeshguri, Guwahati, As...",Dispur,Assam,India,,26.150272,91.784335,
6,Shyam Hardware Stores,"Shyam Hardware Stores, Ganeshguri Subji Bazaar...",Dispur,Assam,India,,26.148028,91.785839,
7,Vikas Hardware Store,"Vikas Hardware Store, Dr. R.P. Road, Dispur, G...",Dispur,Assam,India,,26.146315,91.786191,
8,Rajdhani Paint House,"Rajdhani Paint House, Hatigaon Main Rd, Near N...",Dispur,Assam,India,098541 45005,26.135043,91.785807,
9,SHREE SAI TRADERS,"SHREE SAI TRADERS, high school, near, Hatigaon...",Dispur,Assam,India,,26.133316,91.785859,


In [188]:
# City Name: Guwahati

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/assam/Guwahati/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
as2=pd.DataFrame(df)

# Show the first 10 rows.
as2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sandeep Commercial,"Sandeep Commercial, Fayez Ahmed Rd, Kumarpara,...",Guwahati,Assam,India,,26.174628,91.736093,
1,Pratayush Enterprise,"Pratayush Enterprise, MB Rd, Bishnupur, Fatasi...",Guwahati,Assam,India,,26.166636,91.737918,
2,SANTOSH CHANDRA DAS AND SON'S,"SANTOSH CHANDRA DAS AND SON'S, SRCB Rd, Lakhto...",Guwahati,Assam,India,098545 54071,26.18172,91.738201,http://mowjee.com
3,Sarala Store,"Sarala Store, GS Rd, Kejriwal Bhawan Market, M...",Guwahati,Assam,India,0361 273 8307,26.178656,91.751588,
4,Industrial traders,"Industrial traders, MB MARKET AT ROAD, near RL...",Guwahati,Assam,India,094351 03866,26.174471,91.735455,
5,Shubham Trading,"Shubham Trading, Dr BK Kakati Rd, Sarabbhati, ...",Guwahati,Assam,India,097060 87646,26.168513,91.749617,
6,Pratayush Enterprises,"Pratayush Enterprises, MB Rd, Bishnupur, Fatas...",Guwahati,Assam,India,,26.16662,91.737983,
7,Jai Kolor Chrome,"Jai Kolor Chrome, Surya Tower,Prag Channel Bui...",Guwahati,Assam,India,095089 12190,26.175914,91.757014,
8,Om Hardware Mart,"Om Hardware Mart, Kedar Rd, Machkhowa, Guwahat...",Guwahati,Assam,India,,26.177917,91.736834,
9,Shanti Bhandar,"Shanti Bhandar, UN Bezbaruah Road, Silphukuri,...",Guwahati,Assam,India,0361 266 7530,26.185195,91.762417,
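
Rows 1 and 6 of the Guwahati output ("Pratayush Enterprise" and "Pratayush Enterprises") have nearly identical coordinates and may be the same shop listed twice. The notebook does not handle this case. As a hedged sketch, one simple way to flag such candidates is to round the coordinates and look for repeats. The helper name `flag_near_duplicates` and the choice of three decimal places are assumptions, and the demo frame holds only three rows copied from the output above.

```python
import pandas as pd

# Three rows copied from the Guwahati output above.
guwahati_demo = pd.DataFrame({
    "name": ["Sandeep Commercial", "Pratayush Enterprise", "Pratayush Enterprises"],
    "latitude": [26.174628, 26.166636, 26.16662],
    "longitude": [91.736093, 91.737918, 91.737983],
})


def flag_near_duplicates(df, decimals=3):
    """Return rows that share rounded coordinates with at least one other row."""
    rounded = df[["latitude", "longitude"]].round(decimals)
    # keep=False marks every member of a repeated group, not just the later ones.
    return df[rounded.duplicated(keep=False)]


candidates = flag_near_duplicates(guwahati_demo)
print(candidates["name"].tolist())
```

Rounding is a coarse grid, so two nearby shops that fall on opposite sides of a grid boundary would be missed. The flagged rows are candidates for manual review, not confirmed duplicates.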


In [189]:
# Combine the Assam dataframes with 'pd.concat': put them in a list and pass
# 'ignore_index=True' so the merged dataframe gets a fresh index.
asm_data = [as1,as2]
asm_data = pd.concat(asm_data,ignore_index=True)
asm_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Maa Enterprise,"Maa Enterprise, Nalapara, Sarusajai, Guwahati,...",Dispur,Assam,India,,26.119638,91.76043,
1,D.A ENTERPRISE,"D.A ENTERPRISE, Sonapur - Rewa Rd, KUSHAL NAGA...",Dispur,Assam,India,,26.121048,91.968882,
2,Usha trade agencies,"Usha trade agencies, near SFS SCHOOL, Kalitaku...",Dispur,Assam,India,,26.165103,91.831996,
3,Balaji Enterprise,"Balaji Enterprise, NH 37, Lokhra Chariali, Nea...",Dispur,Assam,India,073992 38325,26.111926,91.74994,
4,city steel,"city steel, Bhetapara hatigaon road central ba...",Dispur,Assam,India,094355 47362,26.121469,91.78725,


#**Bihar**

The cells below gather data from several cities in Bihar and then consolidate it into a single dataset.

In [190]:
# City Name: Patna

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/bihar/Patna/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
bh1=pd.DataFrame(df)

# Show the first 10 rows.
bh1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shivajeet Paints & Hardwere,"Shivajeet Paints & Hardwere, Ashok Nagar Ramla...",Patna,Bihar,India,,25.587001,85.143496,
1,ANJAY SALES,"ANJAY SALES, Congress Maidan Rd, Ghrounda, Pat...",Patna,Bihar,India,,25.610428,85.153055,
2,shri ram paint house,"shri ram paint house, near sbi bank, New Etwar...",Patna,Bihar,India,,25.580184,85.131137,
3,Shanti Colour Zone,"Shanti Colour Zone, 90, Vijay Nagar Rd, Hanuma...",Patna,Bihar,India,,25.58798,85.167949,
4,Kumar Hardware,"Kumar Hardware, Jawhar Nagar Opposit The Hotal...",Patna,Bihar,India,,25.583079,85.142723,
5,M/S Patna Paint House,"M/S Patna Paint House, New Bye Pass Road, F In...",Patna,Bihar,India,095761 97122,25.584375,85.096092,
6,Sainath Paints & Hardware,"Sainath Paints & Hardware, Parvathi Path, New ...",Patna,Bihar,India,,25.598669,85.168317,https://sainath-paints-hardware.business.site
7,Ganesh Paint House,"Ganesh Paint House, Balapur, Patna, Bihar 800010",Patna,Bihar,India,,25.64105,85.105949,
8,Ranjit Nerolac Paint,"Ranjit Nerolac Paint, Jakariyapur Jhali Rd, Pa...",Patna,Bihar,India,,25.584087,85.18088,
9,Asian Paints,"Asian Paints, rajivnagar near railway crossing...",Patna,Bihar,India,,25.630607,85.098604,https://website-2924626116339535095189-paintst...


In [191]:
# City Name: Muzaffarpur

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/bihar/Muzaffarpur/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
bh2=pd.DataFrame(df)

# Show the first 10 rows.
bh2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shivam Paints,"Shivam Paints, Kacchi Pakki, Atardah Road, Muz...",Muzaffarpur,Bihar,India,072095 09115,26.092715,85.389826,
1,Dharmendra Motor,"Dharmendra Motor, Bairiya, near Bus Stand, Muz...",Muzaffarpur,Bihar,India,099310 73066,26.148517,85.361315,
2,Punjab Brothers,"Punjab Brothers, Near Chandralok Chowk, Satsan...",Muzaffarpur,Bihar,India,094312 48360,26.118012,85.384724,
3,SATYAM PAINTS AND HARDWARE,"SATYAM PAINTS AND HARDWARE, Hathi chowk, near ...",Muzaffarpur,Bihar,India,,26.117063,85.396742,
4,Paint and decorative (KAUSHAL HARDWARE),"Paint and decorative (KAUSHAL HARDWARE), Juma ...",Muzaffarpur,Bihar,India,,26.125636,85.394334,
5,Color Centre,"Color Centre, Maripur, Power House Chowk, Muza...",Muzaffarpur,Bihar,India,099393 08676,26.122583,85.364843,
6,Rainbow World,"Rainbow World, Vrindawn Market, Bela Rd, near ...",Muzaffarpur,Bihar,India,,26.103048,85.402097,https://rainbowworld.business.site
7,Punjab H/W & Paints,"Punjab H/W & Paints, Old Motihari Road, Bairia...",Muzaffarpur,Bihar,India,098358 21772,26.149122,85.35703,
8,M/s Rabi Enterprises,"M/s Rabi Enterprises, Jail Rd, Tinkothia, Chan...",Muzaffarpur,Bihar,India,094310 13024,26.124647,85.401607,
9,Óñ¬ÓñÂÓÑüÓñ¬ÓññÓÑÇÓñ¿Óñ¥ÓñÑ Óñ¬ÓÑçÓñéÓñ¿ÓÑìÓñƒÓñ©,Óñ¬ÓñÂÓÑüÓñ¬ÓññÓÑÇÓñ¿Óñ¥ÓñÑ Óñ¬ÓÑçÓñéÓñ¿ÓÑìÓñƒ...,Muzaffarpur,Bihar,India,,26.138455,85.365491,


In [192]:
# Row 9 has a garbled store name (it appears to be mis-encoded Devanagari text).
# We searched the web for the store's address to find the correct name, then used
# '.str.replace' to substitute it. 'regex=False' treats the garbled text as a literal string.

bh2['name'] = bh2['name'].str.replace("Óñ¬ÓñÂÓÑüÓñ¬ÓññÓÑÇÓñ¿Óñ¥ÓñÑ Óñ¬ÓÑçÓñéÓñ¿ÓÑìÓñƒÓñ©","Pashupatinath Paints",regex=False)

In [193]:
# The 'address' value in the same row starts with the same garbled name.
# Split each address on the garbled name's last character followed by ', ' and keep the final piece ([-1]),
# which is the real address. Addresses that do not contain this separator are returned unchanged.

bh2['address'] = bh2['address'].apply(lambda x: x.split("©, ")[-1])

In [194]:
bh2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shivam Paints,"Shivam Paints, Kacchi Pakki, Atardah Road, Muz...",Muzaffarpur,Bihar,India,072095 09115,26.092715,85.389826,
1,Dharmendra Motor,"Dharmendra Motor, Bairiya, near Bus Stand, Muz...",Muzaffarpur,Bihar,India,099310 73066,26.148517,85.361315,
2,Punjab Brothers,"Punjab Brothers, Near Chandralok Chowk, Satsan...",Muzaffarpur,Bihar,India,094312 48360,26.118012,85.384724,
3,SATYAM PAINTS AND HARDWARE,"SATYAM PAINTS AND HARDWARE, Hathi chowk, near ...",Muzaffarpur,Bihar,India,,26.117063,85.396742,
4,Paint and decorative (KAUSHAL HARDWARE),"Paint and decorative (KAUSHAL HARDWARE), Juma ...",Muzaffarpur,Bihar,India,,26.125636,85.394334,
5,Color Centre,"Color Centre, Maripur, Power House Chowk, Muza...",Muzaffarpur,Bihar,India,099393 08676,26.122583,85.364843,
6,Rainbow World,"Rainbow World, Vrindawn Market, Bela Rd, near ...",Muzaffarpur,Bihar,India,,26.103048,85.402097,https://rainbowworld.business.site
7,Punjab H/W & Paints,"Punjab H/W & Paints, Old Motihari Road, Bairia...",Muzaffarpur,Bihar,India,098358 21772,26.149122,85.35703,
8,M/s Rabi Enterprises,"M/s Rabi Enterprises, Jail Rd, Tinkothia, Chan...",Muzaffarpur,Bihar,India,094310 13024,26.124647,85.401607,
9,Pashupatinath Paints,"Daudpur Kothi, Muzaffarpur, Bihar 842003",Muzaffarpur,Bihar,India,,26.138455,85.365491,
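
The notebook fixes the garbled name by hand, using a web search and `.str.replace`. As an alternative sketch only: garbling of this kind is often UTF-8 text whose bytes were decoded with the wrong code page, and in that case it can be reversed in code. The assumption that code page 850 is the wrong code page here is a guess based on the character pattern, not something the source confirms. `repair_mojibake` is a hypothetical helper, and the Devanagari sample word is chosen for illustration.

```python
def repair_mojibake(text, wrong="cp850", right="utf-8"):
    """Undo text whose UTF-8 bytes were decoded with the wrong code page."""
    try:
        return text.encode(wrong).decode(right)
    except (UnicodeEncodeError, UnicodeDecodeError):
        # The text was not garbled in this particular way, so leave it alone.
        return text


# Round trip on a sample Devanagari word: garble it the assumed way, then repair it.
original = "पशुपतीनाथ"
garbled = original.encode("utf-8").decode("cp850")
repaired = repair_mojibake(garbled)

print(garbled)
print(repaired == original)

# Plain ASCII names and names with characters outside cp850 pass through unchanged.
print(repair_mojibake("K K Paints"))
print(repair_mojibake("Gopālganj"))
```

If the assumption holds for this dataset, `bh2['name'].apply(repair_mojibake)` would recover the original script rather than an English name, so the manual replacement used in the notebook is still needed when a romanized name is wanted.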


In [195]:
# City Name: Gopālganj

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the state-level Bihar page (there is no city in the path); the first rows it returns are for Gopālganj.
url='https://rentechdigital.com/smartscraper/business-report-details/india/bihar/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable.
bh3=pd.DataFrame(df)

# Show the first 10 rows.
bh3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Samarthya Enterprises Authorised Dealer of Asi...,Samarthya Enterprises Authorised Dealer of Asi...,Gopālganj,Bihar,India,,26.375199,84.342828,https://samarthya-enterprises-authorised-deale...
1,Naveen Bhandar,"Naveen Bhandar, Chuna Gali, Haziapur, Gopalgan...",Gopālganj,Bihar,India,,26.466679,84.443101,
2,Keshav Hardware,"Keshav Hardware, Ramasheesh Complex, Arar More...",Gopālganj,Bihar,India,088738 63371,26.456354,84.452863,
3,shrivastv hardwear and paints,"shrivastv hardwear and paints, Kateya - Semra ...",Gopālganj,Bihar,India,,26.558995,84.157487,
4,Colour Tech,"Colour Tech, Banjari, Bihar 841428",Gopālganj,Bihar,India,,26.473224,84.428029,
5,A1 TRADERS,"A1 TRADERS, Bedu Tola, Bihar 841440",Gopālganj,Bihar,India,,26.435389,84.399745,
6,Samdarshi Traders,"Samdarshi Traders, Ram Nath Sharma Marg, Thana...",Gopālganj,Bihar,India,097082 81884,26.463858,84.443956,
7,Maa parvati enterprises,"Maa parvati enterprises, Laljee market, Statio...",Gopālganj,Bihar,India,,26.525914,84.360179,
8,Danish Traders,"Danish Traders, Zilla Parishad Market, Shop No...",Gopālganj,Bihar,India,082940 75786,26.460738,84.439371,
9,Panchdewari hardware and paint store,"Panchdewari hardware and paint store, Bus Stop...",Gopālganj,Bihar,India,,26.581679,84.167932,


In [196]:
# By utilizing the 'pd.concat' function, we can merge all the dataframes and store the result in a variable. To do this,
# compile the dataframes into a list and employ 'pd.concat', ensuring to include 'ignore_index=True' for index reset.
bih_data = [bh1,bh2,bh3]
bih_data = pd.concat(bih_data,ignore_index=True)
bih_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shivajeet Paints & Hardwere,"Shivajeet Paints & Hardwere, Ashok Nagar Ramla...",Patna,Bihar,India,,25.587001,85.143496,
1,ANJAY SALES,"ANJAY SALES, Congress Maidan Rd, Ghrounda, Pat...",Patna,Bihar,India,,25.610428,85.153055,
2,shri ram paint house,"shri ram paint house, near sbi bank, New Etwar...",Patna,Bihar,India,,25.580184,85.131137,
3,Shanti Colour Zone,"Shanti Colour Zone, 90, Vijay Nagar Rd, Hanuma...",Patna,Bihar,India,,25.58798,85.167949,
4,Kumar Hardware,"Kumar Hardware, Jawhar Nagar Opposit The Hotal...",Patna,Bihar,India,,25.583079,85.142723,


#**Chhattisgarh**

The cells below collect data from different cities in Chhattisgarh and then merge it into a single dataset.

In [197]:
# City Name: Raipur

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists the paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/Chhattisgarh/Raipur/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable for this city.
ch1=pd.DataFrame(df)

# Show the first 10 rows.
ch1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shree Hardware & Paints,"Shree Hardware & Paints, Main Road, New Purena...",Raipur,Chhattisgarh,India,093024 66225,21.223981,81.665659,
1,TiarOne Sales,"TiarOne Sales, D-11 Bhanpuri Machinery Market,...",Raipur,Chhattisgarh,India,,21.302093,81.636175,http://www.tiarone.com
2,Amit Hardware And Paints,"11, Amit Hardware And Paints, Hanuman Mandir, ...",Raipur,Chhattisgarh,India,093006 33032,21.23962,81.663962,
3,Kamal Hardware,"Kamal Hardware, Ring Rd Chowk, Chhattisgarh Na...",Raipur,Chhattisgarh,India,090396 26362,21.218002,81.641758,
4,shri Ramdeo colours,"shri Ramdeo colours, usha pride, Mowa, Raipur,...",Raipur,Chhattisgarh,India,,21.267149,81.663809,
5,Indra traders,"Indra traders, Mandi Rd, Jagriti Nagar, Devend...",Raipur,Chhattisgarh,India,,21.262323,81.645467,
6,Upasana Enterprises,"Upasana Enterprises, Radhaswami Nagar, Bhataga...",Raipur,Chhattisgarh,India,093031 62555,21.216701,81.625395,
7,Shivansh traders,"Shivansh traders, front of, Shivansh tower jal...",Raipur,Chhattisgarh,India,,21.228292,81.632534,
8,Maa Bhawani Hardware,"Maa Bhawani Hardware, Shop No. 04, Ammol Vihar...",Raipur,Chhattisgarh,India,098279 34753,21.268638,81.632582,
9,Manish Steel And Hardware,"Manish Steel And Hardware, Pahadipara, Janta C...",Raipur,Chhattisgarh,India,093010 94432,21.262333,81.62283,


In [198]:
# City Names: Bhilai and Baloda

# Set a User-Agent header so the request looks like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# URL of the state-level Chhattisgarh page (there is no city in the path); it returns rows for more than one city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/Chhattisgarh/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# 'pd.read_html' parses every HTML table on the page into a list of dataframes; take the first one.
df = pd.read_html(rqd)[0]

# Keep the table in its own dataframe variable.
ch2=pd.DataFrame(df)

# Show the first 10 rows.
ch2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shri krishna hardware store's,"Shri krishna hardware store's, Nandini Rd, Cam...",Bhilai,Chhattisgarh,India,099934 30373,21.211938,81.378132,
1,G.C. INFRATECH,"G.C. INFRATECH, Nandini Rd, Durgpara, Camp 2, ...",Bhilai,Chhattisgarh,India,,21.216711,81.379048,
2,Jai Bajrang Glass And Paints,"Jai Bajrang Glass And Paints, Akash Ganga, Sup...",Bhilai,Chhattisgarh,India,,21.207419,81.358377,
3,Krishana Hardware Power House Bhilai,"Krishana Hardware Power House Bhilai, Shri Kri...",Bhilai,Chhattisgarh,India,099934 30373,21.209853,81.378133,
4,TEJAS ENTERPRISES,"TEJAS ENTERPRISES, Light Industrial Area, Bhil...",Bhilai,Chhattisgarh,India,,21.230828,81.382896,
5,Tejas enterprises,"Tejas enterprises, Nandini Rd, Near Acc chowk,...",Bhilai,Chhattisgarh,India,,21.216927,81.379206,
6,Shankar Glass centre,"Shankar Glass centre, Nandini Rd, power house,...",Bhilai,Chhattisgarh,India,,21.212233,81.378109,
7,ViKAS PLYWOOD,"ViKAS PLYWOOD, Dhamdha, Chhattisgarh 491331",Bhilai,Chhattisgarh,India,,21.462867,81.333084,
8,MAHAMAYA STEEL,"MAHAMAYA STEEL, Akaltara, Baloda Rd, Word 8, B...",Baloda,Chhattisgarh,India,,22.126484,82.470447,http://mahamayasteel.com
9,Manish Steel And Hardware,"Manish Steel And Hardware, Pahadipara, Janta C...",Raipur,Chhattisgarh,India,093010 94432,21.262333,81.62283,


In [199]:
# The last row (Manish Steel And Hardware, Raipur) already appears in the Raipur data (ch1), so drop it from ch2.
ch2.drop(index=ch2.index[-1],axis=0,inplace=True)

In [200]:
# Check the data set.
ch2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shri krishna hardware store's,"Shri krishna hardware store's, Nandini Rd, Cam...",Bhilai,Chhattisgarh,India,099934 30373,21.211938,81.378132,
1,G.C. INFRATECH,"G.C. INFRATECH, Nandini Rd, Durgpara, Camp 2, ...",Bhilai,Chhattisgarh,India,,21.216711,81.379048,
2,Jai Bajrang Glass And Paints,"Jai Bajrang Glass And Paints, Akash Ganga, Sup...",Bhilai,Chhattisgarh,India,,21.207419,81.358377,
3,Krishana Hardware Power House Bhilai,"Krishana Hardware Power House Bhilai, Shri Kri...",Bhilai,Chhattisgarh,India,099934 30373,21.209853,81.378133,
4,TEJAS ENTERPRISES,"TEJAS ENTERPRISES, Light Industrial Area, Bhil...",Bhilai,Chhattisgarh,India,,21.230828,81.382896,
5,Tejas enterprises,"Tejas enterprises, Nandini Rd, Near Acc chowk,...",Bhilai,Chhattisgarh,India,,21.216927,81.379206,
6,Shankar Glass centre,"Shankar Glass centre, Nandini Rd, power house,...",Bhilai,Chhattisgarh,India,,21.212233,81.378109,
7,ViKAS PLYWOOD,"ViKAS PLYWOOD, Dhamdha, Chhattisgarh 491331",Bhilai,Chhattisgarh,India,,21.462867,81.333084,
8,MAHAMAYA STEEL,"MAHAMAYA STEEL, Akaltara, Baloda Rd, Word 8, B...",Baloda,Chhattisgarh,India,,22.126484,82.470447,http://mahamayasteel.com
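
One caveat with an in-place drop in a notebook: every time the cell is re-run, it removes one more row. A non-mutating alternative is sketched below on a made-up frame (`stores` and its contents are hypothetical, not the scraped data).

```python
import pandas as pd

# Made-up frame standing in for ch2 (hypothetical data).
stores = pd.DataFrame({"name": ["Store A", "Store B", "Store C"]})

# iloc[:-1] selects every row except the last and leaves `stores` unchanged,
# so re-running this line always yields the same result.
trimmed = stores.iloc[:-1]

print(len(stores), len(trimmed))  # 3 2
```

Assigning the result to a new name keeps the original table available if the trim turns out to be wrong.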


In [201]:
# Combine the per-city dataframes with 'pd.concat'.
# Put the dataframes in a list and pass 'ignore_index=True' so the rows are renumbered from 0.
chg_data = [ch1,ch2]
chg_data = pd.concat(chg_data,ignore_index=True)
chg_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shree Hardware & Paints,"Shree Hardware & Paints, Main Road, New Purena...",Raipur,Chhattisgarh,India,093024 66225,21.223981,81.665659,
1,TiarOne Sales,"TiarOne Sales, D-11 Bhanpuri Machinery Market,...",Raipur,Chhattisgarh,India,,21.302093,81.636175,http://www.tiarone.com
2,Amit Hardware And Paints,"11, Amit Hardware And Paints, Hanuman Mandir, ...",Raipur,Chhattisgarh,India,093006 33032,21.23962,81.663962,
3,Kamal Hardware,"Kamal Hardware, Ring Rd Chowk, Chhattisgarh Na...",Raipur,Chhattisgarh,India,090396 26362,21.218002,81.641758,
4,shri Ramdeo colours,"shri Ramdeo colours, usha pride, Mowa, Raipur,...",Raipur,Chhattisgarh,India,,21.267149,81.663809,
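
To illustrate what `ignore_index=True` changes in the cell above, here is a minimal sketch with two made-up frames (`a` and `b` are hypothetical stand-ins, not the scraped data).

```python
import pandas as pd

# Two tiny, made-up frames standing in for ch1 and ch2 (hypothetical data).
a = pd.DataFrame({"name": ["Store A", "Store B"], "city": ["Raipur", "Raipur"]})
b = pd.DataFrame({"name": ["Store C"], "city": ["Bhilai"]})

# Without ignore_index, each frame keeps its own row labels, so labels repeat.
kept = pd.concat([a, b])

# With ignore_index=True, the result gets a fresh 0..n-1 index.
reset = pd.concat([a, b], ignore_index=True)

print(list(kept.index))   # [0, 1, 0]
print(list(reset.index))  # [0, 1, 2]
```

Repeated labels make label-based lookups such as `.loc[0]` return several rows, which is why resetting the index is usually the safer choice when stacking tables.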


#**Gujarat**

We gather data from several cities in Gujarat and then combine it into a single table.

In [202]:
# City Name: Bhavnagar

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/gujarat/Bhavnagar/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
gu1=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
gu1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Gujarat Colour,"Gujarat Colour, Behind Danapith, M.G Road, Lok...",Bhavnagar,Gujarat,India,090336 56352,21.781823,72.145833,https://gujaratcolourbhavnagar.business.site
1,British Paints,"British Paints, Mahathma Gandhi Rd, near I.D.B...",Bhavnagar,Gujarat,India,093755 20052,21.779737,72.145885,http://www.britishpaints.in
2,Shree Shivshakti Paint Store,"Shree Shivshakti Paint Store, G-14, Kapisha Co...",Bhavnagar,Gujarat,India,094289 91029,21.762947,72.1233,https://shree-shivshakti-paint-store.business....
3,Darshan paints,"Darshan paints, Watch tower, 3, Pradhyumna Kan...",Bhavnagar,Gujarat,India,,21.742459,72.132811,
4,Satguru Color World,"Satguru Color World, Old Sindhunagar ,Shop No ...",Bhavnagar,Gujarat,India,,21.752753,72.150845,
5,R R Paints Private Limited,"R R Paints Private Limited, Lokhand Bazar, Dar...",Bhavnagar,Gujarat,India,0278 251 6884,21.78126,72.142909,
6,SHREE SAINATH PAINTS & TRADERS,"Plot no 31, SHREE SAINATH PAINTS & TRADERS, sh...",Bhavnagar,Gujarat,India,,21.76069,72.146079,
7,Coach paint shop,"Coach paint shop, 423B, Bhavnagar Para, Bhavna...",Bhavnagar,Gujarat,India,,21.772024,72.117111,
8,Shah Vrajlal Jamnadas,"Shah Vrajlal Jamnadas, Mahathma Gandhi Rd, nea...",Bhavnagar,Gujarat,India,095588 13051,21.780754,72.146409,
9,Patel Plylam,"Patel Plylam, Madhav Darsahan Complex, Jai Jhu...",Bhavnagar,Gujarat,India,098255 05121,21.767932,72.147971,


In [203]:
# City Name: Ahmedabad

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/gujarat/ahmedabad/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
gu2=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
gu2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Manju Enterprise,"Manju Enterprise, OPP Dhaba WALL Chali Memco B...",Ahmedabad,Gujarat,India,099256 98822,23.045991,72.605827,
1,Padmavati Temple Paints,"Padmavati Temple Paints, 1463, Taliya Ni Pol, ...",Ahmedabad,Gujarat,India,,23.022243,72.598079,
2,Mahavir Paints,"Mahavir Paints, Ramkrishna Paramhans Marg, opp...",Ahmedabad,Gujarat,India,094267 59062,22.995459,72.61905,
3,Om Paint World,"Om Paint World, Gf/5, Pranav app, opp. Bhumina...",Ahmedabad,Gujarat,India,099258 12612,23.072018,72.542134,
4,Mangalam Hardware & Paint Mart,"Mangalam Hardware & Paint Mart, Shop No 1 & 2,...",Ahmedabad,Gujarat,India,096629 64461,22.986049,72.620027,
5,Kena Enterprise,"Kena Enterprise, 27, New Cloth Market, Malia, ...",Ahmedabad,Gujarat,India,079 2216 9328,23.019657,72.593679,
6,Divya Paints,"Divya Paints, Shukan Mall, 48, Opp.Cims Hospit...",Ahmedabad,Gujarat,India,094290 03940,23.071437,72.516548,
7,Aksar Paints,"Aksar Paints, Phase 4, GIDC Vatwa, Ahmedabad, ...",Ahmedabad,Gujarat,India,,22.9764,72.640383,
8,Balaji Paint,"Balaji Paint, Sola Rd, Ghatlodia, C.P. Nagar-1...",Ahmedabad,Gujarat,India,079 2745 4414,23.058993,72.537201,
9,Aksar Paints,"Aksar Paints, Rabari Colony, Amraiwadi, Ahmeda...",Ahmedabad,Gujarat,India,,22.997749,72.641289,


In [204]:
# City Name: Surat

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/gujarat/Surat/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
gu3=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
gu3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Joshi Paints,"Joshi Paints, Shop no-28,nilkanth park society...",Surat,Gujarat,India,,21.210872,72.781403,
1,Ica Paint,"Ica Paint, City Centre, Sosyo Cir, Laxmi Nagar...",Surat,Gujarat,India,,21.172154,72.827077,
2,Husain Enterprise,"Husain Enterprise, Kaskiwad Baghal Rd, Mahidha...",Surat,Gujarat,India,099241 17252,21.199314,72.82951,
3,Shivani Enterprise,"Shivani Enterprise, Shop no-G/48,ascon plaza B...",Surat,Gujarat,India,,21.192996,72.798808,
4,Krishna Hardware,"Krishna Hardware, L-9, Bhavik Complex Ghod Dod...",Surat,Gujarat,India,098258 55483,21.176501,72.817652,
5,Yazdi Techno Sales & Services,"Yazdi Techno Sales & Services, Shop No. 202/5,...",Surat,Gujarat,India,098251 25918,21.143955,72.776517,
6,DIPAK TRADERS : Furniture Polish,"DIPAK TRADERS : Furniture Polish, 9/467, Kotsa...",Surat,Gujarat,India,0261 241 2492,21.195641,72.829517,
7,National Paints,"National Paints, Shop No. G 9 Firdis Tower Ran...",Surat,Gujarat,India,098254 96262,21.215895,72.795848,https://national-paints.com
8,Shakti Sales,"Shakti Sales, 5, Harpati Vas, Near Vishal Soci...",Surat,Gujarat,India,,21.161785,72.807448,
9,Rohit Painter,"Rohit Painter, 173, Bamroli Rd, Vinayak Nagar,...",Surat,Gujarat,India,,21.154848,72.83267,


In [205]:
# City Name: Vadodara

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/gujarat/Vadodara/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
gu4=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
gu4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Jalaram Enterprises,"51, Jalaram Enterprises, Saiyed Vasna Rd, Near...",Vadodara,Gujarat,India,098240 99187,22.295087,73.151227,
1,Gordhanbhai Jayantabhai Patel,"Gordhanbhai Jayantabhai Patel, Chittesthan Han...",Vadodara,Gujarat,India,098257 56322,22.304102,73.199625,https://gordhanbhai-jayantabhai-patel.business...
2,Harikrupa Enterprises,"Harikrupa Enterprises, Rajlaxmi Complex A-, 22...",Vadodara,Gujarat,India,0265 231 2307,22.305176,73.166218,
3,Jay Khodiyar Auto Paints,"Jay Khodiyar Auto Paints, Shop - 5 Plam exotic...",Vadodara,Gujarat,India,096013 55398,22.322539,73.159936,
4,Navarang Paints,"Navarang Paints, S/12, Bahuchraji Nagar, Karel...",Vadodara,Gujarat,India,099989 77821,22.312721,73.197259,
5,G J Patel | Painting Contractor in Vadoadra | ...,G J Patel | Painting Contractor in Vadoadra | ...,Vadodara,Gujarat,India,098257 56322,22.29504,73.199645,
6,Arihant Ceramic,"Arihant Ceramic, Shop No. 37-38 Opp.Mahesh Nag...",Vadodara,Gujarat,India,078789 73846,22.299017,73.238108,
7,Narandas & Sons - Asian & Berger Paints Dealer...,Narandas & Sons - Asian & Berger Paints Dealer...,Vadodara,Gujarat,India,0265 243 9708,22.298123,73.210582,http://www.narandasandsons.com
8,Galaxy Color Co.,"Galaxy Color Co., Jetalpur Rd, Sampatrao Colon...",Vadodara,Gujarat,India,,22.307575,73.175118,
9,Kaivalam Enterprise,"S/8, Kaivalam Enterprise, Vadsar Kalali Road, ...",Vadodara,Gujarat,India,099242 96608,22.26113,73.173458,


In [206]:
# City Name: Rajkot

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/gujarat/Rajkot/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
gu5=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
gu5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Taj Paints,"Taj Paints, Gondal Road Opp Suryakant Hotel & ...",Rajkot,Gujarat,India,,22.30301,70.895234,
1,Patel Hardware & Sanitary,"Patel Hardware & Sanitary, Kothariya Main Rd, ...",Rajkot,Gujarat,India,,22.267314,70.813891,
2,Nachiketa Hardware & Sanitaryware,"Shop No.4, Nachiketa Hardware & Sanitaryware, ...",Rajkot,Gujarat,India,099240 17377,22.299623,70.768386,
3,Siddhpura Enterprise,"Siddhpura Enterprise, 12-LL, Royal Complex, Dh...",Rajkot,Gujarat,India,0281 223 6872,22.28808,70.801562,
4,R R MARKETING,"R R MARKETING, 303-A, ""4 PLUS"" , SARDAR NAGAR ...",Rajkot,Gujarat,India,098795 11513,22.291416,70.7884,http://www.jotun.com
5,Ramdev enterprise,"Ramdev enterprise, Swati Park, Rajkot, Gujarat...",Rajkot,Gujarat,India,,22.255361,70.812913,
6,Indukumar Chunilal Gandhi,"Indukumar Chunilal Gandhi, Canal Rd, opposite ...",Rajkot,Gujarat,India,0281 222 4180,22.288729,70.807241,
7,Johar Colors,"Johar Colors, Hanuman Madhi Chowk 2, Nirav Com...",Rajkot,Gujarat,India,094277 44286,22.300936,70.756924,
8,P & C Colour,"P & C Colour, Sahakar Main Road, Hasan Vadi So...",Rajkot,Gujarat,India,094264 33621,22.270036,70.805014,
9,Patel Colour World,"10, Patel Colour World, Godown Rd, Manhar Plot...",Rajkot,Gujarat,India,099241 95812,22.28609,70.796242,


In [207]:
# Combine the per-city dataframes with 'pd.concat'.
# Put the dataframes in a list and pass 'ignore_index=True' so the rows are renumbered from 0.
guj_data = [gu1,gu2,gu3,gu4,gu5]
guj_data = pd.concat(guj_data,ignore_index=True)
guj_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Gujarat Colour,"Gujarat Colour, Behind Danapith, M.G Road, Lok...",Bhavnagar,Gujarat,India,090336 56352,21.781823,72.145833,https://gujaratcolourbhavnagar.business.site
1,British Paints,"British Paints, Mahathma Gandhi Rd, near I.D.B...",Bhavnagar,Gujarat,India,093755 20052,21.779737,72.145885,http://www.britishpaints.in
2,Shree Shivshakti Paint Store,"Shree Shivshakti Paint Store, G-14, Kapisha Co...",Bhavnagar,Gujarat,India,094289 91029,21.762947,72.1233,https://shree-shivshakti-paint-store.business....
3,Darshan paints,"Darshan paints, Watch tower, 3, Pradhyumna Kan...",Bhavnagar,Gujarat,India,,21.742459,72.132811,
4,Satguru Color World,"Satguru Color World, Old Sindhunagar ,Shop No ...",Bhavnagar,Gujarat,India,,21.752753,72.150845,
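
The five Gujarat cells above differ only in the city segment of the URL, so the same steps could be wrapped in a loop. The following is a sketch under stated assumptions, not the notebook's actual method: the names `BASE`, `HEADERS`, `city_url`, `fetch_city`, and `fetch_state` are hypothetical, the `timeout` value is an arbitrary choice, and the URL pattern is inferred from the URLs used in this section.

```python
from io import StringIO

import pandas as pd
import requests

# URL prefix shared by every city page in this section (inferred pattern).
BASE = "https://rentechdigital.com/smartscraper/business-report-details/india"
HEADERS = {"User-Agent": "mozilla/5.0"}


def city_url(state, city):
    """Build the listing URL for one city, following the pattern used above."""
    return f"{BASE}/{state}/{city}/paint-stores"


def fetch_city(state, city):
    """Download one city page and return its first HTML table as a DataFrame."""
    html = requests.get(city_url(state, city), headers=HEADERS, timeout=30).text
    # Wrapping the text in StringIO avoids the deprecation warning that newer
    # pandas versions emit when read_html is given a raw HTML string.
    return pd.read_html(StringIO(html))[0]


def fetch_state(state, cities):
    """Fetch every city in the list and stack the results into one frame."""
    frames = [fetch_city(state, city) for city in cities]
    return pd.concat(frames, ignore_index=True)


# City segments exactly as they appear in the URLs above.
gujarat_cities = ["Bhavnagar", "ahmedabad", "Surat", "Vadodara", "Rajkot"]
print(city_url("gujarat", gujarat_cities[0]))
# guj_data = fetch_state("gujarat", gujarat_cities)  # needs network access
```

The network call is left commented out so the sketch runs offline; only the URL construction is exercised here.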


#**Haryana**

We obtain data from several cities in Haryana and then merge it into a single table.

In [208]:
# City Name: Faridabad

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/haryana/Faridabad/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
hy1=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
hy1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Rama Drader,"Rama Drader, Shop No 1810, Dabua Pali Road, Ne...",Faridabad,Haryana,India,,28.490126,77.32708,
1,Laxmi Paint Store,"Laxmi Paint Store, 33 Feet Road, Near 17 No. C...",Faridabad,Haryana,India,088609 71617,28.386861,77.275288,
2,Om Paints Hardware & Sanitary,"Om Paints Hardware & Sanitary, Shop Number-1 ,...",Faridabad,Haryana,India,,28.473408,77.321042,https://ompaintshardwaresanitary3339.business....
3,Krishna Enterprise H/W & paints,"Krishna Enterprise H/W & paints, Cheers. Krish...",Faridabad,Haryana,India,098994 58485,28.48141,77.323827,
4,Aman Trading Company,"Aman Trading Company, Shop Number -12/6 Guruku...",Faridabad,Haryana,India,,28.472038,77.296295,https://www.amantrading.co.in
5,AMBE DECORATORS,"AMBE DECORATORS, SBI atm, Main, Chetan Market ...",Faridabad,Haryana,India,,28.480767,77.324616,https://ambedecorators.business.site
6,Bhanu Trading Company,"Bhanu Trading Company, Shop Number - 5D/26, Mo...",Faridabad,Haryana,India,092131 13086,28.396172,77.304133,
7,Bhim Denting,"Bhim Denting, Sector 20B, Haryana 121001",Faridabad,Haryana,India,,28.406267,77.310753,
8,Yash Sanitary & Hardware Paint Store,"Yash Sanitary & Hardware Paint Store, Shop No ...",Faridabad,Haryana,India,099711 79820,28.361595,77.325571,
9,Gaurav Paints & Hardware Store,"Gaurav Paints & Hardware Store, F-82/15, Mitta...",Faridabad,Haryana,India,,28.497224,77.296741,


In [209]:
# City Names: Bilaspur, Bhiwani, and Ateli Mandi (from the statewide Haryana page)

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/haryana/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
hy2=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
hy2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Saini Paint and Hardware Store,"Saini Paint and Hardware Store, Near Kotla, Ch...",Bilāspur,Haryana,India,,30.387849,77.225459,
1,Yamuna Hardware Store.,"Yamuna Hardware Store., near Shiv chownk, Bila...",Bilāspur,Haryana,India,,30.299302,77.302232,https://yamuna-hardware-store.business.site
2,R K Traders,"R K Traders, Bilaspur Rd, Malak Raipur, Haryan...",Bilāspur,Haryana,India,094666 62762,30.317232,77.269502,
3,Om Sai Ram Paints Hardware Store,"Om Sai Ram Paints Hardware Store, Bhawanipur-K...",Bilāspur,Haryana,India,,30.335338,77.312146,
4,Vasu Granite Centre,"Vasu Granite Centre, Tehsil, Vasu Granite Cent...",Bilāspur,Haryana,India,,30.378338,77.219754,https://vasu-granite-centre.business.site
5,Mahesh paints and Electric store,"Mahesh paints and Electric store, Railway Road...",Ateli Mandi,Haryana,India,094164 79290,28.104404,76.256907,
6,Rosy Paints & Saintry Ateli,"Rosy Paints & Saintry Ateli, Ateli, Haryana 12...",Ateli Mandi,Haryana,India,,28.107441,76.259,
7,Shree Ganesh Marbles,"Shree Ganesh Marbles, Loharu Rd, near Jogi Wal...",Bhiwani,Haryana,India,098133 98879,28.789198,76.130297,
8,Pawan Paint & H/WStore,"Pawan Paint & H/WStore, near electricity offic...",Bhiwani,Haryana,India,,28.796796,76.135247,
9,Pawan Paints & H/W store,"Pawan Paints & H/W store, near electricity off...",Bhiwani,Haryana,India,,28.796866,76.13531,
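
A statewide page mixes several cities in one table. One quick way to see which cities it contains, rather than reading them off the output by hand, is sketched below on a made-up `city` column (`sample` is hypothetical and only mirrors the shape of the data).

```python
import pandas as pd

# Hypothetical sample mirroring the 'city' column of a statewide page.
sample = pd.DataFrame(
    {"city": ["Bilāspur", "Bilāspur", "Ateli Mandi", "Bhiwani", "Bhiwani", "Bhiwani"]}
)

# value_counts() tallies how many rows each city contributes.
counts = sample["city"].value_counts()
print(counts.to_dict())
```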


In [210]:
# Combine the two dataframes with 'pd.concat'.
# Put the dataframes in a list and pass 'ignore_index=True' so the rows are renumbered from 0.
hya_data = [hy1,hy2]
hya_data = pd.concat(hya_data,ignore_index=True)
hya_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Rama Drader,"Rama Drader, Shop No 1810, Dabua Pali Road, Ne...",Faridabad,Haryana,India,,28.490126,77.32708,
1,Laxmi Paint Store,"Laxmi Paint Store, 33 Feet Road, Near 17 No. C...",Faridabad,Haryana,India,088609 71617,28.386861,77.275288,
2,Om Paints Hardware & Sanitary,"Om Paints Hardware & Sanitary, Shop Number-1 ,...",Faridabad,Haryana,India,,28.473408,77.321042,https://ompaintshardwaresanitary3339.business....
3,Krishna Enterprise H/W & paints,"Krishna Enterprise H/W & paints, Cheers. Krish...",Faridabad,Haryana,India,098994 58485,28.48141,77.323827,
4,Aman Trading Company,"Aman Trading Company, Shop Number -12/6 Guruku...",Faridabad,Haryana,India,,28.472038,77.296295,https://www.amantrading.co.in


#**Jharkhand**

We collect data from several cities in Jharkhand and then consolidate it into a single table.

In [211]:
# City Name: Jamshedpur

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/jharkhand/Jamshedpur/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
jh1=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
jh1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Star Hardware & Paints,"Star Hardware & Paints, Zakir Nagar, Mango, Ja...",Jamshedpur,Jharkhand,India,,22.835153,86.20242,
1,City sanitary and paint city,"City sanitary and paint city, Rd Number 18, ne...",Jamshedpur,Jharkhand,India,,22.841271,86.204618,
2,Ayodhya Enterprises,"Ayodhya Enterprises, 77 Golmuri market, Jamshe...",Jamshedpur,Jharkhand,India,094146 17852,22.787017,86.228554,http://ayodhyaenterprises.com
3,Shreeji Paints,"Shreeji Paints, Thakurbari Road, Sakchi, Jamsh...",Jamshedpur,Jharkhand,India,0657 223 0575,22.805217,86.203917,
4,KIRAN ENTERPRISES,"KIRAN ENTERPRISES, Shop, Street Number 1, New ...",Jamshedpur,Jharkhand,India,091226 24456,22.796536,86.17264,
5,Kumar Enterprises,"Kumar Enterprises, Near Ram Mandir, New Line, ...",Jamshedpur,Jharkhand,India,,22.82018,86.170629,
6,MAA Durga agency just,"MAA Durga agency just, 2nd Govind Nagar main r...",Jamshedpur,Jharkhand,India,,22.810063,86.165638,
7,M/s. Ayodhya Enterprises,"M/s. Ayodhya Enterprises, H.No. 77, Golmuri Ma...",Jamshedpur,Jharkhand,India,094319 32417,22.794337,86.222027,
8,Sangita construction,"Sangita construction, Tata kandra Main Road, n...",Jamshedpur,Jharkhand,India,,22.804374,86.122254,
9,Hari Enterprises,"Hari Enterprises, Shop No. 33, Chunna Line, Sa...",Jamshedpur,Jharkhand,India,098013 40051,22.803858,86.205903,


In [212]:
# City Name: Ranchi

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/jharkhand/Ranchi/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
jh2=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
jh2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Hi-Life,"Hi-Life, Harmu, Harmu-Argora Bypass Road & Mai...",Ranchi,Jharkhand,India,093342 94300,23.354041,85.299558,
1,M/S. J.J. Sales,"M/S. J.J. Sales, Syndicate Building, Kantatoli...",Ranchi,Jharkhand,India,096935 03536,23.364186,85.346313,
2,New S.K. Paints,"New S.K. Paints, Church Rd, Lower Bazaar, Ranc...",Ranchi,Jharkhand,India,0651 235 1247,23.363636,85.327277,
3,Sumit hardware and sanitary store,"Sumit hardware and sanitary store, Jora mander...",Ranchi,Jharkhand,India,,23.309728,85.394341,
4,Shree ganesh Traders,"Shree ganesh Traders, Shiwam complex, opp. pla...",Ranchi,Jharkhand,India,098351 32256,23.37115,85.332525,
5,JK Wall Paint Shop,"JK Wall Paint Shop, UBI Bank Street, Ranchi - ...",Ranchi,Jharkhand,India,099054 71862,23.352207,85.381682,
6,Ajay Paint Centre & Hardware,"Ajay Paint Centre & Hardware, Super Market, Ko...",Ranchi,Jharkhand,India,098351 18264,23.37236,85.338748,
7,Colour Centre,"Colour Centre, Omair Alan Khan Jamal Building,...",Ranchi,Jharkhand,India,098353 22440,23.350114,85.310887,
8,NEHA AGENCIES,"FLAT NO, NEHA AGENCIES, Mewara Kunj, 202, Bara...",Ranchi,Jharkhand,India,,23.371647,85.318097,
9,Parwati Paint & Hardware,"Parwati Paint & Hardware, Near 1.N Gate, H.B.R...",Ranchi,Jharkhand,India,095727 41999,23.384289,85.369431,


In [213]:
# City Names: Dugda, Dumka, and Bhojudih (from the statewide Jharkhand page)

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/jharkhand/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
jh3=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
jh3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Omkar Glass,"Omkar Glass, Plot No. B/3, Sector 9, B Road, C...",Dugda,Jharkhand,India,094317 38515,23.691597,86.148724,
1,Hari Enterprises,"Hari Enterprises, Shop No. 33, Chunna Line, Sa...",Jamshedpur,Jharkhand,India,098013 40051,22.803858,86.205903,
2,Sangita construction,"Sangita construction, Tata kandra Main Road, n...",Jamshedpur,Jharkhand,India,,22.804374,86.122254,
3,M/s. Ayodhya Enterprises,"M/s. Ayodhya Enterprises, H.No. 77, Golmuri Ma...",Jamshedpur,Jharkhand,India,094319 32417,22.794337,86.222027,
4,MAA Durga agency just,"MAA Durga agency just, 2nd Govind Nagar main r...",Jamshedpur,Jharkhand,India,,22.810063,86.165638,
5,Kumar Enterprises,"Kumar Enterprises, Near Ram Mandir, New Line, ...",Jamshedpur,Jharkhand,India,,22.82018,86.170629,
6,KIRAN ENTERPRISES,"KIRAN ENTERPRISES, Shop, Street Number 1, New ...",Jamshedpur,Jharkhand,India,091226 24456,22.796536,86.17264,
7,"Pankaj Paints (Asian paints, Nerolac Paints, B...","Pankaj Paints (Asian paints, Nerolac Paints, B...",Dumka,Jharkhand,India,,24.38958,87.065392,
8,AGARWAL HARDWARE,"AGARWAL HARDWARE, Kandra, bazar, Rohraband, Jh...",Bhojudih,Jharkhand,India,,23.659379,86.461716,
9,Keshri Brothers,"Keshri Brothers, Main Road, near Burma jewelle...",Dumka,Jharkhand,India,,24.267993,87.248652,https://keshribrothers.business.site


In [214]:
# Drop rows 1 to 6: they are Jamshedpur entries that already appear in jh1.
jh3.drop(index=jh3.index[1:7],axis=0,inplace=True)

In [215]:
# Check the data set.
jh3.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Omkar Glass,"Omkar Glass, Plot No. B/3, Sector 9, B Road, C...",Dugda,Jharkhand,India,094317 38515,23.691597,86.148724,
7,"Pankaj Paints (Asian paints, Nerolac Paints, B...","Pankaj Paints (Asian paints, Nerolac Paints, B...",Dumka,Jharkhand,India,,24.38958,87.065392,
8,AGARWAL HARDWARE,"AGARWAL HARDWARE, Kandra, bazar, Rohraband, Jh...",Bhojudih,Jharkhand,India,,23.659379,86.461716,
9,Keshri Brothers,"Keshri Brothers, Main Road, near Burma jewelle...",Dumka,Jharkhand,India,,24.267993,87.248652,https://keshribrothers.business.site
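
Dropping rows by position works here, but it depends on the page returning rows in the same order each time. As an alternative sketch (not the approach taken in this notebook), duplicates could be removed after stacking by comparing identifying columns. The frames `city_page` and `state_page` are hypothetical stand-ins for `jh1` and `jh3`, holding a few rows copied from the outputs above, and the choice of `key` columns is an assumption.

```python
import pandas as pd

# Stand-ins for jh1 (city page) and jh3 (statewide page), using a few rows
# copied from the outputs above.
city_page = pd.DataFrame({
    "name": ["Hari Enterprises", "Kumar Enterprises"],
    "latitude": [22.803858, 22.82018],
    "longitude": [86.205903, 86.170629],
})
state_page = pd.DataFrame({
    "name": ["Omkar Glass", "Hari Enterprises", "Keshri Brothers"],
    "latitude": [23.691597, 22.803858, 24.267993],
    "longitude": [86.148724, 86.205903, 87.248652],
})

# Columns assumed to identify a store uniquely (an assumption for this sketch).
key = ["name", "latitude", "longitude"]

# Stack both tables, then keep only the first occurrence of each store.
combined = pd.concat([city_page, state_page], ignore_index=True)
deduped = combined.drop_duplicates(subset=key, ignore_index=True)

print(deduped["name"].tolist())
# ['Hari Enterprises', 'Kumar Enterprises', 'Omkar Glass', 'Keshri Brothers']
```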


In [216]:
# Combine the three dataframes with 'pd.concat'.
# Put the dataframes in a list and pass 'ignore_index=True' so the rows are renumbered from 0.
jhk_data = [jh1,jh2,jh3]
jhk_data = pd.concat(jhk_data,ignore_index=True)
jhk_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Star Hardware & Paints,"Star Hardware & Paints, Zakir Nagar, Mango, Ja...",Jamshedpur,Jharkhand,India,,22.835153,86.20242,
1,City sanitary and paint city,"City sanitary and paint city, Rd Number 18, ne...",Jamshedpur,Jharkhand,India,,22.841271,86.204618,
2,Ayodhya Enterprises,"Ayodhya Enterprises, 77 Golmuri market, Jamshe...",Jamshedpur,Jharkhand,India,094146 17852,22.787017,86.228554,http://ayodhyaenterprises.com
3,Shreeji Paints,"Shreeji Paints, Thakurbari Road, Sakchi, Jamsh...",Jamshedpur,Jharkhand,India,0657 223 0575,22.805217,86.203917,
4,KIRAN ENTERPRISES,"KIRAN ENTERPRISES, Shop, Street Number 1, New ...",Jamshedpur,Jharkhand,India,091226 24456,22.796536,86.17264,


#**Karnataka**

We collect data from several cities in Karnataka and then merge it into a single table.

In [217]:
# City Name: Shimoga

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/karnataka/Shimoga/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
kt1=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
kt1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sujatha Traders,"Sujatha Traders, JPN Rd, Mission Compound, Shi...",Shimoga,Karnataka,India,094481 48095,13.934142,75.572169,
1,VENKATAGIRI PAINTS,"VENKATAGIRI PAINTS, 2nd Cross Rd, KIADB Auto C...",Shimoga,Karnataka,India,,13.94185,75.557011,
2,DHANALAXMI Traders,"DHANALAXMI Traders, LLR Rd, Durgigudi, Shivamo...",Shimoga,Karnataka,India,088803 55561,13.932843,75.570926,http://dhanalaxmi-traders.business.site
3,BCK Traders,"BCK Traders, Bandigadi Complex, Kashipura Main...",Shimoga,Karnataka,India,,13.954795,75.553475,
4,Billava Traders,"Billava Traders, N.T. Road, Krishna Tyres Buil...",Shimoga,Karnataka,India,094811 53195,13.920322,75.567634,
5,Sri Balaji Enterprises,"Sri Balaji Enterprises, NT Rd, opp. Kamath Pet...",Shimoga,Karnataka,India,098458 39296,13.922411,75.569434,
6,Mahalakshmi Agency,"Mahalakshmi Agency, Near Laxmi Talkies Circle,...",Shimoga,Karnataka,India,095909 22888,13.944886,75.567259,
7,Nisarga Enterprises,"Nisarga Enterprises, Opp Pruthvi Mansion, Vino...",Shimoga,Karnataka,India,094801 71329,13.946114,75.564248,
8,Sri Mahaveer Trading Co,"Sri Mahaveer Trading Co, BH Road, Karnataka Sa...",Shimoga,Karnataka,India,097421 84650,13.930331,75.576757,
9,A.R. Traders,"A.R. Traders, N.T. Road, opp. MRF Show Room, S...",Shimoga,Karnataka,India,096203 60265,13.924281,75.569297,


In [218]:
# City Name: Mysore

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that lists the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/karnataka/Mysore/paint-stores'

# Fetch the page with the requests library and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a DataFrame in its own variable.
kt2=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
kt2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Ramdev hardware & Plywoods,"Ramdev hardware & Plywoods, Hebbal Main Road, ...",Mysore,Karnataka,India,099016 28119,12.3495,76.622917,
1,Sri VenkateshwaraTraders,"Sri VenkateshwaraTraders, #591, 10th Main Rd, ...",Mysore,Karnataka,India,,12.337031,76.62481,
2,BW ELEGANT PAINTS&GLASS,"BW ELEGANT PAINTS&GLASS, 1678, Syed Abdul Rahm...",Mysore,Karnataka,India,,12.339929,76.675363,
3,Punitha Traders,"Punitha Traders, 240, Near Surya Bakery, Hebba...",Mysore,Karnataka,India,094482 93661,12.343683,76.618233,
4,Sapthagiripaints.,"Sapthagiripaints., Belvatta, Belvata, Mysuru, ...",Mysore,Karnataka,India,097419 62221,12.357478,76.649168,
5,L.N.ENTERPRISES,"L.N.ENTERPRISES, Lakshmikanth Nagar, Hebbal 1s...",Mysore,Karnataka,India,,12.348338,76.607491,
6,Ganavi paints and hardware,"Ganavi paints and hardware, #25/3 opp muthoot ...",Mysore,Karnataka,India,,12.296331,76.702548,
7,Capital paints and hardware,"Capital paints and hardware, 87c 1 st floor, M...",Mysore,Karnataka,India,,12.323455,76.674843,
8,Ramdev Hardware And Paints,"Ramdev Hardware And Paints, Shakthinagar, Mysu...",Mysore,Karnataka,India,,12.304655,76.697909,
9,New Dhanalakshmi Hardware,"New Dhanalakshmi Hardware, Bus Stop, 283, Huns...",Mysore,Karnataka,India,,12.340604,76.585274,
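
The city cells in this notebook differ only in the state and city segments of the URL. As a minimal sketch, the pattern can be captured in a small helper. The names `BASE_URL` and `build_url` are illustrative and not part of the original notebook, and the helper assumes the site keeps the same path layout for every state and city.

```python
# Hypothetical helper: builds the report URL from its variable parts.
BASE_URL = "https://rentechdigital.com/smartscraper/business-report-details/india"


def build_url(state, city=None, category="paint-stores"):
    """Return the report URL for a state, optionally narrowed to one city."""
    parts = [BASE_URL, state]
    if city:
        # City pages insert the city name between the state and the category.
        parts.append(city)
    parts.append(category)
    return "/".join(parts)


print(build_url("karnataka", "Mysore"))
```

Calling `build_url("karnataka", "Mysore")` reproduces the URL used in the cell above, so only the city name has to change from one cell to the next.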


In [219]:
# City Name: Mangalore

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/karnataka/Mangalore/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
kt3=pd.DataFrame(df)

# Display the first 10 rows with head().
kt3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Prakyath Paints And Hardwares,"Prakyath Paints And Hardwares, Hosabettu, Sura...",Mangalore,Karnataka,India,0824 240 6807,12.967828,74.805148,
1,Kamath Traders Paints & Hardwares,"Shop No. 11, Kamath Traders Paints & Hardwares...",Mangalore,Karnataka,India,0824 242 2921,12.867,74.839691,
2,Surface Coats,"Surface Coats, Platinum Jublee Building, Bunts...",Mangalore,Karnataka,India,,12.876769,74.847816,
3,M.A. Enterprise,"M.A. Enterprise, 20-1-122/6 cutuchi memon masj...",Mangalore,Karnataka,India,,12.863613,74.833806,
4,Panchmal Traders,"Panchmal Traders, Beebi Alabi Road, Maidan 4th...",Mangalore,Karnataka,India,099863 01908,12.865732,74.836393,
5,Imperial Paints & Chemicals,"Imperial Paints & Chemicals, Neemarga, P O Roc...",Mangalore,Karnataka,India,0824 277 4292,12.914142,74.855957,
6,Sugam Paint Industries,"Sugam Paint Industries, Lehari Kodialguthu Eas...",Mangalore,Karnataka,India,0824 240 7331,12.914142,74.855957,
7,Varnaa Paints,"Varnaa Paints, Sannidhi, Derebail-Konchadi, Ma...",Mangalore,Karnataka,India,099015 12102,12.906235,74.847405,https://business.google.com
8,Rai car spray painting,"Rai car spray painting, Near Karnataka Polytec...",Mangalore,Karnataka,India,,12.890928,74.852615,
9,Ultrashine,"Ultrashine, Door No. 3-W-29-2372/9, City Comme...",Mangalore,Karnataka,India,087925 78150,12.879051,74.853268,
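
The cells call `requests.get` without a timeout or a status check, so a slow server can hang the notebook and an error page would be passed straight to the parser. The sketch below shows one way to guard against both. `fetch_html`, `HEADERS` and the 30-second default are assumptions made for illustration, not part of the original notebook.

```python
import requests

# Same User-Agent header the notebook cells send.
HEADERS = {"User-Agent": "mozilla/5.0"}


def fetch_html(url, timeout=30):
    """Download a page and return its HTML text, raising on HTTP errors."""
    response = requests.get(url, headers=HEADERS, timeout=timeout)
    # Raise requests.HTTPError on 4xx/5xx instead of parsing an error page.
    response.raise_for_status()
    return response.text
```

The returned text can then be handed to `pd.read_html` exactly as `rqd` is in the cells above.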


In [220]:
# City Name: Belgaum

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/karnataka/Belgaum/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
kt4=pd.DataFrame(df)

# Display the first 10 rows with head().
kt4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shiva Traders,"Shiva Traders, 89 2/B Vardappa Galli Belgaum N...",Belgaum,Karnataka,India,,15.839801,74.523653,
1,Happy enterprises,"Happy enterprises, cross, Nilji, Karnataka 591313",Belgaum,Karnataka,India,,15.86216,74.580461,
2,7 K Enterprises,"7 K Enterprises, Goaves Shopping Complex, 33 G...",Belgaum,Karnataka,India,,15.85036,74.504669,
3,Laxmi Electricals & Hrdware,"Laxmi Electricals & Hrdware, Sambra, Karnataka...",Belgaum,Karnataka,India,080501 88188,15.868537,74.608936,
4,Asian paints ( Patil Traders ),"Asian paints ( Patil Traders ), opposite SPINN...",Belgaum,Karnataka,India,,15.874397,74.634933,
5,Ambica Paints,"Ambica Paints, 473, Khade Bazar, Shahapur, Bel...",Belgaum,Karnataka,India,0831 248 8656,15.843348,74.516954,
6,Shri Ganesh Paints & Plastics,"Shri Ganesh Paints & Plastics, 213, Srishail C...",Belgaum,Karnataka,India,090363 69244,15.841334,74.517488,
7,Sanny Oxides And Paints,"Sanny Oxides And Paints, Plot No. 9, Sry. No. ...",Belgaum,Karnataka,India,094483 69547,15.819511,74.494495,
8,Shri Omkar Paints,"Shri Omkar Paints, Corner, Goaves, 2063, Shri ...",Belgaum,Karnataka,India,097425 57521,15.841445,74.511044,
9,Indradhanush Paints,"Indradhanush Paints, Union Society, Azam Nagar...",Belgaum,Karnataka,India,,15.882978,74.511191,


In [221]:
# City Name: Harihar

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/karnataka/Harihar/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
kt5=pd.DataFrame(df)

# Display the first 10 rows with head().
kt5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,asian paints,"asian paints, Shrinivaasa Nagara, Davanagere, ...",Harihar,Karnataka,India,,14.450945,75.920707,https://www.asianpaints.com
1,G.K. Hardware & Paints,"G.K. Hardware & Paints, Bertuur Road, Davanage...",Harihar,Karnataka,India,096635 06268,14.476956,75.92565,
2,M.M. Hardwares,"M.M. Hardwares, Chamrajpet, Stall No. 77, Dava...",Harihar,Karnataka,India,094487 28994,14.471486,75.923295,
3,Shree Vinayaka Enterprises,"Shree Vinayaka Enterprises, #1224/54 & 1225/55...",Harihar,Karnataka,India,090665 62529,14.444842,75.933504,
4,Sri Basaveshwara Colour World,"Sri Basaveshwara Colour World, Musturrshwara C...",Harihar,Karnataka,India,091645 07211,14.45149,75.903488,
5,Sri Ganesh Hardware,"Sri Ganesh Hardware, Old, PB Rd, Opp to trishu...",Harihar,Karnataka,India,081922 50533,14.458437,75.932843,
6,Pragathi Paints And Hardwares,"Pragathi Paints And Hardwares, Near Pooja Hote...",Harihar,Karnataka,India,099640 49988,14.469624,75.907121,
7,Shwetha Enterprises,"Shwetha Enterprises, Ellamma Nagar, Vinobha Na...",Harihar,Karnataka,India,,14.469873,75.904221,
8,Sri Malleshwara Enterprises,"Sri Malleshwara Enterprises, #1659/35 Opp: Sha...",Harihar,Karnataka,India,099726 01354,14.445293,75.919111,
9,Sneha marketing,"Sneha marketing, Chamaraj pet, Davanagere, Kar...",Harihar,Karnataka,India,,14.47178,75.923263,


In [222]:
# Combine the Karnataka city dataframes: collect them in a list and pass the list to 'pd.concat'.
# 'ignore_index=True' renumbers the rows of the combined dataframe from 0.
ktk_data = [kt1,kt2,kt3,kt4,kt5]
ktk_data = pd.concat(ktk_data,ignore_index=True)
ktk_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sujatha Traders,"Sujatha Traders, JPN Rd, Mission Compound, Shi...",Shimoga,Karnataka,India,094481 48095,13.934142,75.572169,
1,VENKATAGIRI PAINTS,"VENKATAGIRI PAINTS, 2nd Cross Rd, KIADB Auto C...",Shimoga,Karnataka,India,,13.94185,75.557011,
2,DHANALAXMI Traders,"DHANALAXMI Traders, LLR Rd, Durgigudi, Shivamo...",Shimoga,Karnataka,India,088803 55561,13.932843,75.570926,http://dhanalaxmi-traders.business.site
3,BCK Traders,"BCK Traders, Bandigadi Complex, Kashipura Main...",Shimoga,Karnataka,India,,13.954795,75.553475,
4,Billava Traders,"Billava Traders, N.T. Road, Krishna Tyres Buil...",Shimoga,Karnataka,India,094811 53195,13.920322,75.567634,
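
After a merge like this, it can help to confirm that every city contributed rows and to drop the leftover `Unnamed: 0` column, which appears to hold only the row number from each source table. The following is a minimal sketch on toy data. The two small frames and their store names are placeholders and do not come from the scraped pages.

```python
import pandas as pd

# Toy stand-ins for two city tables (placeholder values, not scraped data).
city_a = pd.DataFrame({"Unnamed: 0": [0, 1],
                       "name": ["Store A", "Store B"],
                       "city": ["Shimoga", "Shimoga"]})
city_b = pd.DataFrame({"Unnamed: 0": [0],
                       "name": ["Store C"],
                       "city": ["Mysore"]})

# Stack the tables and renumber the rows from 0.
combined = pd.concat([city_a, city_b], ignore_index=True)

# Drop the per-table row counter, which is redundant after the merge.
combined = combined.drop(columns="Unnamed: 0")

# Count the rows each city contributed.
rows_per_city = combined["city"].value_counts()
print(rows_per_city)
```

The same two calls, `drop(columns="Unnamed: 0")` and `value_counts()` on the `city` column, could be applied to `ktk_data`.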


#**Kerala**

We first collect data from several cities in Kerala and then merge it into a single dataframe.

In [223]:
# City Name: Kozhikode

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/kerala/Kozhikode/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
kr1=pd.DataFrame(df)

# Display the first 10 rows with head().
kr1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Safeway,"Safeway, KP Kesava Menon Rd, Opp Kozhikode Tow...",Kozhikode,Kerala,India,0495 236 6654,11.250292,75.779413,http://www.safeway.com
1,Home Touch,"Home Touch, Karikkamkulam, Thadampattuthazham,...",Kozhikode,Kerala,India,098959 27515,11.290304,75.789052,
2,Colour World,"Colour World, Panniyankara Overbridge, Kallai,...",Kozhikode,Kerala,India,,11.228764,75.785737,
3,EP colours paints & Hardwares,"EP colours paints & Hardwares, West, pallikand...",Kozhikode,Kerala,India,,11.237784,75.785405,https://ep-colours-paints-hardwares.business.site
4,P.K. Hardware,"P.K. Hardware, Kallai Rd, Vattampoil, Vattampo...",Kozhikode,Kerala,India,098951 12911,11.238932,75.78682,
5,Hi-Build Coatings Pvt. Ltd.,"Hi-Build Coatings Pvt. Ltd., Room No.3/3009, V...",Kozhikode,Kerala,India,0495 276 1008,11.258246,75.770967,http://www.hbc.co.in
6,Spray painting,"Spray painting, Chalappuram, Kozhikode, Kerala...",Kozhikode,Kerala,India,,11.240794,75.788741,
7,NEW GLACE SPRAY PAINTING WORKS,"NEW GLACE SPRAY PAINTING WORKS, Saibaba Mandir...",Kozhikode,Kerala,India,,11.244071,75.784596,
8,J.J. Paints,"No:050/1382, J.J. Paints, Wayanad Rd, Opposite...",Kozhikode,Kerala,India,0495 276 7373,11.265268,75.779237,
9,Kansai Nerolac paints,"Kansai Nerolac paints, 3/2501-A, Beach Rd, Kun...",Kozhikode,Kerala,India,0495 236 0118,11.269417,75.769314,http://www.nerolac.com


In [224]:
# City Name: Kottayam

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/kerala/Kottayam/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
kr2=pd.DataFrame(df)

# Display the first 10 rows with head().
kr2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,St.George Paint Store,"St.George Paint Store, Ettumanoor - Peroor Rd,...",Kottayam,Kerala,India,,9.645916,76.564907,
1,Colour Universe,"Colour Universe, Colour universe Sreesylam Bui...",Kottayam,Kerala,India,098464 40124,9.557574,76.571791,
2,Rainbow Colour House,"Rainbow Colour House, Ettumanoor, Kerala 686631",Kottayam,Kerala,India,079027 51035,9.671131,76.56186,
3,Vijaya Trade Center,"Vijaya Trade Center, Market Landing Rd, near t...",Kottayam,Kerala,India,081568 05425,9.587113,76.5235,
4,Vembanad Agencies,"Vembanad Agencies, Kalarickal Bazar, Eerayil K...",Kottayam,Kerala,India,0481 256 4314,9.589156,76.522897,
5,Canyon Paints,"Canyon Paints, Kottayam, Kerala 686001",Kottayam,Kerala,India,085890 23400,9.590683,76.52258,
6,British Paints,"British Paints, Near ICICI Bank ATM M.C, Kodat...",Kottayam,Kerala,India,094471 32466,9.667185,76.554597,http://www.britishpaints.in
7,House Of Paints,"House Of Paints, SH 1, Ettumanoor, Kerala 686631",Kottayam,Kerala,India,0481 253 3417,9.667328,76.554381,
8,Asian paints Colour Idea Store Ettumanoor,"Asian paints Colour Idea Store Ettumanoor, Par...",Kottayam,Kerala,India,090723 99552,9.658354,76.550474,
9,Asian paint Color world,"Asian paint Color world, Mannanam Rd, Mannanam...",Kottayam,Kerala,India,,9.64611,76.5263,


In [225]:
# City Name: Kollam

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/kerala/Kollam/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
kr3=pd.DataFrame(df)

# Display the first 10 rows with head().
kr3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Siva Sakthi Paints,"Siva Sakthi Paints, Karicode, Peroor, Kottamka...",Kollam,Kerala,India,092872 17100,8.905116,76.623516,
1,"Asian Paints Dealer , Alummoodu","Asian Paints Dealer , Alummoodu, Bus Stop, Alu...",Kollam,Kerala,India,,8.918973,76.668241,
2,Salman Traders,"Salman Traders, Salman .manzil, Thaha Jct Rd, ...",Kollam,Kerala,India,098957 89816,8.909702,76.645988,
3,MYK agencies,"MYK agencies, Kottiyam - Kundara Rd, Perumpuzh...",Kollam,Kerala,India,,8.93765,76.679121,
4,FAIHA TRADERS,"FAIHA TRADERS, Kottiyam - Kundara Rd, Thazhuth...",Kollam,Kerala,India,,8.890328,76.678816,
5,Sen Trade Center,"Sen Trade Center, Kollam District, Kundara, Ke...",Kollam,Kerala,India,093879 69007,8.956489,76.671364,
6,SBA Agencies,"SBA Agencies, Nedumonkavu, Kundara, Kerala 691501",Kollam,Kerala,India,,8.961096,76.680998,
7,Saniya Colors,"Saniya Colors, Nedumonkavu, Kundara, Kerala 69...",Kollam,Kerala,India,,8.959766,76.678099,
8,GS Paints & Glass Centre,"GS Paints & Glass Centre, Nedumonkavu, Kundara...",Kollam,Kerala,India,0474 252 5007,8.960815,76.680313,
9,KM Traders,"KM Traders, Unnamed Road, Anchalummood, Kerala...",Kollam,Kerala,India,,8.935972,76.604351,


In [226]:
# City Name: Thiruvananthapuram

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/kerala/Thiruvananthapuram/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
kr4=pd.DataFrame(df)

# Display the first 10 rows with head().
kr4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Drishya Paints,"Drishya Paints, SS Shoping complex, Thirumala,...",Thiruvananthapuram,Kerala,India,,8.501532,76.993208,
1,Jeeva Traders,"Jeeva Traders, Near Block Office, Kazhakkootta...",Thiruvananthapuram,Kerala,India,093877 57271,8.576881,76.867488,
2,QUALITY TRADERS,"QUALITY TRADERS, Pachalloor Punkulam Rd, Thiru...",Thiruvananthapuram,Kerala,India,0471 238 0696,8.434404,76.963651,
3,ABC paint house,"ABC paint house, Test, Karyavattom, Thiruvanan...",Thiruvananthapuram,Kerala,India,,8.562979,76.887426,
4,NIPPON PAINTS (DAKSHINA PAINTS),"NIPPON PAINTS (DAKSHINA PAINTS), High School R...",Thiruvananthapuram,Kerala,India,,8.565758,76.87351,
5,BUIL-TECH COLOURS,"BUIL-TECH COLOURS, TC 68/325-1, Farras Complex...",Thiruvananthapuram,Kerala,India,,8.440394,76.956137,
6,New Chala Agencies,"New Chala Agencies, Eanchakkal, Thiruvananthap...",Thiruvananthapuram,Kerala,India,,8.479322,76.933031,
7,"M M Paints, Hardware, Plumbing Materials","M M Paints, Hardware, Plumbing Materials, Vall...",Thiruvananthapuram,Kerala,India,098951 23362,8.468092,76.923088,
8,Ambili Paints,"Ambili Paints, Lalaji Junction, Karunagapally,...",Thiruvananthapuram,Kerala,India,0471 262 1952,8.487495,76.948623,
9,Vibgyor Colour World,"Vibgyor Colour World, Kazhakuttam, Vadakkumbha...",Thiruvananthapuram,Kerala,India,,8.572815,76.869723,


In [227]:
# Combine the Kerala city dataframes: collect them in a list and pass the list to 'pd.concat'.
# 'ignore_index=True' renumbers the rows of the combined dataframe from 0.
krl_data = [kr1,kr2,kr3,kr4]
krl_data = pd.concat(krl_data,ignore_index=True)
krl_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Safeway,"Safeway, KP Kesava Menon Rd, Opp Kozhikode Tow...",Kozhikode,Kerala,India,0495 236 6654,11.250292,75.779413,http://www.safeway.com
1,Home Touch,"Home Touch, Karikkamkulam, Thadampattuthazham,...",Kozhikode,Kerala,India,098959 27515,11.290304,75.789052,
2,Colour World,"Colour World, Panniyankara Overbridge, Kallai,...",Kozhikode,Kerala,India,,11.228764,75.785737,
3,EP colours paints & Hardwares,"EP colours paints & Hardwares, West, pallikand...",Kozhikode,Kerala,India,,11.237784,75.785405,https://ep-colours-paints-hardwares.business.site
4,P.K. Hardware,"P.K. Hardware, Kallai Rd, Vattampoil, Vattampo...",Kozhikode,Kerala,India,098951 12911,11.238932,75.78682,
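
Because every city follows the same fetch-then-store steps, the per-city variables (`kr1`, `kr2`, and so on) could be replaced by a loop that builds the list for `pd.concat` directly. This is a sketch only. `collect_state` is a hypothetical helper, and `fake_load` is a stand-in that fabricates one placeholder row per city so the example runs without network access. In the notebook, the loader would be the request-and-parse steps from the cells above.

```python
import pandas as pd


def collect_state(cities, load_city):
    """Load one table per city and stack them into a single dataframe."""
    frames = [load_city(city) for city in cities]
    return pd.concat(frames, ignore_index=True)


def fake_load(city):
    # Stand-in loader: returns a one-row placeholder table for the city.
    return pd.DataFrame({"name": [f"{city} Paints"], "city": [city]})


kerala_cities = ["Kozhikode", "Kottayam", "Kollam", "Thiruvananthapuram"]
demo = collect_state(kerala_cities, fake_load)
print(demo)
```

Passing the loader as an argument keeps the stacking logic separate from the download step, which makes the loop easy to try on sample data first.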


#**Madhya Pradesh**

We collect data from several cities in Madhya Pradesh and merge it once collection is complete.

In [228]:
# City Name: Jabalpur

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/madhya-pradesh/Jabalpur/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
mp1=pd.DataFrame(df)

# Display the first 10 rows with head().
mp1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Samajhdar Choice,"Samajhdar Choice, Main Road, Sanjeevani Nagar,...",Jabalpur,Madhya Pradesh,India,095842 48841,23.164695,79.903179,
1,Rani Traders & Hardware,"Rani Traders & Hardware, Amkhera Road, Jagriti...",Jabalpur,Madhya Pradesh,India,093000 83728,23.205739,79.93541,
2,British Paints,"British Paints, Shop No-6, Triveni Plaza Shard...",Jabalpur,Madhya Pradesh,India,,23.153258,79.908707,
3,M/s Apollo Traders,"M/s Apollo Traders, 585, Nunhai Dixitpura Road...",Jabalpur,Madhya Pradesh,India,093008 00033,23.178389,79.93183,
4,Mahadeo Hardware,"Mahadeo Hardware, Gwarighat Road, Gorakhpur, J...",Jabalpur,Madhya Pradesh,India,097532 32333,23.151508,79.927294,
5,Ess Kay Enterprises,"Ess Kay Enterprises, Shop No.3,81, Rabra Compl...",Jabalpur,Madhya Pradesh,India,094253 25759,23.15031,79.934484,
6,Color Zone,"Color Zone, Shop No. 03, Main Road, Ambedkar W...",Jabalpur,Madhya Pradesh,India,098935 01207,23.192538,79.925425,
7,Nanak Traders,"Nanak Traders, Bengali Colony, Ranjhi, Jabalpu...",Jabalpur,Madhya Pradesh,India,088787 96720,23.207321,80.005265,
8,Kashish Enterprises,"Kashish Enterprises, 1098, Hathital Rd, Gorakh...",Jabalpur,Madhya Pradesh,India,093294 01118,23.150322,79.926706,
9,AP ColourIdeas BHATIA HARDWARE STORES,"AP ColourIdeas BHATIA HARDWARE STORES, Shop No...",Jabalpur,Madhya Pradesh,India,098930 31010,23.167143,79.933618,http://www.asianpaints.com


In [229]:
# City Name: Indore

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/madhya-pradesh/indore/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
mp2=pd.DataFrame(df)

# Display the first 10 rows with head().
mp2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,MMSolutions Paints & Hardware,"MMSolutions Paints & Hardware, Bholaram Ustad ...",Indore,Madhya Pradesh,India,099777 55517,22.684463,75.863993,http://business.google.com
1,SHYAM TRADERS,"SHYAM TRADERS, 293-A, Sector A, Mahalaxmi Naga...",Indore,Madhya Pradesh,India,099775 75353,22.754569,75.907375,https://shyamtraders-paintstore.business.site
2,Arti Enterprises,"#429, Arti Enterprises, Kanadia Main Rd, Alok ...",Indore,Madhya Pradesh,India,098265 72574,22.723003,75.918647,
3,T.A Lokhandwala,"T.A Lokhandwala, 122, Siyaganj, Indore, Madhya...",Indore,Madhya Pradesh,India,,22.716831,75.863921,http://ta-lokhandwala.business.site
4,Sk traders,"Sk traders, Apply hospital, 49, near by Indore...",Indore,Madhya Pradesh,India,,22.69551,75.862836,http://www.sk.com
5,Chandan Traders,"Chandan Traders, No.3119-E, Western Ring Rd, S...",Indore,Madhya Pradesh,India,,22.685747,75.828798,
6,hakimi hardware and paint,"hakimi hardware and paint, 3 sukhdev nagar 60 ...",Indore,Madhya Pradesh,India,,22.708195,75.882442,https://hakimi-hardware-and-paint.business.site
7,Ezzi paints and sanitary,"Ezzi paints and sanitary, Shop no.G-3,Naman av...",Indore,Madhya Pradesh,India,097557 45552,22.682906,75.843179,http://business.google.com
8,Suvidha traders,"Suvidha traders, 82, Banganaga Main Rd, Kushwa...",Indore,Madhya Pradesh,India,0731 242 2209,22.743581,75.850798,
9,New Hardware & Paints,"54, New Hardware & Paints, Netaji Subhash Marg...",Indore,Madhya Pradesh,India,098264 30005,22.7225,75.853287,


In [230]:
# City Name: Bhopal

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/madhya-pradesh/Bhopal/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
mp3=pd.DataFrame(df)

# Display the first 10 rows with head().
mp3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Lakshya Traders,"Shop No. 2, Lakshya Traders, A-17, Kolar Rd, S...",Bhopal,Madhya Pradesh,India,0755 424 6081,23.187726,77.416786,
1,Asian Paints Awadhpuri,"Plot No.8, Asian Paints Awadhpuri, Rachna Viha...",Bhopal,Madhya Pradesh,India,097709 99333,23.230988,77.48929,
2,Paras Hardware,"Paras Hardware, Shamin Manzil, Raj Talkies Rd,...",Bhopal,Madhya Pradesh,India,075099 96055,23.248649,77.41896,
3,Bharat Traders,"Bharat Traders, 2, New Market, STT Nagar, TT N...",Bhopal,Madhya Pradesh,India,0755 255 6297,23.234348,77.400997,
4,Jain Traders,"Jain Traders, Bus Stand, Gandhinagar Gandhinag...",Bhopal,Madhya Pradesh,India,099070 01527,23.298167,77.344583,
5,Milan radium cutting,"Milan radium cutting, Shop no 25, Tulsi nagar,...",Bhopal,Madhya Pradesh,India,097523 66063,23.225843,77.40708,
6,Gupta Trading Company,"Shop No. 1, Gupta Trading Company, Berasia Rd,...",Bhopal,Madhya Pradesh,India,,23.300074,77.403981,
7,R.K Traders,"15, R.K Traders, New Market, STT Nagar, TT Nag...",Bhopal,Madhya Pradesh,India,0755 255 4006,23.234242,77.400745,
8,Nirman Enterprises,"H No. 54, Nirman Enterprises, Ashoka Garden, A...",Bhopal,Madhya Pradesh,India,093298 04352,23.257661,77.428724,
9,R.N.G. Enterprises,"R.N.G. Enterprises, Yantriki Bhawan, Zone-I, M...",Bhopal,Madhya Pradesh,India,,23.232873,77.4323,


In [231]:
# City Name: Gwalior

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/madhya-pradesh/Gwalior/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
mp4=pd.DataFrame(df)

# Display the first 10 rows with head().
mp4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Goyal Paints,"Goyal Paints, Shaheed Circle, Hem Singh ki Par...",Gwalior,Madhya Pradesh,India,097704 68585,26.187279,78.141012,
1,Bantu Paints,"Bantu Paints, Taraganj Pull Rd, Taraganj, Sama...",Gwalior,Madhya Pradesh,India,,26.196322,78.140264,
2,Shiv Pop House,"Shiv Pop House, Infront Of, near Naka Chandra ...",Gwalior,Madhya Pradesh,India,098277 22910,26.192278,78.166266,
3,M/s. Raja Paints,"M/s. Raja Paints, Lohiya Bazaar Rd, Dal Bazaar...",Gwalior,Madhya Pradesh,India,093021 13329,26.198768,78.157233,
4,Kailadevi Paints,"Kailadevi Paints, Koteshwar, Chandan Nagar, Gw...",Gwalior,Madhya Pradesh,India,099936 86745,26.235617,78.162879,
5,P.O.P. House,"P.O.P. House, Roshni Ghar Rd, Near SDM Road, J...",Gwalior,Madhya Pradesh,India,0751 645 0627,26.200873,78.16222,
6,"Paint, hardware and sanatery store","Paint, hardware and sanatery store, In front o...",Gwalior,Madhya Pradesh,India,,26.196913,78.152707,https://paint-hardware-and-sanatery-store.busi...
7,S.S. Hardware & Paints,"S.S. Hardware & Paints, Near Police Chowki, AH...",Gwalior,Madhya Pradesh,India,094069 78303,26.201283,78.136575,
8,Aastha Tiles And Paints,"B-Block, Aastha Tiles And Paints, Sagartaal Ch...",Gwalior,Madhya Pradesh,India,094253 07460,26.241081,78.155948,
9,Akash Iron & Paints,"Akash Iron & Paints, Raj Mandi, Ghauspura, Gwa...",Gwalior,Madhya Pradesh,India,,26.235123,78.171919,https://akash-iron-paints.business.site


In [232]:
# City Names: Anuppur, Amla, Ashoknagar, and Ashta (taken from the state-level page)

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/madhya-pradesh/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
mp5=pd.DataFrame(df)

# Display the first 10 rows with head().
mp5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Imperial Stores,"Imperial Stores, Anuppur, Madhya Pradesh 484224",Anuppur,Madhya Pradesh,India,078691 83449,23.118498,81.697235,
1,Viswakarma Paint And Spray Shop,"Viswakarma Paint And Spray Shop, petrol pump, ...",Amla,Madhya Pradesh,India,,21.994738,78.354449,
2,AP ColourIdeas BHATIA HARDWARE STORES,"AP ColourIdeas BHATIA HARDWARE STORES, Shop No...",Jabalpur,Madhya Pradesh,India,098930 31010,23.167143,79.933618,http://www.asianpaints.com
3,Kashish Enterprises,"Kashish Enterprises, 1098, Hathital Rd, Gorakh...",Jabalpur,Madhya Pradesh,India,093294 01118,23.150322,79.926706,
4,Nanak Traders,"Nanak Traders, Bengali Colony, Ranjhi, Jabalpu...",Jabalpur,Madhya Pradesh,India,088787 96720,23.207321,80.005265,
5,Goyal Enterprises,"Goyal Enterprises, behind Reliance Petrol Pump...",Ashoknagar,Madhya Pradesh,India,,24.577515,77.731849,
6,Gaurav Paints,"Gaurav Paints, Ashoknagar,, Purana Bazar, Asho...",Ashoknagar,Madhya Pradesh,India,,24.569034,77.734166,
7,Color Zone,"Color Zone, Shop No. 03, Main Road, Ambedkar W...",Jabalpur,Madhya Pradesh,India,098935 01207,23.192538,79.925425,
8,Ess Kay Enterprises,"Ess Kay Enterprises, Shop No.3,81, Rabra Compl...",Jabalpur,Madhya Pradesh,India,094253 25759,23.15031,79.934484,
9,Neeraj painter khushigraphics,"Neeraj painter khushigraphics, Bajrang Colony,...",Ashta,Madhya Pradesh,India,,22.999759,76.723991,http://www.khushigraphics.com


In [233]:
# Drop the rows at positions 2-4: these Jabalpur stores are already present in mp1.
mp5.drop(index=mp5.index[2:5],axis=0,inplace=True)

In [234]:
# After the first drop, positions 4-5 hold the two remaining Jabalpur rows (labels 7 and 8); drop them as well.
mp5.drop(index=mp5.index[4:6],axis=0,inplace=True)

In [235]:
# Check the data set.
mp5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Imperial Stores,"Imperial Stores, Anuppur, Madhya Pradesh 484224",Anuppur,Madhya Pradesh,India,078691 83449,23.118498,81.697235,
1,Viswakarma Paint And Spray Shop,"Viswakarma Paint And Spray Shop, petrol pump, ...",Amla,Madhya Pradesh,India,,21.994738,78.354449,
5,Goyal Enterprises,"Goyal Enterprises, behind Reliance Petrol Pump...",Ashoknagar,Madhya Pradesh,India,,24.577515,77.731849,
6,Gaurav Paints,"Gaurav Paints, Ashoknagar,, Purana Bazar, Asho...",Ashoknagar,Madhya Pradesh,India,,24.569034,77.734166,
9,Neeraj painter khushigraphics,"Neeraj painter khushigraphics, Bajrang Colony,...",Ashta,Madhya Pradesh,India,,22.999759,76.723991,http://www.khushigraphics.com
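
The two positional drops above depend on the exact row order returned by the page. A filter on the `city` column is one alternative that does not rely on positions. The sketch below uses a four-row toy table whose names and cities echo rows shown above. `already_collected` is a hypothetical list of cities that already have their own dataframe.

```python
import pandas as pd

# Toy version of the state-level listing (a few rows echoing the output above).
listing = pd.DataFrame({
    "name": ["Imperial Stores", "Nanak Traders", "Gaurav Paints", "Color Zone"],
    "city": ["Anuppur", "Jabalpur", "Ashoknagar", "Jabalpur"],
})

# Cities that already have their own dataframe (hypothetical list).
already_collected = ["Jabalpur"]

# Keep only rows whose city has not been collected yet, then renumber the rows.
remaining = listing[~listing["city"].isin(already_collected)].reset_index(drop=True)
print(remaining)
```

With this approach the filter still works if the page later returns the Jabalpur rows in a different order.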


In [236]:
# Combine the Madhya Pradesh dataframes: collect them in a list and pass the list to 'pd.concat'.
# 'ignore_index=True' renumbers the rows of the combined dataframe from 0.
mpd_data = [mp1,mp2,mp3,mp4,mp5]
mpd_data = pd.concat(mpd_data,ignore_index=True)
mpd_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Samajhdar Choice,"Samajhdar Choice, Main Road, Sanjeevani Nagar,...",Jabalpur,Madhya Pradesh,India,095842 48841,23.164695,79.903179,
1,Rani Traders & Hardware,"Rani Traders & Hardware, Amkhera Road, Jagriti...",Jabalpur,Madhya Pradesh,India,093000 83728,23.205739,79.93541,
2,British Paints,"British Paints, Shop No-6, Triveni Plaza Shard...",Jabalpur,Madhya Pradesh,India,,23.153258,79.908707,
3,M/s Apollo Traders,"M/s Apollo Traders, 585, Nunhai Dixitpura Road...",Jabalpur,Madhya Pradesh,India,093008 00033,23.178389,79.93183,
4,Mahadeo Hardware,"Mahadeo Hardware, Gwarighat Road, Gorakhpur, J...",Jabalpur,Madhya Pradesh,India,097532 32333,23.151508,79.927294,
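
Since the state-level page repeated stores that were also on a city page, a duplicate check after the merge is a cheap safeguard. The sketch below assumes that a matching `name` and `address` pair identifies the same store. The two toy frames use shortened placeholder addresses rather than the scraped values.

```python
import pandas as pd

# Toy frames with one overlapping store (addresses are shortened placeholders).
first = pd.DataFrame({
    "name": ["Color Zone", "Nanak Traders"],
    "address": ["Shop No. 03", "Bengali Colony"],
    "city": ["Jabalpur", "Jabalpur"],
})
second = pd.DataFrame({
    "name": ["Color Zone", "Gaurav Paints"],
    "address": ["Shop No. 03", "Purana Bazar"],
    "city": ["Jabalpur", "Ashoknagar"],
})

merged = pd.concat([first, second], ignore_index=True)

# Keep the first occurrence of each (name, address) pair and renumber the rows.
deduped = merged.drop_duplicates(subset=["name", "address"]).reset_index(drop=True)
print(len(merged), len(deduped))
```

Comparing the two lengths shows how many repeated rows were removed.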


#**Maharashtra**

For Maharashtra, we again gather data city by city and then merge it all together.

In [237]:
# City Name: Kolhapur

# Define the request headers (a User-Agent string) and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/maharashtra/Kolhapur/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table it finds.
df = pd.read_html(rqd)[0]

# Store the table under its own name ('pd.read_html' already returns a DataFrame).
mh1=pd.DataFrame(df)

# Display the first 10 rows with head().
mh1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Woodland Sheets,"Woodland Sheets, Lohia Rd, Gandhinagar, Mahara...",Kolhapur,Maharashtra,India,,16.705293,74.296535,
1,Siddhant spray painting,"Siddhant spray painting, Shastri Nagar, Kolhap...",Kolhapur,Maharashtra,India,,16.690492,74.242275,
2,Patvegar Enterprises,"Patvegar Enterprises, 732, 4th Ln, E Ward, Sha...",Kolhapur,Maharashtra,India,098609 44959,16.700782,74.23451,
3,Satyam Paints,"Satyam Paints, shiru chowk road, Gandhinagar, ...",Kolhapur,Maharashtra,India,,16.706333,74.301245,
4,Deshmukh Paints,"Deshmukh Paints, E Ward, bhosale park, Kadamwa...",Kolhapur,Maharashtra,India,,16.721259,74.257748,
5,Aarambh traders,"Aarambh traders, Subhash Rd, Laxmipuri, Kolhap...",Kolhapur,Maharashtra,India,,16.696866,74.231518,
6,Ekta Paints,"Ekta Paints, Vasanshah Colony, Gandhinagar, Ma...",Kolhapur,Maharashtra,India,,16.706626,74.301111,
7,ANKUSH PAINTS,"ANKUSH PAINTS, Opp Swastik Market,Mainroad, Ga...",Kolhapur,Maharashtra,India,0231 261 3778,16.705312,74.296269,
8,Sai Paints,"Sai Paints, Y.P.Powar nagar, Kolhapur, Maharas...",Kolhapur,Maharashtra,India,0231 269 0711,16.691371,74.235683,
9,STAR MARKETING,"STAR MARKETING, SHAHUNAGAR, GANPATI CHOWK, Kol...",Kolhapur,Maharashtra,India,095959 37899,16.690846,74.24319,
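
The per-city cells in this notebook differ only in the state and city segments of the URL. A minimal sketch of building those URLs from a list of city names follows; `BASE_URL`, `build_url`, and `maharashtra_cities` are hypothetical names introduced here for illustration, and the URL pattern is assumed from the cells in this notebook rather than from any documented API.

```python
# Assumed URL prefix, copied from the URLs used in this notebook.
BASE_URL = "https://rentechdigital.com/smartscraper/business-report-details/india"


def build_url(state, city, category="paint-stores"):
    """Return the report URL for one city (hypothetical helper)."""
    # The state segment is lower-case in the notebook's URLs; the city keeps its capitalisation.
    return f"{BASE_URL}/{state.lower()}/{city}/{category}"


# Cities scraped in the Maharashtra section.
maharashtra_cities = ["Kolhapur", "Mumbai", "Pune", "Nagpur", "Thane", "Nashik", "Aurangabad"]

# Map each city name to its URL so the same fetch code can be reused in a loop.
urls = {city: build_url("maharashtra", city) for city in maharashtra_cities}
print(urls["Kolhapur"])
```

Keeping the URLs in a dictionary keyed by city makes it harder to mistype a single URL when a cell is copied and edited by hand.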


In [238]:
# City Name: Mumbai

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/maharashtra/Mumbai/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
mh2=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
mh2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Jotun,"Jotun, 124, Sant Savata Mali Marg, Opposite Ma...",Mumbai,Maharashtra,India,044 2841 9420,18.975832,72.836045,http://www.jotun.com
1,S.Rajnikant & Co,"S.Rajnikant & Co, Shop No-8, R.K. Building-2, ...",Mumbai,Maharashtra,India,022 2382 9146,18.958928,72.822444,
2,Milan Electrical Hardware & Paints,"Milan Electrical Hardware & Paints, Vijay Naga...",Mumbai,Maharashtra,India,099676 69818,19.047036,72.893274,
3,Khuzema Hardware Stores,"Khuzema Hardware Stores, 275, Balgovind Chawl,...",Mumbai,Maharashtra,India,022 2309 6212,18.962375,72.817898,
4,Mohanlal Rajmal & Co,"Mohanlal Rajmal & Co, Anju Shopping Centre, 6,...",Mumbai,Maharashtra,India,022 2605 7035,19.082324,72.838662,http://www.dulux.in
5,Fakhri Paint Centre,"Fakhri Paint Centre, E-29, Rizvi Park , S.V. R...",Mumbai,Maharashtra,India,,19.089607,72.837366,
6,Mukesh & Company,"Mukesh & Company, Shop No. 8, Santacruz Mansio...",Mumbai,Maharashtra,India,022 2610 8178,19.080938,72.842501,
7,Maruti Paint Mart,"Maruti Paint Mart, Dattatray Rd, opposite Sham...",Mumbai,Maharashtra,India,,19.085798,72.834191,
8,Shreenath Paint & Hardware Stores,"Shreenath Paint & Hardware Stores, Shop No.7, ...",Mumbai,Maharashtra,India,022 2661 2798,19.080755,72.829974,
9,Star Paints,"Star Paints, Sitladevi Opp.Kala Pradarshani ,L...",Mumbai,Maharashtra,India,022 6510 0841,19.125369,72.830851,http://www.starpaintindia.com


In [239]:
# City Name: Pune

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/maharashtra/Pune/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
mh3=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
mh3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shilpa Trading,"Shilpa Trading, 24, Hingane Home Colony, Karve...",Pune,Maharashtra,India,020 6450 5829,18.489842,73.819116,
1,Sajid Painter,"Sajid Painter, Police Station, Sr.No.315,Marke...",Pune,Maharashtra,India,,18.504857,73.917578,
2,Deepak Paints,"Deepak Paints, Matrismirti Building, Pune - So...",Pune,Maharashtra,India,020 2399 9361,18.501231,73.936706,
3,Asian Paints Home Solutions,"Asian Paints Home Solutions, No 43, Karve Rd, ...",Pune,Maharashtra,India,,18.512548,73.839266,https://www.asianpaints.com
4,Kankoo Paints PVT. LTD.,"Kankoo Paints PVT. LTD., Camp, Pune, Maharasht...",Pune,Maharashtra,India,020 2634 6180,18.511429,73.879469,
5,Krishna paints,"Krishna paints, 694/10, ,pune, Shivneri Path, ...",Pune,Maharashtra,India,099222 98123,18.489402,73.858182,
6,Evergreen Trading Agency,"Evergreen Trading Agency, Bohari Ali, Rameshwa...",Pune,Maharashtra,India,020 2447 1292,18.514567,73.859414,
7,PRAGATI PAINTS,"PRAGATI PAINTS, Sr.No 85/2, Shop No 3, BhimaSh...",Pune,Maharashtra,India,090495 03040,18.501875,73.928835,
8,Purple Colours & Hardware,"Shop No. 35, Purple Colours & Hardware, Amar S...",Pune,Maharashtra,India,,18.500221,73.945335,
9,India Paints,"India Paints, S.no 63/1, Shop no.1, Mahadev Vi...",Pune,Maharashtra,India,098604 02894,18.481043,73.856264,


In [240]:
# City Name: Nagpur

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/maharashtra/Nagpur/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
mh4=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
mh4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Gurudeo lime stores,"Gurudeo lime stores, chuna oli, itwari, Nagpur...",Nagpur,Maharashtra,India,0712 277 1614,21.15568,79.11032,
1,D B Mehta & Sons,"D B Mehta & Sons, Wholesale Cloth Market, Jaga...",Nagpur,Maharashtra,India,098906 04832,21.152783,79.107543,
2,Deepak Paint Mart,"Deepak Paint Mart, Amar Sajjan Complex, Mangal...",Nagpur,Maharashtra,India,0712 252 6744,21.143675,79.112513,
3,Hakik Paints,"Hakik Paints, Medical college Road, Baidhnath ...",Nagpur,Maharashtra,India,,21.135906,79.09643,
4,Colourful Store,"Colourful Store, 27, Ghat road, Sardar Patel S...",Nagpur,Maharashtra,India,090968 80080,21.137567,79.090212,https://colourful-store.business.site
5,Paras Paints,"Paras Paints, Plot No.483, Shop No.3, Medical ...",Nagpur,Maharashtra,India,094036 77232,21.132703,79.098989,
6,Asian Traders,"Asian Traders, Vishvanathaji Rahate Bhavan, Lo...",Nagpur,Maharashtra,India,0712 276 4730,21.158725,79.101861,http://asiantradersindia.com
7,J.P. Enterprises,"J.P. Enterprises, Khamele Chamber, Dharaskar G...",Nagpur,Maharashtra,India,098503 34560,21.152673,79.108778,
8,Vasant Colour And Chemicals,"Vasant Colour And Chemicals, Dharaskar Gali, B...",Nagpur,Maharashtra,India,0712 277 2294,21.155475,79.109588,
9,P.G Kale & Sons,"P.G Kale & Sons, 6-7, A- Wing Keshav Imperial,...",Nagpur,Maharashtra,India,,21.143291,79.08681,


In [241]:
# City Name: Thane

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/maharashtra/Thane/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
mh5=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
mh5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M/s. Vinay Paints Supply Company,"Shop No. 1, M/s. Vinay Paints Supply Company, ...",Thane,Maharashtra,India,022 2560 3569,19.17604,72.953923,
1,Color Choice centre,"Color Choice centre, Shop no.2, Shree Ganesh D...",Thane,Maharashtra,India,022 2538 0627,19.191837,72.96888,
2,Colour Corner,"Colour Corner, Shop No : 4 , Vaitywadi, Old Ag...",Thane,Maharashtra,India,,19.188127,72.963535,
3,Probation electric &hardware,"Probation electric &hardware, 1, Nikam Guruji ...",Thane,Maharashtra,India,,19.20774,72.982758,
4,"PNP Hardware ,Electricals and paints","PNP Hardware ,Electricals and paints, Bhoir Wa...",Thane,Maharashtra,India,098928 56283,19.20218,72.97147,
5,Deepak Trading Company,"Deepak Trading Company, Ghodbundar Road, Gowni...",Thane,Maharashtra,India,097697 37354,19.274842,72.959596,
6,Mahavir Paints Co.,"Mahavir Paints Co., Shop No.1 Ambaji dham buld...",Thane,Maharashtra,India,093724 40221,19.176348,72.951595,
7,Rajawat Electrical & Industical Centre,"Rajawat Electrical & Industical Centre, Opposi...",Thane,Maharashtra,India,098700 25374,19.201245,72.953564,
8,Rastriya Marketing Pvt Ltd,"Rastriya Marketing Pvt Ltd, Shop No. 1 &2, Sha...",Thane,Maharashtra,India,022 2561 1043,19.173336,72.956807,
9,Mahavir Electric & Hardware Stores,"Mahavir Electric & Hardware Stores, Shop Numbe...",Thane,Maharashtra,India,098690 37160,19.172265,72.954846,


In [242]:
# City Name: Nashik

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/maharashtra/Nashik/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
mh6=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
mh6.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,G.A.SONS,"5, G.A.SONS, Paradise Plaza, Shingada Talav, R...",Nashik,Maharashtra,India,080870 20180,19.993599,73.787773,
1,AP ColourIdeas Cement Shoppe,"Shop No -03, AP ColourIdeas Cement Shoppe, Rus...",Nashik,Maharashtra,India,,19.953924,73.763552,https://www.asianpaints.com
2,S.A. Enterprises,"S.A. Enterprises, Ashwin Nagar, Nashik, Mahara...",Nashik,Maharashtra,India,099706 22224,19.960865,73.758137,
3,SATPUR HARDWARE AND PAINTS,"SATPUR HARDWARE AND PAINTS, 46/1, Trambakeshwa...",Nashik,Maharashtra,India,080874 79013,19.990371,73.732135,
4,Sai Traders,"Sai Traders, Shri Prasad Apartment, Narhari Na...",Nashik,Maharashtra,India,073044 77755,19.949352,73.768308,
5,ALF PM SURFACE COATING PVT.LTD,"ALF PM SURFACE COATING PVT.LTD, MIDC Ambad, Na...",Nashik,Maharashtra,India,,19.94351,73.728571,
6,Ganpati Stone,"Ganpati Stone, Gangapur Rd, Gangapur, Nashik, ...",Nashik,Maharashtra,India,,20.029716,73.713497,
7,Shreeram Electricals & Hardware,"Shreeram Electricals & Hardware, Yogiraj Apart...",Nashik,Maharashtra,India,080870 64086,19.975937,73.753298,
8,H.P Enterprises,"Plot Number 1, 02, H.P Enterprises, Savarkar N...",Nashik,Maharashtra,India,086006 39691,19.988869,73.719429,
9,STANDOX 2K PREMIUM AUTO PAINT,"STANDOX 2K PREMIUM AUTO PAINT, Pathardi Fata, ...",Nashik,Maharashtra,India,098229 53888,19.958931,73.760387,


In [243]:
# City Name: Aurangabad

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/maharashtra/Aurangabad/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
mh7=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
mh7.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Padmavati powder coating,"Padmavati powder coating, C- 274, Mahavir chow...",Aurangabad,Maharashtra,India,,19.837615,75.248255,
1,Vasani Traders,"Vasani Traders, Patel Chamber, Pandharpur, Aur...",Aurangabad,Maharashtra,India,098220 33446,19.834895,75.25063,
2,RAJ RANG,"RAJ RANG, Raj Galaxy, Rout Corner, Shri Nagar ...",Aurangabad,Maharashtra,India,0240 248 1886,19.88747,75.357417,
3,Bhogale Coating & Paints Private Limited,"Bhogale Coating & Paints Private Limited, Hame...",Aurangabad,Maharashtra,India,0240 233 5293,19.857549,75.316707,http://www.bhogalecoating.com
4,Pankaj Paints & Hardware,"Pankaj Paints & Hardware, Mahalaxmi Colony, Th...",Aurangabad,Maharashtra,India,098225 43011,19.866894,75.372383,
5,Rahul Hardware And Paints,"Rahul Hardware And Paints, Pundalik Nagar Road...",Aurangabad,Maharashtra,India,0240 244 2206,19.86602,75.358721,
6,S.M. ENTERPRISES,"Plot No: 6, S.M. ENTERPRISES, Mayur Park Rd, O...",Aurangabad,Maharashtra,India,097679 99880,19.910965,75.354946,
7,CHAUDHRI HARDWARE & PAINTS,"CHAUDHRI HARDWARE & PAINTS, Champa Chowk, Shar...",Aurangabad,Maharashtra,India,080878 78434,19.889435,75.341319,
8,Renuka Hardware & Paints,"Renuka Hardware & Paints, Jalgaon Rd, Hudco, S...",Aurangabad,Maharashtra,India,099703 00206,19.906545,75.355409,
9,Bombay Hardware & Paints,"18, Bombay Hardware & Paints, Bhaji Mndi Road,...",Aurangabad,Maharashtra,India,092253 13172,19.88707,75.328151,


In [244]:
# Combine the per-city dataframes with 'pd.concat': put them in a list, then pass
# 'ignore_index=True' so the combined dataframe gets a fresh, continuous index.
mhs_data = [mh1,mh2,mh3,mh4,mh5,mh6,mh7]
mhs_data = pd.concat(mhs_data,ignore_index=True)
mhs_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Woodland Sheets,"Woodland Sheets, Lohia Rd, Gandhinagar, Mahara...",Kolhapur,Maharashtra,India,,16.705293,74.296535,
1,Siddhant spray painting,"Siddhant spray painting, Shastri Nagar, Kolhap...",Kolhapur,Maharashtra,India,,16.690492,74.242275,
2,Patvegar Enterprises,"Patvegar Enterprises, 732, 4th Ln, E Ward, Sha...",Kolhapur,Maharashtra,India,098609 44959,16.700782,74.23451,
3,Satyam Paints,"Satyam Paints, shiru chowk road, Gandhinagar, ...",Kolhapur,Maharashtra,India,,16.706333,74.301245,
4,Deshmukh Paints,"Deshmukh Paints, E Ward, bhosale park, Kadamwa...",Kolhapur,Maharashtra,India,,16.721259,74.257748,
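
To make the effect of `ignore_index=True` concrete, here is a small sketch using two made-up frames; `city_a`, `city_b`, and their contents are placeholders, not data from the scraped site.

```python
import pandas as pd

# Two tiny stand-in frames; the names and values are invented for illustration.
city_a = pd.DataFrame({"name": ["Store A1", "Store A2"], "city": ["CityA", "CityA"]})
city_b = pd.DataFrame({"name": ["Store B1", "Store B2"], "city": ["CityB", "CityB"]})

# Without ignore_index, each frame keeps its own row labels, so labels repeat.
kept = pd.concat([city_a, city_b])
print(kept.index.tolist())      # [0, 1, 0, 1]

# With ignore_index=True, the combined frame gets one continuous index.
combined = pd.concat([city_a, city_b], ignore_index=True)
print(combined.index.tolist())  # [0, 1, 2, 3]
```

Repeated labels make label-based lookups such as `kept.loc[0]` return several rows, which is why the combined state frames reset the index.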


#**Odisha**

We have gathered data from various cities in Odisha, and our next task is to combine it into one dataset.

In [245]:
# City Name: Bhubaneshwar

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/odisha/Bhubaneshwar/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
od1=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
od1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M/s Saimaa Enterprises,"M/s Saimaa Enterprises, Damana Square, Max Sho...",Bhubaneshwar,Odisha,India,,20.330114,85.81879,
1,Omm Traders - Asian Paints Ltd,"Omm Traders - Asian Paints Ltd, Koradakanta Rd...",Bhubaneshwar,Odisha,India,,20.280653,85.873801,http://spconstruction.in
2,Sulochana Hardware & Paint,"Sulochana Hardware & Paint, Shampur, Bhubanesw...",Bhubaneshwar,Odisha,India,,20.284844,85.770223,
3,Maa Tarini Hardware & Paints,"A/22, Maa Tarini Hardware & Paints, In Falcon ...",Bhubaneshwar,Odisha,India,094380 13156,20.275452,85.852497,
4,Dulux Paints,"Dulux Paints, HIG-109, Jagamara - Sundarpada R...",Bhubaneshwar,Odisha,India,,20.238051,85.810342,
5,Khandagiri hardware and paints,"Plot no 26, Khandagiri hardware and paints, ne...",Bhubaneshwar,Odisha,India,,20.25448,85.786779,
6,Phulwari Enterprises,"Phulwari Enterprises, 2974, Cuttack - Puri Byp...",Bhubaneshwar,Odisha,India,,20.307308,85.8724,
7,Dhauli Traders,"Dhauli Traders, Old, Bhubaneswar - Puri Hwy, U...",Bhubaneshwar,Odisha,India,,20.201611,85.855172,
8,Dalmia Magic P S Agency,"Dalmia Magic P S Agency, Puri Rd, Sisupalgarh,...",Bhubaneshwar,Odisha,India,,20.218964,85.848793,
9,Dulux Paints,"Dulux Paints, Khandagiri Marg, Jagamara, Bhuba...",Bhubaneshwar,Odisha,India,,20.257046,85.795363,


In [246]:
# City Name: Cuttack

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/odisha/Cuttack/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
od2=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
od2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Jagannath Traders,"Jagannath Traders, Nakhara, Phulnakhara (Infro...",Cuttack,Odisha,India,0671 235 6792,20.360306,85.892177,
1,Dulux Paint,"Dulux Paint, Sambalpur - Angul - Dhenkanal - C...",Cuttack,Odisha,India,,20.529136,85.930618,
2,Pinku marbal and tiles mistry,"Pinku marbal and tiles mistry, Cuttack, Odisha...",Cuttack,Odisha,India,,20.481841,85.895757,
3,Shree Ganesh Paints & Ply,"Shree Ganesh Paints & Ply, Justice chakk, CDA ...",Cuttack,Odisha,India,,20.480812,85.820907,
4,smartpainting,"smartpainting, Plot No-800/A,Mahanadi Vihar,Na...",Cuttack,Odisha,India,,20.46861,85.912193,
5,Sp brothers,"Sp brothers, NH16, near Mayur Hotel, Gandarpur...",Cuttack,Odisha,India,089767 07477,20.464487,85.904142,https://sp-brothers-paint-store.business.site
6,Shreeji Trading,"Shreeji Trading, Kazi Bazar, Cuttack - 753001,...",Cuttack,Odisha,India,,20.468966,85.862859,
7,Ghanteswar Paints and Hardware,"Ghanteswar Paints and Hardware, near SBI Bank,...",Cuttack,Odisha,India,,20.45619,85.917701,
8,Barun Paint,"Barun Paint, Sibaji Nagar Satichaura,Bidanasi,...",Cuttack,Odisha,India,,20.478224,85.848021,
9,"Asian Paints,Berger Shop","Asian Paints,Berger Shop, At/ PO Jhinkiria, Vi...",Cuttack,Odisha,India,,20.405685,85.944693,


In [247]:
# Combine the per-city dataframes with 'pd.concat': put them in a list, then pass
# 'ignore_index=True' so the combined dataframe gets a fresh, continuous index.
odi_data = [od1,od2]
odi_data = pd.concat(odi_data,ignore_index=True)
odi_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M/s Saimaa Enterprises,"M/s Saimaa Enterprises, Damana Square, Max Sho...",Bhubaneshwar,Odisha,India,,20.330114,85.81879,
1,Omm Traders - Asian Paints Ltd,"Omm Traders - Asian Paints Ltd, Koradakanta Rd...",Bhubaneshwar,Odisha,India,,20.280653,85.873801,http://spconstruction.in
2,Sulochana Hardware & Paint,"Sulochana Hardware & Paint, Shampur, Bhubanesw...",Bhubaneshwar,Odisha,India,,20.284844,85.770223,
3,Maa Tarini Hardware & Paints,"A/22, Maa Tarini Hardware & Paints, In Falcon ...",Bhubaneshwar,Odisha,India,094380 13156,20.275452,85.852497,
4,Dulux Paints,"Dulux Paints, HIG-109, Jagamara - Sundarpada R...",Bhubaneshwar,Odisha,India,,20.238051,85.810342,
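
The fetch, parse, and concatenate steps are the same for every city, so they could be folded into two small functions. The sketch below is one possible arrangement, not the notebook's own method: `fetch_table` and `scrape_cities` are hypothetical names, the 30-second timeout is an arbitrary choice, and the HTML is wrapped in `StringIO` because recent pandas versions deprecate passing a raw HTML string to `pd.read_html`.

```python
from io import StringIO

import pandas as pd

# Same header the notebook cells send with every request.
HEADERS = {"User-Agent": "mozilla/5.0"}


def fetch_table(url):
    """Download one page and return its first HTML table as a dataframe."""
    import requests  # imported here so the rest of the sketch can run offline

    response = requests.get(url, headers=HEADERS, timeout=30)
    # Stop on 4xx/5xx responses instead of trying to parse an error page.
    response.raise_for_status()
    # Wrap the text in a file-like object before handing it to read_html.
    return pd.read_html(StringIO(response.text))[0]


def scrape_cities(url_by_city, fetch=fetch_table):
    """Fetch every city's table and combine them into one dataframe."""
    frames = [fetch(url) for url in url_by_city.values()]
    return pd.concat(frames, ignore_index=True)


# Example call (performs network requests, so it is left commented out):
# odisha_urls = {
#     "Bhubaneshwar": "https://rentechdigital.com/smartscraper/business-report-details/india/odisha/Bhubaneshwar/paint-stores",
#     "Cuttack": "https://rentechdigital.com/smartscraper/business-report-details/india/odisha/Cuttack/paint-stores",
# }
# odi_data = scrape_cities(odisha_urls)
```

Passing the fetch function as a parameter lets the combining logic be checked with a stand-in that returns fixed dataframes, without touching the network.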


#**Punjab**

Once we have collected data from different cities in Punjab, we will proceed to merge it.

In [248]:
# City Name: Mohali

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/punjab/Mohali/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
pn1=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
pn1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shalimar Paints - Singla Oil Company,"Shalimar Paints - Singla Oil Company, Gurudwar...",Mohali,Punjab,India,098764 68378,30.66802,76.699916,
1,Neel kamal traders,"Booth No - 42, Neel kamal traders, Sector 40C,...",Mohali,Punjab,India,,30.733983,76.732931,
2,City Paint & Painting,"City Paint & Painting, Shop No 165, New Mortor...",Mohali,Punjab,India,098885 09144,30.687164,76.759906,
3,Bansal Sons,"Bansal Sons, Sector 80, Sahibzada Ajit Singh N...",Mohali,Punjab,India,090410 61611,30.670685,76.717849,http://business.google.com
4,Anil Paint & Hardware Store,"Anil Paint & Hardware Store, SCO 955 Pind, Mat...",Mohali,Punjab,India,098150 76254,30.698347,76.714019,
5,Mahaluxmi Paint And Hardware Store,"Mahaluxmi Paint And Hardware Store, Paint Shop...",Mohali,Punjab,India,098882 24409,30.728989,76.712468,https://mahaluxmi-paint-and-hardware-store.bus...
6,V.K Paints & Hardware Store,"V.K Paints & Hardware Store, Booth No. 23, Pha...",Mohali,Punjab,India,0172 509 3453,30.70662,76.724654,
7,Shivalik Paints,"Booth No.7, Shivalik Paints, 11-12, Phase 7, I...",Mohali,Punjab,India,098889 97726,30.71359,76.699788,
8,Haryana paint and hardware,"Haryana paint and hardware, Sco 75, Phase 9, S...",Mohali,Punjab,India,,30.694176,76.733258,
9,Asian Paints - Empire Building Material,"Asian Paints - Empire Building Material, S.C.F...",Mohali,Punjab,India,0172 509 1702,30.709091,76.722372,


In [249]:
# City Name: Ludhiana

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/punjab/Ludhiana/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
pn2=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
pn2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Friends Paints Store,"Friends Paints Store, Street Number 8, Link Rd...",Ludhiana,Punjab,India,094171 47613,30.890006,75.866251,
1,Mashal colour sales,"Mashal colour sales, Moti Nagar Rd, Moti Nagar...",Ludhiana,Punjab,India,,30.894675,75.886919,
2,Onkar Paint Store,"Onkar Paint Store, 156, Rani Jhansi Rd, Civil ...",Ludhiana,Punjab,India,,30.909374,75.836365,
3,Aggarwal Automotive Sales,"4076, Aggarwal Automotive Sales, Chandigarh Rd...",Ludhiana,Punjab,India,098884 89766,30.909677,75.881959,
4,Pawan Paints & Chemicals,"Pawan Paints & Chemicals, Gurudwara Kalgi Dhar...",Ludhiana,Punjab,India,098889 11873,30.900055,75.849062,
5,Guru Nanak Paints & Building Material Store,"Guru Nanak Paints & Building Material Store, D...",Ludhiana,Punjab,India,099154 19132,30.879996,75.847305,
6,Rajveer Paint Store,"Rajveer Paint Store, New Model Town, Bagdi Par...",Ludhiana,Punjab,India,093169 70499,30.90129,75.84083,
7,Punjab Paint Store,"Punjab Paint Store, Wall Singh Nagar Road, She...",Ludhiana,Punjab,India,094185 67811,30.929402,75.863662,
8,K P Gupta and Sons,"K P Gupta and Sons, R.K. Steel, Cycle Market, ...",Ludhiana,Punjab,India,0161 503 6027,30.895616,75.857587,
9,Basant Hardware,"Basant Hardware, Prince Market, Prince Market ...",Ludhiana,Punjab,India,098726 44448,30.90147,75.83383,


In [250]:
# City Name: Amritsar

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/punjab/Amritsar/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
pn3=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
pn3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Kanshi Ram Sham Lal & Co.,"Kanshi Ram Sham Lal & Co., K. S. Towers, 2788,...",Amritsar,Punjab,India,097790 99201,31.626098,74.884143,
1,Simran Paints Corporation,"Simran Paints Corporation, DASHMESH NAGAR, TAR...",Amritsar,Punjab,India,098728 42977,31.599545,74.882699,
2,Royal Touch Paints,"Royal Touch Paints, Preet Nagar, Post Nangali,...",Amritsar,Punjab,India,0183 271 1754,31.633979,74.872264,
3,Deco Paints,"Deco Paints, H No Hm-3, Ist Floor, Kapoor Cott...",Amritsar,Punjab,India,0183 200 1921,31.645787,74.862878,
4,Deco Paints,"Deco Paints, Lawrence Road, Opposite Hotel, La...",Amritsar,Punjab,India,098885 84436,31.642486,74.875892,
5,Arora paint &Hardware,"Arora paint &Hardware, 88ft Road, Chowk Tungwa...",Amritsar,Punjab,India,096533 43966,31.654505,74.892707,
6,Deco Paints Chemicals,"Deco Paints Chemicals, G T Road: Guru Ravi Das...",Amritsar,Punjab,India,0183 258 5659,31.633979,74.872264,
7,Gati Pa─îðîers and Amritsar AK Hardware & Pain...,Gati Pa─îðîers and Amritsar AK Hardware & Pain...,Amritsar,Punjab,India,,31.621668,74.901171,
8,Vicki Paints,"Vicki Paints, Near Hussainpura Chownk, Maharaj...",Amritsar,Punjab,India,098880 66391,31.633545,74.881034,
9,R.K.Traders,"R.K.Traders, Tarn Taran Rd, Near Bawa Hospital...",Amritsar,Punjab,India,098551 72728,31.609354,74.880628,


In [251]:
# City Name: Jalandhar

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/punjab/Jalandhar/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
pn4=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
pn4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Krishna Hardware & Paints Store,"Krishna Hardware & Paints Store, Jalandhar - N...",Jalandhar,Punjab,India,090414 87333,31.309443,75.566427,
1,Kohli Trading Company,"Kohli Trading Company, SD College Rd, Gobind G...",Jalandhar,Punjab,India,0181 223 7041,31.326346,75.592194,
2,Dhir & Co,"Dhir & Co, New Railway Rd, opp. Laxmi Cinema, ...",Jalandhar,Punjab,India,098761 48139,31.333155,75.586799,
3,Ladha Singh Satnam Singh,"Ladha Singh Satnam Singh, Bazar Bansanwala, Ba...",Jalandhar,Punjab,India,,31.32711,75.591709,
4,Delhi Paints Decor,"Delhi Paints Decor, Mithapur Rd, Near Easyday ...",Jalandhar,Punjab,India,098559 55559,31.292795,75.579959,
5,Chatri Mal Jawahar Lal,"Chatri Mal Jawahar Lal, Old Phagwara Rd, Jalan...",Jalandhar,Punjab,India,097791 87787,31.286555,75.647497,
6,Kabir Sales,"Kabir Sales, Model House, Jalandhar, Punjab 14...",Jalandhar,Punjab,India,,31.300145,75.562096,
7,Swag Trading Corporation,"Swag Trading Corporation, Opposit Galaxy Honda...",Jalandhar,Punjab,India,086994 84078,31.337234,75.611274,
8,Chopra Paints & Hardware Store,"Chopra Paints & Hardware Store, 584, Cool Road...",Jalandhar,Punjab,India,098156 55548,31.314751,75.582168,
9,Simran Traders,"Simran Traders, Nari Niketan Rd, near Jalandha...",Jalandhar,Punjab,India,098141 05258,31.31467,75.570303,


In [252]:
# City Name: Patiala

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
url='https://rentechdigital.com/smartscraper/business-report-details/india/punjab/Patiala/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
pn5=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
pn5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,New Singla Paint and Hardware Store,"New Singla Paint and Hardware Store, Chandni C...",Patiala,Punjab,India,078376 68462,30.324511,76.395373,
1,Capital Paint Store,"Capital Paint Store, Sanauri adda, Patiala, Pu...",Patiala,Punjab,India,094635 86159,30.32411,76.407802,
2,S. K. Traders,"S. K. Traders, near Lahori Gate, Lahori Gate, ...",Patiala,Punjab,India,099889 29214,30.336145,76.401775,
3,sahib dayal paint store,"sahib dayal paint store, #28,sai market near a...",Patiala,Punjab,India,,30.339794,76.386893,https://sahib-dayal-paint-store.business.site
4,Barnala Paint and Hardware Store,"Shop No. 4, Barnala Paint and Hardware Store, ...",Patiala,Punjab,India,,30.325647,76.403061,
5,Jodh Singh Balwinder Singh,"Jodh Singh Balwinder Singh, Nabha Gate, Patial...",Patiala,Punjab,India,094171 42200,30.32825,76.392502,
6,Gupta Trading Company,"Gupta Trading Company, Ram Nagar, NH 64, Patia...",Patiala,Punjab,India,098150 27800,30.330485,76.414148,
7,Shree Ganesh Enterprises,"Shree Ganesh Enterprises, Sanauri Adda, Markal...",Patiala,Punjab,India,093168 83293,30.323449,76.408668,
8,Shadiram Shankur Dat,"Shadiram Shankur Dat, Sanauri Gate, Patiala, P...",Patiala,Punjab,India,093175 30402,30.324587,76.406813,
9,NEW SINGLA PAINT AND HARDWEAR STORE,"NEW SINGLA PAINT AND HARDWEAR STORE, Rose Aven...",Patiala,Punjab,India,,30.315495,76.378583,


In [253]:
# City Name: Adampur

# Set a User-Agent header for the request and save it in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Save the URL of the page that holds the data.
# Note: this URL has no city segment, so it points at the state-level Punjab page.
# The result therefore also contains rows from other cities (for example Mohali).
url='https://rentechdigital.com/smartscraper/business-report-details/india/punjab/paint-stores'

# Fetch the page with the requests library, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table as a dataframe in its own variable with 'pd.DataFrame'.
pn6=pd.DataFrame(df)

# Use head(10) to preview the first 10 rows.
pn6.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,NIPUN PAINT PRIVATE LIMITED,"NIPUN PAINT PRIVATE LIMITED, ADAMPUR, Railway ...",Adampur,Punjab,India,098152 43442,31.427782,75.72037,
1,VISHAL PAINT STORE,"VISHAL PAINT STORE, ADAMPUR, Railway Rd, JALAN...",Adampur,Punjab,India,098150 31840,31.427782,75.72037,
2,LOVE PAINT STORE,"LOVE PAINT STORE, #1842, Main road, opposite t...",Adampur,Punjab,India,098773 44126,31.431928,75.719704,
3,Master Paint Store,"Master Paint Store, Bhogpur - Adampur Rd, Adam...",Adampur,Punjab,India,,31.436898,75.717474,
4,Khalsa Hardware & Paint Store,"Khalsa Hardware & Paint Store, Daroli Kalan, P...",Adampur,Punjab,India,094643 36763,31.405604,75.773074,
5,Asian Paints - Empire Building Material,"Asian Paints - Empire Building Material, S.C.F...",Mohali,Punjab,India,0172 509 1702,30.709091,76.722372,
6,Haryana paint and hardware,"Haryana paint and hardware, Sco 75, Phase 9, S...",Mohali,Punjab,India,,30.694176,76.733258,
7,Shivalik Paints,"Booth No.7, Shivalik Paints, 11-12, Phase 7, I...",Mohali,Punjab,India,098889 97726,30.71359,76.699788,
8,V.K Paints & Hardware Store,"V.K Paints & Hardware Store, Booth No. 23, Pha...",Mohali,Punjab,India,0172 509 3453,30.70662,76.724654,
9,Mahaluxmi Paint And Hardware Store,"Mahaluxmi Paint And Hardware Store, Paint Shop...",Mohali,Punjab,India,098882 24409,30.728989,76.712468,https://mahaluxmi-paint-and-hardware-store.bus...


In [254]:
# Drop rows 5 to 9, which belong to Mohali rather than Adampur.
pn6.drop(index=pn6.index[5:10],axis=0,inplace=True)

In [255]:
# Check the data set.
pn6.head(5)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,NIPUN PAINT PRIVATE LIMITED,"NIPUN PAINT PRIVATE LIMITED, ADAMPUR, Railway ...",Adampur,Punjab,India,098152 43442,31.427782,75.72037,
1,VISHAL PAINT STORE,"VISHAL PAINT STORE, ADAMPUR, Railway Rd, JALAN...",Adampur,Punjab,India,098150 31840,31.427782,75.72037,
2,LOVE PAINT STORE,"LOVE PAINT STORE, #1842, Main road, opposite t...",Adampur,Punjab,India,098773 44126,31.431928,75.719704,
3,Master Paint Store,"Master Paint Store, Bhogpur - Adampur Rd, Adam...",Adampur,Punjab,India,,31.436898,75.717474,
4,Khalsa Hardware & Paint Store,"Khalsa Hardware & Paint Store, Daroli Kalan, P...",Adampur,Punjab,India,094643 36763,31.405604,75.773074,
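
The positional drop above works when rows 5 to 9 are the only rows from other cities. If the state-level page were to return more than 10 rows, later rows would stay in the dataframe. A possible alternative, shown here as a sketch on a small stand-in table, is to filter on the 'city' column instead. The helper name `keep_city` is hypothetical.

```python
import pandas as pd

# Small stand-in for the state-level table; the real one comes from the scrape.
stores = pd.DataFrame({
    "name": ["VISHAL PAINT STORE", "Master Paint Store", "Shivalik Paints"],
    "city": ["Adampur", "Adampur", "Mohali"],
})


def keep_city(df, city):
    """Return only the rows whose 'city' column equals the given city."""
    return df[df["city"] == city].reset_index(drop=True)


adampur = keep_city(stores, "Adampur")
print(adampur)
```

Filtering by value does not depend on how many rows the page returns or on their order.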


In [256]:
# Merge the per-city dataframes into one with 'pd.concat' and store the result in a variable.
# Build a list of the dataframes and pass 'ignore_index=True' so the combined dataframe gets a fresh index.
pnb_data = [pn1,pn2,pn3,pn4,pn5,pn6]
pnb_data = pd.concat(pnb_data,ignore_index=True)
pnb_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shalimar Paints - Singla Oil Company,"Shalimar Paints - Singla Oil Company, Gurudwar...",Mohali,Punjab,India,098764 68378,30.66802,76.699916,
1,Neel kamal traders,"Booth No - 42, Neel kamal traders, Sector 40C,...",Mohali,Punjab,India,,30.733983,76.732931,
2,City Paint & Painting,"City Paint & Painting, Shop No 165, New Mortor...",Mohali,Punjab,India,098885 09144,30.687164,76.759906,
3,Bansal Sons,"Bansal Sons, Sector 80, Sahibzada Ajit Singh N...",Mohali,Punjab,India,090410 61611,30.670685,76.717849,http://business.google.com
4,Anil Paint & Hardware Store,"Anil Paint & Hardware Store, SCO 955 Pind, Mat...",Mohali,Punjab,India,098150 76254,30.698347,76.714019,
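
Because the state-level page can repeat stores that already appear in a city page, it may be worth checking for duplicates after the merge. The sketch below uses toy dataframes with illustrative values only. Choosing 'name' and 'city' as the duplicate key is an assumption, and the real data might need a different key, such as the coordinates.

```python
import pandas as pd

# Toy frames standing in for two per-city tables (values are illustrative only).
a = pd.DataFrame({"name": ["Store A", "Store B"], "city": ["Mohali", "Mohali"]})
b = pd.DataFrame({"name": ["Store B", "Store C"], "city": ["Mohali", "Adampur"]})

# Same pattern as the cell above: concatenate and reset the index.
combined = pd.concat([a, b], ignore_index=True)

# Assumption: two rows describe the same store when name and city both match.
deduped = combined.drop_duplicates(subset=["name", "city"]).reset_index(drop=True)

print(len(combined), len(deduped))
```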


#**Rajasthan**

We collect data for several cities in Rajasthan and then merge the per-city results into a single dataframe.

In [257]:
# City Name: Ajmer

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/rajasthan/Ajmer/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
rj1=pd.DataFrame(df)

# Preview the first 10 rows with head().
rj1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shri Vinayak Paints,"Shri Vinayak Paints, Near swastic tent house, ...",Ajmer,Rajasthan,India,077378 51720,26.477176,74.647892,
1,Vanshika paints,"Vanshika paints, Nasirabad Rd, Nagra, Ajmer, R...",Ajmer,Rajasthan,India,,26.425234,74.650409,
2,Chadausia Store,"Chadausia Store, Naya Bazar, Naya Bazaar, Ajme...",Ajmer,Rajasthan,India,094140 07068,26.461123,74.631566,
3,Rawat Traders,"Rawat Traders, Mayo Link Rd, Shiva Colony, Ajm...",Ajmer,Rajasthan,India,094145 93209,26.439004,74.652004,
4,Ak Paint And Chemicals,"Ak Paint And Chemicals, Shree Gurukrupa Apartm...",Ajmer,Rajasthan,India,081070 93110,26.464777,74.630353,
5,Thakurdas Kutti ki Taal,"Thakurdas Kutti ki Taal, Hatundi Road, Makhupu...",Ajmer,Rajasthan,India,094144 32648,26.39422,74.666095,
6,Shri Vinayak Paints,"Shri Vinayak Paints, Naya Bada, Police Line Rd...",Ajmer,Rajasthan,India,077378 51720,26.47633,74.648344,
7,Kanhiyalal Khyaliram Jain,"Kanhiyalal Khyaliram Jain, Shri Mahaveer Marke...",Ajmer,Rajasthan,India,094143 14644,26.457969,74.635356,
8,Amit Marketing,"Amit Marketing, 172/22, Near Khwaji Gas Agency...",Ajmer,Rajasthan,India,098293 31701,26.453472,74.632683,
9,Moin Paints,"Moin Paints, Khwaja Market Parbarpura Byepass,...",Ajmer,Rajasthan,India,081070 27787,26.408256,74.660251,


In [258]:
# City Name: Jaipur

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/rajasthan/Jaipur/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
rj2=pd.DataFrame(df)

# Preview the first 10 rows with head().
rj2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Toshika Paint House,"Toshika Paint House, ÓñòÓñ¥Óñ▓ÓñÁÓñ¥Óñí, Óñ░ÓÑ...",Jaipur,Rajasthan,India,,26.95558,75.687723,http://www.asianpaints.com
1,Gupta Paint Agencies,"Gupta Paint Agencies, 50, Tripolia Bazar, Gang...",Jaipur,Rajasthan,India,098293 64760,26.924426,75.820391,http://www.dulux.in
2,Ridhi Sidhi Paints,"Ridhi Sidhi Paints, Boring Rd, Jagannath Puri,...",Jaipur,Rajasthan,India,0141 329 2068,26.943743,75.746938,
3,Taj Paints & Hardware,"Taj Paints & Hardware, Shop No:25, Near Kiran ...",Jaipur,Rajasthan,India,098280 99202,26.912297,75.775001,
4,Shri shri salasar paints,"Shri shri salasar paints, Kamla nehru nagar aj...",Jaipur,Rajasthan,India,,26.893811,75.747208,https://shree-shree-salasar-paints.business.site
5,Salasar Paints,"Salasar Paints, Shop No.14,Kamla Nehru Nagar, ...",Jaipur,Rajasthan,India,,26.880404,75.716421,
6,Raj Paint,"Raj Paint, Ajmer Rd, Subhash Colony, Heerapura...",Jaipur,Rajasthan,India,,26.879063,75.717092,
7,Shiv Shakti Paints & Sanitary,"Shiv Shakti Paints & Sanitary, D 429 B, -D Pow...",Jaipur,Rajasthan,India,097821 62402,26.88939,75.722076,
8,Swami Paint House,"Swami Paint House, 117, Ajmer Rd, Madrampur, G...",Jaipur,Rajasthan,India,097999 62928,26.898387,75.764768,
9,Indian Paints Colour Factory,"Indian Paints Colour Factory, 20, Ajmer Rd, Ja...",Jaipur,Rajasthan,India,,26.90624,75.772718,


In [259]:
# City Name: Kota

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/rajasthan/Kota/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
rj3=pd.DataFrame(df)

# Preview the first 10 rows with head().
rj3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M.K. Enterprises,"M.K. Enterprises, Gobriya Bawadi Circel, Jhala...",Kota,Rajasthan,India,098285 04283,25.131609,75.853214,
1,Ganpati Sales Corp.,"Ganpati Sales Corp., 8-H-5, Mahaveer Nagar III...",Kota,Rajasthan,India,094600 77490,25.137177,75.837498,
2,Roshan Deep,"Roshan Deep, Balaji Market, Corner, Main Rd, R...",Kota,Rajasthan,India,093140 60377,25.13087,75.835167,https://roshan-deep.business.site
3,Raj Hardware & Paints Trdg.,"Raj Hardware & Paints Trdg., Shop No-266, Moto...",Kota,Rajasthan,India,080582 96258,25.129845,75.847462,
4,Nisha Tea - Assam Darjeeling Tea Wholeseller,"Nisha Tea - Assam Darjeeling Tea Wholeseller, ...",Kota,Rajasthan,India,,25.146167,75.828753,
5,NAMA STORE,"NAMA STORE, Swami Vivekananda Nagar, Kota, Raj...",Kota,Rajasthan,India,,25.125347,75.819369,
6,Dhriti Traders,"Dhriti Traders, Shop no-2, Rawatbhata Main Roa...",Kota,Rajasthan,India,,25.154993,75.820872,
7,Gera Enterprises,"Gera Enterprises, Shop No-06, Patel Market, Ne...",Kota,Rajasthan,India,096805 12758,25.145425,75.830246,
8,Riddhi sidhi Enterprises,"Shop no, Riddhi sidhi Enterprises, 28, Aawli R...",Kota,Rajasthan,India,,25.115998,75.798357,
9,Ganapati Enterprises,"Ganapati Enterprises, 3-k-20, Dadabari Extensi...",Kota,Rajasthan,India,,25.157705,75.825435,


In [260]:
# City Name: Udaipur

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/rajasthan/Udaipur/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
rj4=pd.DataFrame(df)

# Preview the first 10 rows with head().
rj4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Parshvanath Paints,"Parshvanath Paints, 1-Ka-19, Shop No 2, opposi...",Udaipur,Rajasthan,India,,24.568832,73.726521,
1,Kings Colour,"Kings Colour, Shop 3, near Dai Mata Temple, Pa...",Udaipur,Rajasthan,India,094610 76266,24.566446,73.714556,https://kingscolour.in
2,Kansai Nerolac Paints Ltd,"Kansai Nerolac Paints Ltd, Plot No. E1-229, RI...",Udaipur,Rajasthan,India,,24.633599,73.716527,http://www.nerolac.com
3,New Mahesh Paint House,"New Mahesh Paint House, 559, Tekri Madri Link ...",Udaipur,Rajasthan,India,097994 91619,24.567102,73.71434,
4,A. P. Enterprises,"A. P. Enterprises, 100, near Jain Transport, S...",Udaipur,Rajasthan,India,,24.603133,73.709333,
5,Shree Parshwanath Enterprises,"Shree Parshwanath Enterprises, By Pass Chourah...",Udaipur,Rajasthan,India,094141 64735,24.592426,73.747541,
6,Tobwala Buildcon,"Tobwala Buildcon, 100 Feet Road, Nava Ghar, Bh...",Udaipur,Rajasthan,India,094141 62420,24.621068,73.706849,
7,Talesra Paint House,"Talesra Paint House, City Station Rd, Toran Ba...",Udaipur,Rajasthan,India,094147 88789,24.578016,73.697677,
8,Modern Paint House,"Modern Paint House, Kanda Complex, Mewar Motor...",Udaipur,Rajasthan,India,098281 44110,24.578878,73.699104,
9,Badala Paints And Hardware,"Badala Paints And Hardware, Ganapati Nagar, Ud...",Udaipur,Rajasthan,India,,24.593093,73.72145,


In [261]:
# City Name: Bharatpur

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL to download from. This is the state-level Rajasthan page, so it also returns rows for other cities.
url='https://rentechdigital.com/smartscraper/business-report-details/india/rajasthan/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
rj5=pd.DataFrame(df)

# Preview the first 10 rows with head().
rj5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sanchuri interprises,"Sanchuri interprises, Circular Rd, Indra Nagar...",Bharatpur,Rajasthan,India,,27.211064,77.48159,
1,Singhal Asian Paints,"Singhal Asian Paints, Circular Rd, U.I.T Colon...",Bharatpur,Rajasthan,India,,27.22424,77.48523,
2,Jain Paints,"Jain Paints, Tara Mahendra Colony, Bharatpur, ...",Bharatpur,Rajasthan,India,,27.216227,77.487827,
3,Om paint's store and pipes fitting,"Om paint's store and pipes fitting, Sony show ...",Bharatpur,Rajasthan,India,,27.215137,77.495521,
4,Krishna Paints Enterprises,"Krishna Paints Enterprises, kali bagichi chaur...",Bharatpur,Rajasthan,India,085618 65801,27.217151,77.505117,
5,Agrawal And Brothers,"Agrawal And Brothers, Shri Ganga Maharani Ji M...",Bharatpur,Rajasthan,India,,27.216428,77.495782,
6,Agrawal Paints,"Agrawal Paints, Bada Mohalla, Pai Bagh, Bharat...",Bharatpur,Rajasthan,India,,27.217907,77.486986,
7,ÓñÂÓñ┐Óñ▓Óñ¥ ÓñåÓñ░Óñ»Óñ¿ Óñ©ÓÑìÓñƒÓÑïÓñ░,"ÓñÂÓñ┐Óñ▓Óñ¥ ÓñåÓñ░Óñ»Óñ¿ Óñ©ÓÑìÓñƒÓÑïÓñ░, RJ ...",Abhaneri,Rajasthan,India,,26.885072,76.639678,
8,S R Electrical Colour paint and sanitary,"S R Electrical Colour paint and sanitary, Mane...",Abhaneri,Rajasthan,India,081046 16758,26.971068,76.682032,
9,Moin Paints,"Moin Paints, Khwaja Market Parbarpura Byepass,...",Ajmer,Rajasthan,India,081070 27787,26.408256,74.660251,


In [262]:
# One store name in this dataset is garbled by a character-encoding problem, so we looked the store up on the web using its address.
# We then replaced the garbled name with the name found there, using the '.str.replace' function.

rj5['name'] = rj5['name'].str.replace("ÓñÂÓñ┐Óñ▓Óñ¥ ÓñåÓñ░Óñ»Óñ¿ Óñ©ÓÑìÓñƒÓÑïÓñ░","Ankit Electricals Geejgarh")

In [263]:
# The 'address' column of this dataset contains some garbled text that needs to be cleaned up.
# We use '.apply' with a 'lambda' that splits each address on the last garbled characters ("░,")
# and keeps the final piece, index '[-1]', which is the readable part of the address.

rj5['address'] = rj5['address'].apply(lambda x: x.split("░,")[-1])
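
The garbled names look like UTF-8 text that was decoded with a different single-byte encoding. The sketch below assumes that encoding is code page 850, which matches the visible character pattern but is not confirmed by the source. The function name `repair_mojibake` is hypothetical. It re-encodes the text with the assumed wrong encoding and decodes the bytes as UTF-8, leaving the input unchanged if that round trip fails.

```python
def repair_mojibake(text, wrong_encoding="cp850"):
    """Undo UTF-8 text that was decoded with the wrong single-byte encoding.

    Returns the input unchanged when the round trip is not possible.
    """
    try:
        return text.encode(wrong_encoding).decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


# Build a garbled string the same way the damage is assumed to have happened.
original = "விநாயகா"
garbled = original.encode("utf-8").decode("cp850")

print(repair_mojibake(garbled) == original)
print(repair_mojibake("Jain Paints"))
```

Plain ASCII values pass through unchanged, so a function like this could be applied to a whole column with `.apply`. Setting the response encoding to UTF-8 before reading `.text` might avoid the problem at the source, but that depends on how the server labels its pages.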

In [264]:
# Just to check the data.
rj5.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sanchuri interprises,"Sanchuri interprises, Circular Rd, Indra Nagar...",Bharatpur,Rajasthan,India,,27.211064,77.48159,
1,Singhal Asian Paints,"Singhal Asian Paints, Circular Rd, U.I.T Colon...",Bharatpur,Rajasthan,India,,27.22424,77.48523,
2,Jain Paints,"Jain Paints, Tara Mahendra Colony, Bharatpur, ...",Bharatpur,Rajasthan,India,,27.216227,77.487827,
3,Om paint's store and pipes fitting,"Om paint's store and pipes fitting, Sony show ...",Bharatpur,Rajasthan,India,,27.215137,77.495521,
4,Krishna Paints Enterprises,"Krishna Paints Enterprises, kali bagichi chaur...",Bharatpur,Rajasthan,India,085618 65801,27.217151,77.505117,


In [265]:
# Drop rows 7 to 9, which belong to Abhaneri and Ajmer rather than Bharatpur.
rj5.drop(index=rj5.index[7:10],axis=0,inplace=True)

In [266]:
# Check the data set.
rj5.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sanchuri interprises,"Sanchuri interprises, Circular Rd, Indra Nagar...",Bharatpur,Rajasthan,India,,27.211064,77.48159,
1,Singhal Asian Paints,"Singhal Asian Paints, Circular Rd, U.I.T Colon...",Bharatpur,Rajasthan,India,,27.22424,77.48523,
2,Jain Paints,"Jain Paints, Tara Mahendra Colony, Bharatpur, ...",Bharatpur,Rajasthan,India,,27.216227,77.487827,
3,Om paint's store and pipes fitting,"Om paint's store and pipes fitting, Sony show ...",Bharatpur,Rajasthan,India,,27.215137,77.495521,
4,Krishna Paints Enterprises,"Krishna Paints Enterprises, kali bagichi chaur...",Bharatpur,Rajasthan,India,085618 65801,27.217151,77.505117,


In [267]:
# By utilizing 'pd.concat', we can combine all the dataframes and save the result into a variable.
# To do this, create a list that contains the dataframes and use 'pd.concat' with 'ignore_index=True' to reset the index.
rjt_data = [rj1,rj2,rj3,rj4,rj5]
rjt_data = pd.concat(rjt_data,ignore_index=True)
rjt_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Shri Vinayak Paints,"Shri Vinayak Paints, Near swastic tent house, ...",Ajmer,Rajasthan,India,077378 51720,26.477176,74.647892,
1,Vanshika paints,"Vanshika paints, Nasirabad Rd, Nagra, Ajmer, R...",Ajmer,Rajasthan,India,,26.425234,74.650409,
2,Chadausia Store,"Chadausia Store, Naya Bazar, Naya Bazaar, Ajme...",Ajmer,Rajasthan,India,094140 07068,26.461123,74.631566,
3,Rawat Traders,"Rawat Traders, Mayo Link Rd, Shiva Colony, Ajm...",Ajmer,Rajasthan,India,094145 93209,26.439004,74.652004,
4,Ak Paint And Chemicals,"Ak Paint And Chemicals, Shree Gurukrupa Apartm...",Ajmer,Rajasthan,India,081070 93110,26.464777,74.630353,
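
After a merge, a quick per-city summary can confirm that every expected city is present and show how complete the records are. This is a sketch on a small stand-in dataframe. The variable names are hypothetical and the rows are only a sample, not the real merged data.

```python
import pandas as pd

# Stand-in for the merged state dataframe (a few sample rows only).
merged = pd.DataFrame({
    "city": ["Ajmer", "Ajmer", "Jaipur", "Bharatpur"],
    "phone": ["077378 51720", None, "098293 64760", None],
})

# Number of stores collected per city.
rows_per_city = merged["city"].value_counts()

# Number of stores per city that have no phone number.
missing_phone = merged["phone"].isna().groupby(merged["city"]).sum()

print(rows_per_city)
print(missing_phone)
```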


#**Tamil Nadu**

We collect data for several cities in Tamil Nadu and then merge the per-city results into a single dataframe.

In [268]:
# City Name: Chennai

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/tamil-nadu/Chennai/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
tn1=pd.DataFrame(df)

# Preview the first 10 rows with head().
tn1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,kumar hardware stores,"kumar hardware stores, 20/2, Manickam St, Muni...",Chennai,Tamil Nadu,India,,13.091399,80.261681,
1,Pauls Chemical Corporation,"Pauls Chemical Corporation, 173, General Patte...",Chennai,Tamil Nadu,India,098410 22338,13.064977,80.267596,http://www.dulux.in
2,M. Kanthaswamy Chetty & Co. Agency,"M. Kanthaswamy Chetty & Co. Agency, No. 39, Pa...",Chennai,Tamil Nadu,India,044 2535 0068,13.086872,80.279742,
3,Lakshmi Paints,"Lakshmi Paints, No. 62/1, Thayar Sahib Street,...",Chennai,Tamil Nadu,India,091717 65378,13.064275,80.26792,
4,M.Kandaswamy Chetty & Company,"M.Kandaswamy Chetty & Company, 18, Venkatachal...",Chennai,Tamil Nadu,India,098849 64387,13.086054,80.279634,
5,Asian Paint Dealers,"Asian Paint Dealers, 170, Rasappa Chetty St, P...",Chennai,Tamil Nadu,India,,13.086091,80.279237,https://www.asianpaints.com
6,Suresh paints,"Suresh paints, 136, Swamy Naicken St, Chintadr...",Chennai,Tamil Nadu,India,098402 62115,13.078007,80.270907,http://business.google.com
7,Hindustan Hardware & Paint Mart,"Hindustan Hardware & Paint Mart, New No.180, O...",Chennai,Tamil Nadu,India,044 2535 5080,13.085898,80.27989,
8,Asian Paints Colour World Crescent Hardware Ce...,Asian Paints Colour World Crescent Hardware Ce...,Chennai,Tamil Nadu,India,089254 21247,13.049871,80.274406,
9,jai paints,"jai paints, 22, Mangappan St, Sowcarpet, Eleph...",Chennai,Tamil Nadu,India,098410 54386,13.091232,80.276158,


In [269]:
# City Name: Coimbatore

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/tamil-nadu/Coimbatore/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
tn2=pd.DataFrame(df)

# Preview the first 10 rows with head().
tn2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sri Sakthi Hardwares,"Sri Sakthi Hardwares, 921, Avinashi Rd, Pudur,...",Coimbatore,Tamil Nadu,India,0422 222 1778,11.0149,76.988613,
1,ROALUX PAINTS,"ROALUX PAINTS, 150, Vysial St, Kempatty Colony...",Coimbatore,Tamil Nadu,India,0422 238 1197,10.991933,76.954235,http://business.google.com
2,vaanamaamalai paints,"vaanamaamalai paints, no.205, NSR Rd, Saibaba ...",Coimbatore,Tamil Nadu,India,095974 00266,11.025474,76.939409,https://vaanamaamalai-paints.business.site
3,Sri Vasavi Paints,"Sri Vasavi Paints, Opposite Lakshmi Mills Bus ...",Coimbatore,Tamil Nadu,India,0422 221 0894,11.01676,76.977376,
4,Paint A Point and Hardware,"Paint A Point and Hardware, 55, Ramachandra Rd...",Coimbatore,Tamil Nadu,India,0422 436 6456,11.006558,76.954491,
5,Amutha| Plumber in coimbatore| best plumbing w...,Amutha| Plumber in coimbatore| best plumbing w...,Coimbatore,Tamil Nadu,India,095663 89890,11.042897,76.977806,
6,NIPPON PAINT DEALERS In ganapathy,"NIPPON PAINT DEALERS In ganapathy, Vijayalaksh...",Coimbatore,Tamil Nadu,India,095007 06319,11.035197,76.971912,
7,New India Paint,"New India Paint, NH209, ATT Colony, Gopalapura...",Coimbatore,Tamil Nadu,India,,11.003597,76.966682,
8,Radha And Chalam,"31/29, Radha And Chalam, Vysial Street,Oppanak...",Coimbatore,Tamil Nadu,India,0422 239 0448,10.992203,76.958784,
9,Shanthi paint corporation,"Shanthi paint corporation, No:213, Bharathiyar...",Coimbatore,Tamil Nadu,India,090039 57011,11.050984,76.981926,


In [270]:
# City Name: Madurai

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/tamil-nadu/Madurai/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
tn3=pd.DataFrame(df)

# Preview the first 10 rows with head().
tn3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sri Meenakshi Agencies,"Sri Meenakshi Agencies, DD Main Rd, Pethaniapu...",Madurai,Tamil Nadu,India,087544 03888,9.93632,78.101742,
1,Jai Ganesh Traders,"Jai Ganesh Traders, Priyasamy Nagar, Tamil Nad...",Madurai,Tamil Nadu,India,098434 60472,9.883254,78.111702,
2,Super deluxe hardware mart,"Super deluxe hardware mart, Alanganallur Main ...",Madurai,Tamil Nadu,India,,9.977074,78.099224,
3,Sri Sankareswari Traders,"Sri Sankareswari Traders, 101, MMC Colony Bus ...",Madurai,Tamil Nadu,India,095439 22755,9.892897,78.117168,
4,Sri Madurai Meenakshi Agencies,"Sri Madurai Meenakshi Agencies, No.30, Swamy S...",Madurai,Tamil Nadu,India,098433 91791,9.920815,78.123751,
5,Meenakshi Sundereshwam Paints,"Meenakshi Sundereshwam Paints, MDR227, Sikkand...",Madurai,Tamil Nadu,India,,9.971337,78.097205,
6,Nachiyaar Paints,"Nachiyaar Paints, 4/271, Sourashtrapuram, Mand...",Madurai,Tamil Nadu,India,,9.913836,78.155598,
7,Harini Paint's & Hardware's,"Harini Paint's & Hardware's, Ashok Nagar, Tami...",Madurai,Tamil Nadu,India,,9.939151,78.084751,
8,Sri Jayam Motors,"Sri Jayam Motors, Thathaneri, Tamil Nadu 625018",Madurai,Tamil Nadu,India,,9.939078,78.106309,
9,BJN Paints India Limited,"No. 146, BJN Paints India Limited, NSK Rd, Vel...",Madurai,Tamil Nadu,India,0452 420 0829,9.912657,78.094604,


In [271]:
# City Name: Tiruchirappalli

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/tamil-nadu/Tiruchirappalli/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
tn4=pd.DataFrame(df)

# Preview the first 10 rows with head().
tn4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,"Saravana Hardware Stores (ASIAN PAINTS, fosroc...","Saravana Hardware Stores (ASIAN PAINTS, fosroc...",Tiruchirappalli,Tamil Nadu,India,0431 270 9851,10.814527,78.696436,https://saravana-hardware-stores.business.site
1,Malugu business,"Malugu business, 3/330, Punganur, Tiruchirappa...",Tiruchirappalli,Tamil Nadu,India,,10.785757,78.635618,
2,Annai Tinkering workshop,"Annai Tinkering workshop, Trichy Main Rd, Kill...",Tiruchirappalli,Tamil Nadu,India,,10.89233,78.608175,
3,Addisons Paints & Chemicals (A Division of Sim...,Addisons Paints & Chemicals (A Division of Sim...,Tiruchirappalli,Tamil Nadu,India,0431 266 0091,10.733016,78.716921,
4,Pinturaa Colour Zone,"Pinturaa Colour Zone, No. 55, St. Mary Complex...",Tiruchirappalli,Tamil Nadu,India,098946 05674,10.801691,78.68977,
5,Ó«ÁÓ«┐Ó«¿Ó«¥Ó«»Ó«òÓ«¥ Ó«¬Ó»åÓ«»Ó«┐Ó«®Ó»ìÓ«ƒÓ»ì...,Ó«ÁÓ«┐Ó«¿Ó«¥Ó«»Ó«òÓ«¥ Ó«¬Ó»åÓ«»Ó«┐Ó«®Ó»ìÓ«ƒÓ»ì...,Tiruchirappalli,Tamil Nadu,India,0431 276 0267,10.827091,78.683674,
6,V.Chandra Prakasa Nadar Sons,"V.Chandra Prakasa Nadar Sons, No. 6, Chinna Ka...",Tiruchirappalli,Tamil Nadu,India,0431 270 4404,10.826822,78.697636,
7,SDV Paints & Automobiles,"SDV Paints & Automobiles, 28/28, Edumalai Rd, ...",Tiruchirappalli,Tamil Nadu,India,,10.910426,78.699988,
8,Om Sri Hardwares,"Om Sri Hardwares, 76/1, W.B Road, opp. Ibrahim...",Tiruchirappalli,Tamil Nadu,India,099527 66122,10.81975,78.693786,https://om-sri-hardwares.business.site
9,Sri Preethi Paints,"Shop No, Sri Preethi Paints, 73-A, Salai Rd, n...",Tiruchirappalli,Tamil Nadu,India,0431 276 0768,10.82813,78.683856,


In [272]:
# One store name in this dataset is garbled by a character-encoding problem. We found the store's name on the web by searching for its address,
# and now replace the garbled name with it using the '.str.replace' function.

tn4['name'] = tn4['name'].str.replace("Ó«ÁÓ«┐Ó«¿Ó«¥Ó«»Ó«òÓ«¥ Ó«¬Ó»åÓ«»Ó«┐Ó«®Ó»ìÓ«ƒÓ»ìÓ«©Ó»ì","Vinayaka Paints")
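
Depending on the pandas version, `.str.replace` may treat its first argument as a regular expression, which matters when a name contains characters such as parentheses or `|`. The sketch below shows two ways to make the replacement literal. The sample names are illustrative, with "Old (garbled) name" standing in for the garbled value.

```python
import pandas as pd

# Illustrative names; the first stands in for the garbled value.
names = pd.Series(["Old (garbled) name", "Sri Preethi Paints"])

# Literal substring replacement: regex=False keeps "(" and ")" from being read as a regex group.
fixed = names.str.replace("Old (garbled) name", "Vinayaka Paints", regex=False)

# Whole-value replacement: only cells that equal the old name exactly are changed.
exact = names.replace({"Old (garbled) name": "Vinayaka Paints"})

print(fixed.tolist())
print(exact.tolist())
```

The whole-value form is the stricter of the two, since it cannot alter a longer name that merely contains the old text.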

In [273]:
# The 'address' column of this dataset contains some garbled text that needs to be cleaned up.
# We use '.apply' with a 'lambda' that splits each address on the last garbled characters ("ì,")
# and keeps the final piece, index '[-1]', which is the readable part of the address.
tn4['address'] = tn4['address'].apply(lambda x: x.split("ì,")[-1])
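
The lambda above calls `split` on every value, so it would raise an error if an address were missing, because a missing value is not a string. A vectorized alternative with the `.str` accessor skips missing values. This is a sketch on illustrative addresses, where "garbled" stands in for the unreadable prefix.

```python
import pandas as pd

# Illustrative addresses: one with a garbled prefix, one clean, one missing.
addresses = pd.Series([
    "garbledì, Bus Stop, Salai Road",
    "Plain address, Tiruchirappalli",
    None,
])

# Split on the marker, keep the last piece, and trim the leading space.
cleaned = addresses.str.split("ì,").str[-1].str.strip()

print(cleaned.tolist())
```

Addresses without the marker are returned whole, because splitting them yields a single piece.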

In [274]:
tn4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,"Saravana Hardware Stores (ASIAN PAINTS, fosroc...","Saravana Hardware Stores (ASIAN PAINTS, fosroc...",Tiruchirappalli,Tamil Nadu,India,0431 270 9851,10.814527,78.696436,https://saravana-hardware-stores.business.site
1,Malugu business,"Malugu business, 3/330, Punganur, Tiruchirappa...",Tiruchirappalli,Tamil Nadu,India,,10.785757,78.635618,
2,Annai Tinkering workshop,"Annai Tinkering workshop, Trichy Main Rd, Kill...",Tiruchirappalli,Tamil Nadu,India,,10.89233,78.608175,
3,Addisons Paints & Chemicals (A Division of Sim...,Addisons Paints & Chemicals (A Division of Sim...,Tiruchirappalli,Tamil Nadu,India,0431 266 0091,10.733016,78.716921,
4,Pinturaa Colour Zone,"Pinturaa Colour Zone, No. 55, St. Mary Complex...",Tiruchirappalli,Tamil Nadu,India,098946 05674,10.801691,78.68977,
5,Vinayaka Paints,"Bus Stop, Salai Road , 15/8, Big Sowrastra St...",Tiruchirappalli,Tamil Nadu,India,0431 276 0267,10.827091,78.683674,
6,V.Chandra Prakasa Nadar Sons,"V.Chandra Prakasa Nadar Sons, No. 6, Chinna Ka...",Tiruchirappalli,Tamil Nadu,India,0431 270 4404,10.826822,78.697636,
7,SDV Paints & Automobiles,"SDV Paints & Automobiles, 28/28, Edumalai Rd, ...",Tiruchirappalli,Tamil Nadu,India,,10.910426,78.699988,
8,Om Sri Hardwares,"Om Sri Hardwares, 76/1, W.B Road, opp. Ibrahim...",Tiruchirappalli,Tamil Nadu,India,099527 66122,10.81975,78.693786,https://om-sri-hardwares.business.site
9,Sri Preethi Paints,"Shop No, Sri Preethi Paints, 73-A, Salai Rd, n...",Tiruchirappalli,Tamil Nadu,India,0431 276 0768,10.82813,78.683856,


In [275]:
# City Name: Tirunelveli

# Set the request headers; the User-Agent makes the request look like it comes from a browser.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page to download the data from.
url='https://rentechdigital.com/smartscraper/business-report-details/india/tamil-nadu/Tirunelveli/paint-stores'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in a named dataframe with 'pd.DataFrame'.
tn5=pd.DataFrame(df)

# Preview the first 10 rows with head().
tn5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Kalasri Agencies,"Kalasri Agencies, kurunji nagar venkatachalapu...",Tirunelveli,Tamil Nadu,India,,8.737934,77.766037,
1,Sri Andal Traders,"Sri Andal Traders, 30, Swami Nellaiappar High ...",Tirunelveli,Tamil Nadu,India,094433 57525,8.728887,77.704096,
2,HAJA HARDWARES,"HAJA HARDWARES, 56, High Road, Tirunelveli Jun...",Tirunelveli,Tamil Nadu,India,0462 233 4251,8.728654,77.71069,
3,SELVA KUMARAN STORES,"SELVA KUMARAN STORES, 988 a mangammal salai, A...",Tirunelveli,Tamil Nadu,India,098652 81188,8.722248,77.77151,
4,T.P. KANNU&CO,"T.P. KANNU&CO, 26 D/2, S.N. High Road, Tirunel...",Tirunelveli,Tamil Nadu,India,0462 232 2117,8.728811,77.703617,
5,R.R Paints & Hardwares,"R.R Paints & Hardwares, No.1 Therkku Mohideen ...",Tirunelveli,Tamil Nadu,India,,8.696082,77.710815,
6,Nova traders,"Nova traders, SH 40, Rahmath Nagar, Palayamkot...",Tirunelveli,Tamil Nadu,India,,8.720776,77.75631,
7,KARTHIKEYAN STORE,"KARTHIKEYAN STORE, 118, Sivanthipatti Rd, Anbu...",Tirunelveli,Tamil Nadu,India,098659 51188,8.704655,77.750539,
8,Lavanya Store,"Lavanya Store, St. John's College Road, Perinb...",Tirunelveli,Tamil Nadu,India,098949 99430,8.723032,77.741967,https://lavanya-store-paint-store.business.site
9,KARTHIKEYAN ENTERPRISES,"KARTHIKEYAN ENTERPRISES, 93A/1 Bye Pass Raod, ...",Tirunelveli,Tamil Nadu,India,094434 56071,8.731448,77.722091,


In [276]:
# City Name: Salem

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/tamil-nadu/Salem/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
tn6 = pd.DataFrame(df)

# Show the first 10 rows with head().
tn6.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Suganya Hardwares,"Suganya Hardwares, 344.B, Yercaud Main Rd, KK ...",Salem,Tamil Nadu,India,098651 83143,11.697861,78.165609,
1,Salem Paints,"Salem Paints, 1, Senthil Plaza, Junction Main ...",Salem,Tamil Nadu,India,097510 70719,11.674283,78.124336,
2,LAKSHMI PAINTS,"LAKSHMI PAINTS, jayarathna complex, Omalur mai...",Salem,Tamil Nadu,India,094432 33883,11.682155,78.12676,
3,Bharath Paints,"Bharath Paints, Junction Main Rd, Subramania N...",Salem,Tamil Nadu,India,099420 10117,11.671366,78.116923,
4,VKM Trading - Berger Paints Store,"VKM Trading - Berger Paints Store, Berger Pain...",Salem,Tamil Nadu,India,,11.631073,78.129075,
5,Varnam Paints,"Varnam Paints, 40/10, K.M. Nagar, Ayothiyapatt...",Salem,Tamil Nadu,India,098943 35285,11.672474,78.232147,
6,Asian Paints Limited,"Asian Paints Limited, 2, Suramangalam Main Rd,...",Salem,Tamil Nadu,India,,11.66963,78.121773,https://www.asianpaints.com
7,J.K.Paints,"J.K.Paints, 3/54, Kamala Complex, Meyyanur Roa...",Salem,Tamil Nadu,India,098420 21808,11.674226,78.133579,
8,Asian Paints EzyColour AYYAPPA HARDWARES,"Asian Paints EzyColour AYYAPPA HARDWARES, NEXT...",Salem,Tamil Nadu,India,094872 10194,11.67997,78.114042,https://www.asianpaints.com
9,Pandiyan Paints,"Pandiyan Paints, Omalur Main Rd, SIDCO Industr...",Salem,Tamil Nadu,India,,11.676104,78.133788,


In [277]:
# Combine the per-city dataframes with 'pd.concat' and store the result in a variable.
# Passing 'ignore_index=True' resets the index so the rows are numbered continuously.
tnd_data = [tn1, tn2, tn3, tn4, tn5, tn6]
tnd_data = pd.concat(tnd_data, ignore_index=True)
tnd_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,kumar hardware stores,"kumar hardware stores, 20/2, Manickam St, Muni...",Chennai,Tamil Nadu,India,,13.091399,80.261681,
1,Pauls Chemical Corporation,"Pauls Chemical Corporation, 173, General Patte...",Chennai,Tamil Nadu,India,098410 22338,13.064977,80.267596,http://www.dulux.in
2,M. Kanthaswamy Chetty & Co. Agency,"M. Kanthaswamy Chetty & Co. Agency, No. 39, Pa...",Chennai,Tamil Nadu,India,044 2535 0068,13.086872,80.279742,
3,Lakshmi Paints,"Lakshmi Paints, No. 62/1, Thayar Sahib Street,...",Chennai,Tamil Nadu,India,091717 65378,13.064275,80.26792,
4,M.Kandaswamy Chetty & Company,"M.Kandaswamy Chetty & Company, 18, Venkatachal...",Chennai,Tamil Nadu,India,098849 64387,13.086054,80.279634,
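
As a small illustration of what `ignore_index=True` changes, the sketch below concatenates two toy dataframes. The frames `a` and `b` and their rows are made up for the example; they only stand in for per-city frames such as `tn1` and `tn2`.

```python
import pandas as pd

# Toy stand-ins for two per-city dataframes (made-up rows, not scraped data).
a = pd.DataFrame({"name": ["Store A1", "Store A2"], "city": ["City A", "City A"]})
b = pd.DataFrame({"name": ["Store B1", "Store B2"], "city": ["City B", "City B"]})

# Without ignore_index, each frame keeps its own 0..n-1 labels, so labels repeat.
kept = pd.concat([a, b])

# With ignore_index=True, the combined frame is renumbered from 0.
renumbered = pd.concat([a, b], ignore_index=True)

print(list(kept.index))        # [0, 1, 0, 1]
print(list(renumbered.index))  # [0, 1, 2, 3]
```

Repeated index labels make label-based lookups ambiguous, which is one reason to reset the index when stacking city tables.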


#**Telangana**

After obtaining data from various cities in Telangana, we will consolidate it into a unified dataset.

In [278]:
# City Name: Hyderabad

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/telangana/Hyderabad/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
tl1 = pd.DataFrame(df)

# Show the first 10 rows with head().
tl1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sankalp Enterprises,"Sankalp Enterprises, Bank St, Troop Bazaar, Ab...",Hyderabad,Telangana,India,090145 72777,17.388102,78.476969,http://www.sankalpenterprises.in
1,Home Needs asian paints,"Home Needs asian paints, 13-3-821, Jiyaguda Ro...",Hyderabad,Telangana,India,,17.369686,78.449585,https://www.asianpaints.com
2,Sri Sai Srinivasa Paints & Hardware,"Sri Sai Srinivasa Paints & Hardware, Srt 234, ...",Hyderabad,Telangana,India,040 5562 7167,17.402311,78.495084,
3,Royal Hardware Sanitary Paints,"Royal Hardware Sanitary Paints, # 4-2-1060, Ra...",Hyderabad,Telangana,India,088019 47417,17.392318,78.489075,
4,Bhaskar Enamel Industries,"Bhaskar Enamel Industries, 5-2-926, opposite G...",Hyderabad,Telangana,India,040 2461 6767,17.379017,78.475505,
5,MBK Paints & Hardware,"Plot No. 29, MBK Paints & Hardware, P&T Colony...",Hyderabad,Telangana,India,,17.361233,78.399558,
6,Vaishnavi Enterprises.,"Vaishnavi Enterprises., 15-5-199, Osmanshai, G...",Hyderabad,Telangana,India,,17.376599,78.479826,
7,Hi - Tech Colours,"Hi - Tech Colours, Shop No 19-1-876/4, Chowras...",Hyderabad,Telangana,India,040 2446 1951,17.408037,78.488388,
8,Gopi Electrical & Hardware Paints & Sanitary,"Gopi Electrical & Hardware Paints & Sanitary, ...",Hyderabad,Telangana,India,099089 99090,17.355933,78.422033,
9,Shakti Paints & Chemicals,"Shakti Paints & Chemicals, 24/1/B, Gandhinagar...",Hyderabad,Telangana,India,040 2307 3380,17.415843,78.48915,


In [279]:
# City Name: Warangal

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/telangana/Warangal/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
tl2 = pd.DataFrame(df)

# Show the first 10 rows with head().
tl2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Ch. Sridhar Arts (Rajart),"Ch. Sridhar Arts (Rajart), 11-25-220/1, Auto N...",Warangal,Telangana,India,,17.991887,79.598523,
1,Hanuman Enterprises,"Hanuman Enterprises, Nayeem Nagar, Lashkar Sin...",Warangal,Telangana,India,,18.015078,79.542472,
2,Thirumala Paints,"Thirumala Paints, Near Desaipet Road,11-24/343...",Warangal,Telangana,India,093940 23810,17.988875,79.602095,
3,Balaji Paints,"Balaji Paints, House No-24-7-215 Main Road Bes...",Warangal,Telangana,India,099498 92633,17.989077,79.534943,
4,Reddy Arts and Paints,"Reddy Arts and Paints, 4-4-192, NH163, Kothur ...",Warangal,Telangana,India,,18.00851,79.568507,
5,Bhadrakali Paints,"H.No. 3-14-665, Bhadrakali Paints, Kakatiya Un...",Warangal,Telangana,India,,18.019979,79.562863,
6,Ravindra Paints Industries,"Ravindra Paints Industries, 4-6-126, Reddy Col...",Warangal,Telangana,India,0870 245 1757,18.014234,79.571121,
7,Sri Veerabhadra Paints,"Sri Veerabhadra Paints, 2, Gaurav St, Lashkar ...",Warangal,Telangana,India,098493 39488,18.014433,79.554367,
8,Kodakandla Romaniah Paints & Hardware,"H.No.1-4-96, Kodakandla Romaniah Paints & Hard...",Warangal,Telangana,India,098497 27278,17.976057,79.509114,
9,Balaji paints,"Balaji paints, near vidyodaya high school, Huz...",Warangal,Telangana,India,,18.282122,79.4727,


In [280]:
# City Name: Alampur

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
# Note: unlike the other cells, this URL has no city segment; the rows it returns are labelled Ālampur.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/telangana/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
tl3 = pd.DataFrame(df)

# Show the first 10 rows with head().
tl3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M/s. G.P. Hardware,"Shop No. 46/88A, M/s. G.P. Hardware, Near, Gow...",Ālampur,Telangana,India,093979 53110,15.824056,78.0401,
1,M.N. Paints Hardware Sanitary,"M.N. Paints Hardware Sanitary, Ram Rahim Nagar...",Ālampur,Telangana,India,097013 02929,15.807407,78.033145,
2,A1-Safa hardware paints & sanitary,"A1-Safa hardware paints & sanitary, Ambedkar N...",Ālampur,Telangana,India,,15.841488,78.013084,
3,Varalakshmi Paints & Hardware,"Varalakshmi Paints & Hardware, D.No. 87/1362/C...",Ālampur,Telangana,India,094409 24768,15.797067,78.056025,
4,M/s. Jaleel Paints & Hardware,"M/s. Jaleel Paints & Hardware, S. No. 25, A.P....",Ālampur,Telangana,India,098486 26094,15.825031,78.021858,https://ms-jaleel-paints-hardware.business.site
5,Apex Enterprises,"Apex Enterprises, 26/191/1, Chittari St, Mohal...",Ālampur,Telangana,India,096523 82456,15.823122,78.054715,https://apexenterprises-paintstore.business.site
6,"Y. Lakshmi Devamma Cement, Paints & Hardware","s.no, Y. Lakshmi Devamma Cement, Paints & Hard...",Ālampur,Telangana,India,094400 91918,15.80702,78.034608,
7,S.A. Gaffar & Sons,"S.A. Gaffar & Sons, Maddur Nagar, Kurnool, And...",Ālampur,Telangana,India,091775 75074,15.811529,78.039762,
8,All Tawakkal Sanitary Hardware & Paints,"All Tawakkal Sanitary Hardware & Paints, Roja ...",Ālampur,Telangana,India,090908 57806,15.824607,78.018612,
9,Fakhruddin Kurban Husen Hardware & Paints,"Fakhruddin Kurban Husen Hardware & Paints, 18-...",Ālampur,Telangana,India,095737 37603,15.830931,78.049632,


In [281]:
# Combine the per-city dataframes with 'pd.concat' and store the result in a variable.
# Passing 'ignore_index=True' resets the index so the rows are numbered continuously.
tlg_data = [tl1, tl2, tl3]
tlg_data = pd.concat(tlg_data, ignore_index=True)
tlg_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sankalp Enterprises,"Sankalp Enterprises, Bank St, Troop Bazaar, Ab...",Hyderabad,Telangana,India,090145 72777,17.388102,78.476969,http://www.sankalpenterprises.in
1,Home Needs asian paints,"Home Needs asian paints, 13-3-821, Jiyaguda Ro...",Hyderabad,Telangana,India,,17.369686,78.449585,https://www.asianpaints.com
2,Sri Sai Srinivasa Paints & Hardware,"Sri Sai Srinivasa Paints & Hardware, Srt 234, ...",Hyderabad,Telangana,India,040 5562 7167,17.402311,78.495084,
3,Royal Hardware Sanitary Paints,"Royal Hardware Sanitary Paints, # 4-2-1060, Ra...",Hyderabad,Telangana,India,088019 47417,17.392318,78.489075,
4,Bhaskar Enamel Industries,"Bhaskar Enamel Industries, 5-2-926, opposite G...",Hyderabad,Telangana,India,040 2461 6767,17.379017,78.475505,
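
The per-city cells followed by one `pd.concat` call could also be folded into a loop. The sketch below is one possible shape, not the notebook's actual method: `collect_state` and `fake_fetch` are hypothetical names, and `fake_fetch` returns made-up rows so the example runs without network access.

```python
from typing import Callable, Iterable

import pandas as pd


def collect_state(urls: Iterable[str], fetch: Callable[[str], pd.DataFrame]) -> pd.DataFrame:
    """Fetch one table per city URL and return them as a single dataframe."""
    frames = [fetch(url) for url in urls]  # one dataframe per city page
    # Same concat call as in the cell above, applied to the collected list.
    return pd.concat(frames, ignore_index=True)


def fake_fetch(url: str) -> pd.DataFrame:
    """Offline stand-in for the download step (hypothetical, returns made-up rows)."""
    city = url.rstrip("/").split("/")[-2]  # the city is the second-to-last path segment
    return pd.DataFrame({"name": [f"{city} store 1", f"{city} store 2"], "city": [city, city]})


base = "https://rentechdigital.com/smartscraper/business-report-details/india/telangana"
urls = [f"{base}/Hyderabad/paint-stores", f"{base}/Warangal/paint-stores"]

demo = collect_state(urls, fake_fetch)
print(demo)
```

In the notebook itself, the `fetch` argument would wrap the steps already used in each cell: `requests.get(url, headers=headers).text` followed by `pd.read_html(...)[0]`.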


#**Uttar Pradesh**

The data collected from different cities in Uttar Pradesh will be merged into a single dataset after the collection phase.

In [282]:
# City Name: Greater Noida

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data (the space in the city name is encoded as %20).
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Greater%20Noida/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up1 = pd.DataFrame(df)

# Show the first 10 rows with head().
up1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Prince Paint Sanitary & Hardware Store,"Prince Paint Sanitary & Hardware Store, Lakhna...",Greater Noida,Uttar Pradesh,India,099719 01818,28.516884,77.487383,
1,Prince Paint Sanitary & Hardware Store,"Prince Paint Sanitary & Hardware Store, Side-C...",Greater Noida,Uttar Pradesh,India,099719 01818,28.5199,77.458724,
2,Rajnish Paint & Hardware,"Rajnish Paint & Hardware, Shop No. 1, Basement...",Greater Noida,Uttar Pradesh,India,099996 48511,28.5199,77.458724,
3,Nandani Paints & Hardware Store,"Nandani Paints & Hardware Store, Near Lakhnawa...",Greater Noida,Uttar Pradesh,India,099105 66504,28.513707,77.487934,
4,Rajnish Paints And Hardware,"Rajnish Paints And Hardware, Amritpuram, Block...",Greater Noida,Uttar Pradesh,India,097184 03338,28.479896,77.500687,
5,Naval Paints And Hardware,"Naval Paints And Hardware, Tugalpur, Greater N...",Greater Noida,Uttar Pradesh,India,,28.466546,77.504611,
6,KANAK PAINTS,"KANAK PAINTS, Sector MU 1 Rd, near Shahid Park...",Greater Noida,Uttar Pradesh,India,099119 11390,28.468855,77.56383,
7,Sumit Paint House,"Sumit Paint House, Opposite Janhit College Cha...",Greater Noida,Uttar Pradesh,India,085108 06281,28.469964,77.504436,
8,"Bajrang Paints,Hardware & Sanitary Store","Bajrang Paints,Hardware & Sanitary Store, Vill...",Greater Noida,Uttar Pradesh,India,097172 01172,28.462682,77.540762,
9,om sai sanitary paint and hardware,"om sai sanitary paint and hardware, Bhagat mar...",Greater Noida,Uttar Pradesh,India,099902 04100,28.465645,77.50808,
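
City names that contain a space, such as Greater Noida, have to be percent-encoded in the URL. The sketch below builds the page URL from a state slug and a city name. `build_url` and `BASE_URL` are hypothetical names, and the URL pattern is only inferred from the URLs used in this notebook, so other pages on the site may not follow it.

```python
from urllib.parse import quote

# Common prefix of the URLs used in this notebook.
BASE_URL = "https://rentechdigital.com/smartscraper/business-report-details/india"


def build_url(state_slug: str, city: str) -> str:
    """Return the paint-stores page URL for a state slug and a city name."""
    # quote() percent-encodes characters such as spaces.
    return f"{BASE_URL}/{state_slug}/{quote(city)}/paint-stores"


print(build_url("uttar-pradesh", "Greater Noida"))
# https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Greater%20Noida/paint-stores
```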


In [283]:
# City Name: Kanpur

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Kanpur/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up2 = pd.DataFrame(df)

# Show the first 10 rows with head().
up2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Bhatia Paints & Hardware,"272, Bhatia Paints & Hardware, Z-1, Barra-1, B...",Kanpur,Uttar Pradesh,India,094155 35356,26.41353,80.300201,
1,Yadav paints & Hardware,"Yadav paints & Hardware, Lal Banglal, near IVP...",Kanpur,Uttar Pradesh,India,,26.400829,80.413178,
2,Prakash Agencies,"Prakash Agencies, 1/1, Nai Sadak, Collectorgan...",Kanpur,Uttar Pradesh,India,094150 45245,26.458242,80.349863,
3,Jyoti Enterprises,"Jyoti Enterprises, Chandra Nagar Near Charrari...",Kanpur,Uttar Pradesh,India,099566 82403,26.405183,80.390907,http://business.google.com
4,Rajendra Paints,"Rajendra Paints, 95/94, Nai Sarak Road, Munna ...",Kanpur,Uttar Pradesh,India,098387 21808,26.472463,80.344573,
5,Nishat Paint,"Nishat Paint, 63/10, Gadaria Mohal, Near Apdar...",Kanpur,Uttar Pradesh,India,093052 35884,26.462665,80.359549,
6,Vinod And Company,"48/292, Vinod And Company, Hafsa Bint Umar St,...",Kanpur,Uttar Pradesh,India,,26.465465,80.351035,
7,Self,"Self, Sanjay Gandhi Nagar, 8858306268, Kanpur,...",Kanpur,Uttar Pradesh,India,,26.413123,80.30964,
8,Rainbow Colour Company,"Rainbow Colour Company, 26.468593, 80.326924, ...",Kanpur,Uttar Pradesh,India,099367 05000,26.470064,80.328067,
9,Colour World,"Colour World, central Plaza gopinath puram uan...",Kanpur,Uttar Pradesh,India,,26.485036,80.391462,


In [284]:
# City Name: Lucknow

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Lucknow/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up3 = pd.DataFrame(df)

# Show the first 10 rows with head().
up3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,RAINBOW AUTO COLOR LAB,"RAINBOW AUTO COLOR LAB, Captain Aditya Mishra ...",Lucknow,Uttar Pradesh,India,,26.883429,80.979778,
1,Abhilasha Enterprises,"Abhilasha Enterprises, 12-A, Station Road, Shi...",Lucknow,Uttar Pradesh,India,092362 60668,26.841782,80.930046,
2,Bajpai Paints Store,"Bajpai Paints Store, 55/4, 55/4 Gautam Buddh M...",Lucknow,Uttar Pradesh,India,094151 59760,26.83985,80.92513,
3,Plaza Paint Mobile Shop,"Plaza Paint Mobile Shop, Shop No-9,Cantt Chour...",Lucknow,Uttar Pradesh,India,095594 31024,26.824415,80.945758,
4,Goel Brothers,"Goel Brothers, 78, Gautam Budh Marg, Gautam Bu...",Lucknow,Uttar Pradesh,India,0522 395 1472,26.84861,80.94718,
5,Mohan Lime Store,"Mohan Lime Store, Subhash Marg, Tedhi Bazar, D...",Lucknow,Uttar Pradesh,India,098396 67221,26.852457,80.916274,
6,R K Paint,"R K Paint, Gola Bazar, Gola Bazar Sadar, Shiv ...",Lucknow,Uttar Pradesh,India,093896 44273,26.83751,80.93875,
7,Goyal Banbhu Asian Paints,"Goyal Banbhu Asian Paints, 35/3, Yadav Market,...",Lucknow,Uttar Pradesh,India,0522 325 1422,26.845069,80.927913,
8,Sarswati Enterprises,"Sarswati Enterprises, Ramashray Purwa, Khargap...",Lucknow,Uttar Pradesh,India,,26.841524,81.017155,
9,B.K Enterprises,"B.K Enterprises, opposite primary school, Awad...",Lucknow,Uttar Pradesh,India,098389 51505,26.835857,81.026346,


In [285]:
# City Name: Agra

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Agra/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up4 = pd.DataFrame(df)

# Show the first 10 rows with head().
up4.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Raj Paints,"Raj Paints, Maruti Estate Rd, Albatiya, Infron...",Agra,Uttar Pradesh,India,097611 71551,27.181515,77.957309,
1,R.D.A Paints World,"R.D.A Paints World, naripura, Dhanauli, Agra, ...",Agra,Uttar Pradesh,India,,27.150777,77.975105,
2,Shree Ram Colours,"Shree Ram Colours, Amba Prasad Rd, Opposite Sa...",Agra,Uttar Pradesh,India,091528 61877,27.189455,77.964874,
3,Radhika Paints,"Radhika Paints, Kharai Panna Place Ke Samne,, ...",Agra,Uttar Pradesh,India,097585 55779,27.140867,78.045781,
4,Shri Ram ji paint house,"Shri Ram ji paint house, Bundu Katra, Agra Can...",Agra,Uttar Pradesh,India,,27.144361,78.009248,
5,Jai Durge Maa Paints House,"Shop No.109 MIG, Jai Durge Maa Paints House, J...",Agra,Uttar Pradesh,India,098972 95107,27.198537,77.961955,
6,Vishal Paint Store,"Shop No.20/20, Vishal Paint Store, Ratan Muni ...",Agra,Uttar Pradesh,India,098975 15487,27.188074,77.988953,
7,New Goyal Paint Mart,"Shop No. 26,27, New Goyal Paint Mart, Maa Durg...",Agra,Uttar Pradesh,India,074090 68584,27.19011,77.990157,
8,Shanker Paint Mart,"Shanker Paint Mart, 15/28, Chouraha Pashiya Rd...",Agra,Uttar Pradesh,India,092591 86759,27.177823,77.989117,
9,P.C. Paint House,"Shop No.3, P.C. Paint House, 225, Rui Ki Mandi...",Agra,Uttar Pradesh,India,089583 10648,27.174668,77.988634,


In [286]:
# City Name: Meerut

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Meerut/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up5 = pd.DataFrame(df)

# Show the first 10 rows with head().
up5.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M.S. Hardware And Paint,"M.S. Hardware And Paint, Rohta Road, Meerut, U...",Meerut,Uttar Pradesh,India,089090 37677,28.98958,77.659562,
1,Singhal paints and hardware store,"Singhal paints and hardware store, Jagriti Vih...",Meerut,Uttar Pradesh,India,,28.959563,77.740589,
2,Singhal Paints,"Singhal Paints, 30/3, Jagrati Vihar, Meerut, U...",Meerut,Uttar Pradesh,India,098977 61090,28.954235,77.748685,http://www.singhalpaints.com
3,Rahul Paints,"Rahul Paints, Lalla Pur, Meerut, Uttar Pradesh...",Meerut,Uttar Pradesh,India,096396 78905,28.964741,77.669755,
4,Raj Paint and Hardware Store,"Raj Paint and Hardware Store, 492/3, New Sadak...",Meerut,Uttar Pradesh,India,089791 54344,28.963291,77.729184,
5,Mahaveer Paints,"Mahaveer Paints, Roorkee Road, Meerut, 250001",Meerut,Uttar Pradesh,India,097606 92026,29.038068,77.706424,
6,Pundir Paints,"Pundir Paints, Roorkee Road, Meerut, Uttar Pra...",Meerut,Uttar Pradesh,India,099274 51224,29.037931,77.706409,
7,Neha Paints And Hardware,"Neha Paints And Hardware, Kaserukhera, Meerut,...",Meerut,Uttar Pradesh,India,096752 10280,29.008912,77.736972,
8,M/s Laxmi Paints,"M/s Laxmi Paints, 339/4, Gol Mandir Rd, near Z...",Meerut,Uttar Pradesh,India,,28.962526,77.724697,
9,Uttam Traders,"Uttam Traders, Pillokhari Rd, Shyam Nagar, Mas...",Meerut,Uttar Pradesh,India,,28.966442,77.706246,


In [287]:
# City Name: Varanasi

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Varanasi/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up6 = pd.DataFrame(df)

# Show the first 10 rows with head().
up6.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Rangshala,"Rangshala, Shop No.D-59/103, Smith School Road...",Varanasi,Uttar Pradesh,India,094156 97027,25.30877,82.98689,
1,Krishna Agencies,"Krishna Agencies, Bhojubeer,Sabzi Mandi, Varan...",Varanasi,Uttar Pradesh,India,,25.354465,82.974576,
2,New Shri Ram Paints,"New Shri Ram Paints, Shivpur Bypass Rd, Shivpu...",Varanasi,Uttar Pradesh,India,083549 62082,25.356407,82.965141,
3,Pawansut Paints,"Pawansut Paints, Raja Moti Chand Rd, Manduwadi...",Varanasi,Uttar Pradesh,India,090442 56290,25.302155,82.969027,
4,Neeraj Paints & Hardware,"Neeraj Paints & Hardware, near Stte Bank Of In...",Varanasi,Uttar Pradesh,India,099359 56764,25.356104,82.951726,
5,Chaurasia Paint Store,"S-8/252,A-K, Chaurasia Paint Store, Hukulganj ...",Varanasi,Uttar Pradesh,India,098390 75545,25.348001,82.993871,
6,Chaurasia Paint Store,"Chaurasia Paint Store, S-8/256-2A-1, Premchand...",Varanasi,Uttar Pradesh,India,098380 45545,25.348128,82.994043,
7,Avas Sanitary & Paints Store,"Avas Sanitary & Paints Store, Mahmoorganj Sigr...",Varanasi,Uttar Pradesh,India,093052 88350,25.30571,82.981026,
8,SANJAY PAINTS,"SANJAY PAINTS, J 15/65 D-3, Doshipura, Chaukag...",Varanasi,Uttar Pradesh,India,,25.336292,83.007022,
9,Shiv Paints,"Shiv Paints, Nadesar Police Chowki Ke Bagal Ma...",Varanasi,Uttar Pradesh,India,072329 80517,25.331722,82.988774,


In [288]:
# City Name: Bareilly

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Bareilly/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up7 = pd.DataFrame(df)

# Show the first 10 rows with head().
up7.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Paint shop,"Paint shop, 694 beharipur civel lines, near ci...",Bareilly,Uttar Pradesh,India,,28.356236,79.404411,
1,Shivay Traders,"Shivay Traders, Bareilly - Badaun Rd, Patel Bi...",Bareilly,Uttar Pradesh,India,089587 30438,28.332262,79.397791,
2,Shri Hariprabha Enterprises,"Shri Hariprabha Enterprises, Ravinder Nagar Co...",Bareilly,Uttar Pradesh,India,,28.34186,79.402386,
3,Shivay Traders,"Shivay Traders, Shiv Mandir, Dohra, Lalpur Roa...",Bareilly,Uttar Pradesh,India,089587 30438,28.37632,79.463369,
4,Sri Ram Enterprises,"Sri Ram Enterprises, Sri Balaji Temple, Budaun...",Bareilly,Uttar Pradesh,India,097195 89416,28.33233,79.397471,
5,Ahaan paint store,"Ahaan paint store, Gulab Nagar, Bareilly, Utta...",Bareilly,Uttar Pradesh,India,,28.373791,79.412794,
6,SHINRAI ENTERPRISES,"SHINRAI ENTERPRISES, Civil Lines, Bareilly, Ut...",Bareilly,Uttar Pradesh,India,081940 21801,28.353012,79.419052,
7,M/s. Kumar Traders,"M/s. Kumar Traders, Nainital Rd, opposite M.B....",Bareilly,Uttar Pradesh,India,098379 08952,28.380429,79.418304,
8,Singh paint store,"Singh paint store, Óñ░ÓÑçÓñÂÓñ«Óñ¥ Óñ¿Óñ░ÓÑìÓñ...",Bareilly,Uttar Pradesh,India,,28.390075,79.411457,
9,R A Agencies,"R A Agencies, Rp Enclave pbt bye pass road, op...",Bareilly,Uttar Pradesh,India,093591 01301,28.35682,79.450818,


In [289]:
# Some entries in the 'address' column carry unreadable, mis-encoded text ahead of the real address.
# To remove it, use '.apply' with a 'lambda' that splits each address on the marker "«,"
# and keeps the last piece (index [-1]); addresses without the marker are left unchanged.

up7['address'] = up7['address'].apply(lambda x: x.split("«,")[-1])

In [290]:
up7.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Paint shop,"Paint shop, 694 beharipur civel lines, near ci...",Bareilly,Uttar Pradesh,India,,28.356236,79.404411,
1,Shivay Traders,"Shivay Traders, Bareilly - Badaun Rd, Patel Bi...",Bareilly,Uttar Pradesh,India,089587 30438,28.332262,79.397791,
2,Shri Hariprabha Enterprises,"Shri Hariprabha Enterprises, Ravinder Nagar Co...",Bareilly,Uttar Pradesh,India,,28.34186,79.402386,
3,Shivay Traders,"Shivay Traders, Shiv Mandir, Dohra, Lalpur Roa...",Bareilly,Uttar Pradesh,India,089587 30438,28.37632,79.463369,
4,Sri Ram Enterprises,"Sri Ram Enterprises, Sri Balaji Temple, Budaun...",Bareilly,Uttar Pradesh,India,097195 89416,28.33233,79.397471,
5,Ahaan paint store,"Ahaan paint store, Gulab Nagar, Bareilly, Utta...",Bareilly,Uttar Pradesh,India,,28.373791,79.412794,
6,SHINRAI ENTERPRISES,"SHINRAI ENTERPRISES, Civil Lines, Bareilly, Ut...",Bareilly,Uttar Pradesh,India,081940 21801,28.353012,79.419052,
7,M/s. Kumar Traders,"M/s. Kumar Traders, Nainital Rd, opposite M.B....",Bareilly,Uttar Pradesh,India,098379 08952,28.380429,79.418304,
8,Singh paint store,"Mini Bypass, Karamchari Nagar, Bareilly, Utta...",Bareilly,Uttar Pradesh,India,,28.390075,79.411457,
9,R A Agencies,"R A Agencies, Rp Enclave pbt bye pass road, op...",Bareilly,Uttar Pradesh,India,093591 01301,28.35682,79.450818,
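
The same split-and-keep-last idea can also be written with pandas string methods instead of `.apply`. The sketch below is an alternative form on made-up addresses, not the notebook's own cell: the junk text `xx` is a placeholder for the mis-encoded characters, and the added `.str.strip()` is an assumption that leading whitespace left after the split is unwanted.

```python
import pandas as pd

# Made-up addresses: the second one has placeholder junk text ending in the marker "«,".
demo = pd.DataFrame({
    "address": [
        "Shop 1, Main Road, Bareilly",
        "Sample store, xx«, Mini Bypass, Bareilly",
    ]
})

# Split on the marker, keep the last piece, then trim leftover whitespace.
demo["address"] = demo["address"].str.split("«,").str[-1].str.strip()

print(demo["address"].tolist())
# ['Shop 1, Main Road, Bareilly', 'Mini Bypass, Bareilly']
```

Unlike the `lambda` version, the `.str` methods pass missing values through as missing instead of raising an error on them.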


In [291]:
# City Name: Gorakhpur

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Gorakhpur/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up8 = pd.DataFrame(df)

# Show the first 10 rows with head().
up8.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Sri Ram Pipe and Sanitary Store,"Sri Ram Pipe and Sanitary Store, near Gorakhna...",Gorakhpur,Uttar Pradesh,India,094528 47593,26.774725,83.35949,
1,Yash Hadware And Paints,"Yash Hadware And Paints, Maruthi Complex, Tara...",Gorakhpur,Uttar Pradesh,India,094158 56923,26.734349,83.376233,
2,P Madhusudan Chemicals India,"P Madhusudan Chemicals India, Charfatak, Mohad...",Gorakhpur,Uttar Pradesh,India,094150 82923,26.75324,83.398579,
3,Baba paints,"Baba paints, Kachery Rd, near Vishal Mega Mart...",Gorakhpur,Uttar Pradesh,India,,26.757004,83.368145,
4,Sri Paint & Hardware Machinery Store,"Sri Paint & Hardware Machinery Store, Moharipu...",Gorakhpur,Uttar Pradesh,India,092353 03115,26.814297,83.350484,
5,Bharat trading company,"Bharat trading company, Turakmanpur, TP Nagar,...",Gorakhpur,Uttar Pradesh,India,,26.73841,83.356155,
6,Pathak Hardware & Paints,"Pathak Hardware & Paints, Tara Mandal Road, Bh...",Gorakhpur,Uttar Pradesh,India,072218 60633,26.721717,83.389737,
7,MaaAmbey TradersPaints&Hardwearetc,"MaaAmbey TradersPaints&Hardwearetc, LIG2593FCI...",Gorakhpur,Uttar Pradesh,India,,26.802145,83.356531,


In [292]:
# City Name: Noida

# Define the request headers (a User-Agent string) and store them in a variable.
headers = {"User-Agent": "mozilla/5.0"}

# Store the URL of the page that holds the data.
url = 'https://rentechdigital.com/smartscraper/business-report-details/india/uttar-pradesh/Noida/paint-stores'

# Request the page with the headers and keep the response body as text.
rqd = requests.get(url, headers=headers).text

# Parse the HTML with 'pd.read_html' and take the first table on the page.
df = pd.read_html(rqd)[0]

# Store the table in a city-specific dataframe variable.
up9 = pd.DataFrame(df)

# Show the first 10 rows with head().
up9.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,A S ENTERPRISES,"A S ENTERPRISES, MAIN ROAD,SHOP.NO.2, SEC.70, ...",Noida,Uttar Pradesh,India,098997 83737,28.594515,77.388431,https://www.as-enterprises.com
1,Colors India,"Colors India, Plot No.CS-7, Shop No.3, Gyan Sa...",Noida,Uttar Pradesh,India,092788 16417,28.64359,77.349754,
2,Furkan Painter,"Furkan Painter, Jamia Nagar,Abul Fazi Enclave,...",Noida,Uttar Pradesh,India,095407 19751,28.545778,77.301269,
3,Shree Ram Paint & Hardware,"Shree Ram Paint & Hardware, 107, Kala Pathar R...",Noida,Uttar Pradesh,India,092121 57816,28.637308,77.359824,
4,Jagdamba Paints and Hardware,"Jagdamba Paints and Hardware, Sector 5, Vaisha...",Noida,Uttar Pradesh,India,,28.645939,77.347059,
5,Yash Paint Store,"Yash Paint Store, Shop No. 15, Kaling Marg, Se...",Noida,Uttar Pradesh,India,098189 58191,28.643846,77.341468,
6,R K General And Paint Store,"Shop No: 418, R K General And Paint Store, 5, ...",Noida,Uttar Pradesh,India,093113 66899,28.643375,77.344259,
7,Paras Paints & Hardware,"Paras Paints & Hardware, Shop No-2,Central Mar...",Noida,Uttar Pradesh,India,099105 18666,28.669431,77.331112,
8,balaji paints,"balaji paints, 16/180/1-2, Maharaja Agarsain M...",Noida,Uttar Pradesh,India,,28.65306,77.353823,
9,Balaji Paints,"Shop No, Balaji Paints, 901, Kaling Marg, GDA ...",Noida,Uttar Pradesh,India,098914 69639,28.644306,77.345228,
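
Every city cell above uses the same URL pattern, so the address can be built instead of pasted. The sketch below is only an illustration: `BASE_URL` is copied from the cells, while the helper name `paint_store_url` and its parameters are our own assumptions, not part of the site or of any library.

```python
from typing import Optional

# Common prefix of every listing URL used in this notebook.
BASE_URL = "https://rentechdigital.com/smartscraper/business-report-details/india"


def paint_store_url(state: str, city: Optional[str] = None) -> str:
    """Build a listing URL (hypothetical helper).

    `state` is the lowercase, hyphenated state slug as it appears in the
    notebook's URLs; `city` is optional because some pages are state-level.
    """
    parts = [BASE_URL, state]
    if city is not None:
        parts.append(city)
    parts.append("paint-stores")
    return "/".join(parts)


noida_url = paint_store_url("uttar-pradesh", "Noida")
state_url = paint_store_url("uttarakhand")
print(noida_url)
print(state_url)
```

The resulting string can be passed to `requests.get(url, headers=headers)` exactly as in the cells above.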


In [293]:
# To merge the dataframes and save the merged output as a variable, 'pd.concat' is the appropriate choice.
# Begin by creating a list of the dataframes, then use 'pd.concat' with 'ignore_index=True' to reset the index.
upd_data = [up1,up2,up3,up4,up5,up6,up7,up8,up9]
upd_data = pd.concat(upd_data,ignore_index=True)
upd_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Prince Paint Sanitary & Hardware Store,"Prince Paint Sanitary & Hardware Store, Lakhna...",Greater Noida,Uttar Pradesh,India,099719 01818,28.516884,77.487383,
1,Prince Paint Sanitary & Hardware Store,"Prince Paint Sanitary & Hardware Store, Side-C...",Greater Noida,Uttar Pradesh,India,099719 01818,28.5199,77.458724,
2,Rajnish Paint & Hardware,"Rajnish Paint & Hardware, Shop No. 1, Basement...",Greater Noida,Uttar Pradesh,India,099996 48511,28.5199,77.458724,
3,Nandani Paints & Hardware Store,"Nandani Paints & Hardware Store, Near Lakhnawa...",Greater Noida,Uttar Pradesh,India,099105 66504,28.513707,77.487934,
4,Rajnish Paints And Hardware,"Rajnish Paints And Hardware, Amritpuram, Block...",Greater Noida,Uttar Pradesh,India,097184 03338,28.479896,77.500687,
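
To see what 'ignore_index=True' changes, here is a small sketch on made-up frames (the store names are placeholders, not scraped data). Without the flag each frame keeps its own row labels, so labels repeat after the merge.

```python
import pandas as pd

# Two tiny stand-ins for per-city dataframes (placeholder values).
city_a = pd.DataFrame({"name": ["Store A1", "Store A2"], "city": ["Noida", "Noida"]})
city_b = pd.DataFrame({"name": ["Store B1"], "city": ["Gorakhpur"]})

kept = pd.concat([city_a, city_b])                      # original labels are kept
reset = pd.concat([city_a, city_b], ignore_index=True)  # labels are renumbered

print(kept.index.tolist())   # labels repeat across the two frames
print(reset.index.tolist())  # one continuous range
```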


#**Uttarakhand**

We have compiled data from diverse cities in Uttarakhand, and our next step is to merge it all together.

In [294]:
# City Name: Dehradun

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/uttarakhand/Dehradun/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in its own DataFrame variable for this city.
uk1=pd.DataFrame(df)

# Display the first 10 rows with head().
uk1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M/S Arun Paints,"M/S Arun Paints, Haridwar Bypass Road, Haridwa...",Dehradun,Uttarakhand,India,097590 75846,30.2928,78.051682,
1,M. K. Enterprises,"M. K. Enterprises, Haridwar Bypass Road, Harid...",Dehradun,Uttarakhand,India,099177 52698,30.293748,78.05334,
2,Bhatt Paints,"Bhatt Paints, Johri, Malsi, Dehradun, Uttarakh...",Dehradun,Uttarakhand,India,,30.376805,78.066777,
3,Sahib Paints,"Sahib Paints, Saharanpur Rd, Patel Nagar, Dehr...",Dehradun,Uttarakhand,India,099973 73777,30.30944,78.021054,
4,M/s. B.S. Thapa,"M/s. B.S. Thapa, Birpur Rd, Garhi Cantonment, ...",Dehradun,Uttarakhand,India,097192 66039,30.354913,78.022681,
5,M/S Vijay Kumar Sharma- Sheenlac Paints,"M/S Vijay Kumar Sharma- Sheenlac Paints, 93, C...",Dehradun,Uttarakhand,India,097595 76830,30.320526,78.032664,
6,Krishna COLORS FOR LIFE,"Krishna COLORS FOR LIFE, Chakarata Rd, Jhajra,...",Dehradun,Uttarakhand,India,,30.345212,77.906094,
7,Sudhir Paint Mart,"43, Sudhir Paint Mart, Old, Dispensary Rd, Har...",Dehradun,Uttarakhand,India,0135 265 5976,30.321718,78.040196,
8,Umar hardware,"Umar hardware, 1275, Indra Nagar Colony, Dehra...",Dehradun,Uttarakhand,India,,30.311215,77.995469,https://umar-hardware.business.site
9,Bhatia Sons,"Bhatia Sons, General Mahadev Singh Rd, Behind ...",Dehradun,Uttarakhand,India,098972 73041,30.307959,78.007042,


In [295]:
# City Names: Rāmnagar, Rāipur, and Birbhaddar

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# URL of the state-level page, which lists stores from several cities.
url='https://rentechdigital.com/smartscraper/business-report-details/india/uttarakhand/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in its own DataFrame variable.
uk2=pd.DataFrame(df)

# Display the first 10 rows with head().
uk2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Rawat Paints & Hardware Store,"Rawat Paints & Hardware Store, 244715, Chandra...",Rāmnagar,Uttarakhand,India,,29.337895,79.063994,
1,Angel Paint,"Angel Paint, Rajeshwar Nagar Phase-I, Aman Vih...",Rāipur,Uttarakhand,India,097608 03835,30.355397,78.083351,
2,Bhatia Sons,"Bhatia Sons, General Mahadev Singh Rd, Behind ...",Dehradun,Uttarakhand,India,098972 73041,30.307959,78.007042,
3,Umar hardware,"Umar hardware, 1275, Indra Nagar Colony, Dehra...",Dehradun,Uttarakhand,India,,30.311215,77.995469,https://umar-hardware.business.site
4,Shree Sai Ram Enterprises,"Shree Sai Ram Enterprises, Tapovan Ashram Rd, ...",Rāipur,Uttarakhand,India,,30.32231,78.076776,
5,Raj Hardware And Paints,"Raj Hardware And Paints, Tapowan Rd, Friends C...",Rāipur,Uttarakhand,India,099276 74509,30.326299,78.080149,
6,Shahi associate,"Shahi associate, Gali No-7, House No-2, 20 Big...",Birbhaddar,Uttarakhand,India,,30.077263,78.27547,
7,S.S.TRADERS,"S.S.TRADERS, Chini Godam Rd, Gumaniwala, Rishi...",Birbhaddar,Uttarakhand,India,,30.077435,78.246632,
8,Gaurav paints and decorators,"Gaurav paints and decorators, NH34, Geeta Naga...",Birbhaddar,Uttarakhand,India,,30.083652,78.261078,
9,RANA ENTERPRISES,"RANA ENTERPRISES, MALVIYA NAGAR NEAR DURGA MAN...",Birbhaddar,Uttarakhand,India,,30.077716,78.261857,


In [296]:
# Drop the rows at positions 2 and 3: they are Dehradun stores that are already present in uk1.
uk2.drop(index=uk2.index[2:4],axis=0,inplace=True)

In [297]:
# Check the data set
uk2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Rawat Paints & Hardware Store,"Rawat Paints & Hardware Store, 244715, Chandra...",Rāmnagar,Uttarakhand,India,,29.337895,79.063994,
1,Angel Paint,"Angel Paint, Rajeshwar Nagar Phase-I, Aman Vih...",Rāipur,Uttarakhand,India,097608 03835,30.355397,78.083351,
4,Shree Sai Ram Enterprises,"Shree Sai Ram Enterprises, Tapovan Ashram Rd, ...",Rāipur,Uttarakhand,India,,30.32231,78.076776,
5,Raj Hardware And Paints,"Raj Hardware And Paints, Tapowan Rd, Friends C...",Rāipur,Uttarakhand,India,099276 74509,30.326299,78.080149,
6,Shahi associate,"Shahi associate, Gali No-7, House No-2, 20 Big...",Birbhaddar,Uttarakhand,India,,30.077263,78.27547,
7,S.S.TRADERS,"S.S.TRADERS, Chini Godam Rd, Gumaniwala, Rishi...",Birbhaddar,Uttarakhand,India,,30.077435,78.246632,
8,Gaurav paints and decorators,"Gaurav paints and decorators, NH34, Geeta Naga...",Birbhaddar,Uttarakhand,India,,30.083652,78.261078,
9,RANA ENTERPRISES,"RANA ENTERPRISES, MALVIYA NAGAR NEAR DURGA MAN...",Birbhaddar,Uttarakhand,India,,30.077716,78.261857,
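
Dropping by position works here because we inspected the rows first, but it depends on the order in which the page returns them. As an alternative sketch (not what the notebook does), we could merge first and let pandas remove repeats. The frames below are placeholders, and using 'name' plus 'address' as the duplicate key is our assumption.

```python
import pandas as pd

# Placeholder stand-ins for a city page and a state-level page.
city_page = pd.DataFrame({
    "name": ["Store X", "Store Y"],
    "address": ["1 Main Rd", "2 Hill Rd"],
    "city": ["Dehradun", "Dehradun"],
})
state_page = pd.DataFrame({
    "name": ["Store Z", "Store Y"],
    "address": ["9 Lake Rd", "2 Hill Rd"],
    "city": ["Rāipur", "Dehradun"],
})

combined = pd.concat([city_page, state_page], ignore_index=True)

# Keep the first occurrence of each (name, address) pair and renumber the rows.
deduped = (
    combined.drop_duplicates(subset=["name", "address"], keep="first")
            .reset_index(drop=True)
)
print(deduped)
```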


In [298]:
# Utilizing 'pd.concat', we can concatenate all the dataframes and assign the result to a variable. Create a list that includes
# the dataframes and use 'pd.concat' while setting 'ignore_index=True' to reset the index.
ukd_data = [uk1,uk2]
ukd_data = pd.concat(ukd_data,ignore_index=True)
ukd_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,M/S Arun Paints,"M/S Arun Paints, Haridwar Bypass Road, Haridwa...",Dehradun,Uttarakhand,India,097590 75846,30.2928,78.051682,
1,M. K. Enterprises,"M. K. Enterprises, Haridwar Bypass Road, Harid...",Dehradun,Uttarakhand,India,099177 52698,30.293748,78.05334,
2,Bhatt Paints,"Bhatt Paints, Johri, Malsi, Dehradun, Uttarakh...",Dehradun,Uttarakhand,India,,30.376805,78.066777,
3,Sahib Paints,"Sahib Paints, Saharanpur Rd, Patel Nagar, Dehr...",Dehradun,Uttarakhand,India,099973 73777,30.30944,78.021054,
4,M/s. B.S. Thapa,"M/s. B.S. Thapa, Birpur Rd, Garhi Cantonment, ...",Dehradun,Uttarakhand,India,097192 66039,30.354913,78.022681,


#**West Bengal**

Having gathered data from different cities in West Bengal, we will proceed to merge and integrate it.

In [299]:
# City Name: Santoshpur

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/west-bengal/Santoshpur/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in its own DataFrame variable for this city.
wb1=pd.DataFrame(df)

# Display the first 10 rows with head().
wb1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Nu Unique Sales Agencies,"Nu Unique Sales Agencies, 18/5 Raja Rammohan R...",Santoshpur,West Bengal,India,,22.487124,88.313756,
1,Dawn Paint House,"Dawn Paint House, James Long Sarani, Jagadish ...",Santoshpur,West Bengal,India,,22.472861,88.312059,
2,South Calcutta Paints,"Shop No. 81/1, South Calcutta Paints, M.G. Rd,...",Santoshpur,West Bengal,India,090515 82339,22.486328,88.340014,
3,Bristi Colour Bank,"Bristi Colour Bank, 83, Charulatha Building, J...",Santoshpur,West Bengal,India,078908 57198,22.462901,88.30964,
4,Allied Resins and Chemical Limited,"Allied Resins and Chemical Limited, Paharpur, ...",Santoshpur,West Bengal,India,,22.521309,88.304263,
5,Adak Enterprise,"Adak Enterprise, 135/4, B. G. Press, Behala, K...",Santoshpur,West Bengal,India,098747 74269,22.495504,88.302668,https://business.google.com
6,Biswas Paints,"Biswas Paints, 700063 774, Diamond Harbour Rd,...",Santoshpur,West Bengal,India,,22.457036,88.305337,
7,Das Service,"Das Service, Shop No. 23, SN Roy Rd, Sahapur, ...",Santoshpur,West Bengal,India,,22.50686,88.325069,http://www.daskolkata.org
8,Elga Paints And Polymers,"Elga Paints And Polymers, 30d, D H Road, Parna...",Santoshpur,West Bengal,India,033 2406 2431,22.48628,88.312999,
9,New south prints,"New south prints, 277, M.G. Rd, Arobindo Pally...",Santoshpur,West Bengal,India,,22.476334,88.339033,


In [300]:
# City Name: Kolkata

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# URL of the page that lists paint stores for this city.
url='https://rentechdigital.com/smartscraper/business-report-details/india/west-bengal/Kolkata/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in its own DataFrame variable for this city.
wb2=pd.DataFrame(df)

# Display the first 10 rows with head().
wb2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,The Colour House,"The Colour House, 52/1A, Raja Ram Mohan Sarani...",Kolkata,West Bengal,India,094330 46854,22.578848,88.369772,
1,Gita Paints and Hardwares ( ÓªùÓºÇÓªñÓª¥ Óª¬Óº...,Gita Paints and Hardwares ( ÓªùÓºÇÓªñÓª¥ Óª¬Óº...,Kolkata,West Bengal,India,,22.554991,88.379876,
2,Sonu Paints,"Sonu Paints, 40, Park Street, Rafi Ahmed Kidwa...",Kolkata,West Bengal,India,,22.553766,88.356726,
3,Indo Paints,"No:37, Indo Paints, Muzaffar Ahmed St, Esplana...",Kolkata,West Bengal,India,033 2229 2656,22.553672,88.357438,
4,New Maa Manasha Paints,"New Maa Manasha Paints, 5/23, Rajendra Prasad ...",Kolkata,West Bengal,India,092316 39272,22.500365,88.357036,
5,Laxmi Hardware Stores & Paints,"Laxmi Hardware Stores & Paints, 152, Dr G S Bo...",Kolkata,West Bengal,India,098317 10071,22.52858,88.375652,http://laxmi-hardware-stores-paints.business.site
6,Hossain Paints Supply,"29, 44C, Hossain Paints Supply, 29, Samsul Hud...",Kolkata,West Bengal,India,097488 09322,22.536678,88.371232,
7,Premium Paints,"Premium Paints, 32, Jhowtalla Road, Circus Ave...",Kolkata,West Bengal,India,098301 60363,22.537888,88.366141,
8,Haque Paints,"Haque Paints, Ground, 85, Fazlul Haque Sarani,...",Kolkata,West Bengal,India,,22.54129,88.365519,
9,Dey's Hardware Stores,"61, Dey's Hardware Stores, 1, Bondel Rd, Bally...",Kolkata,West Bengal,India,092314 39781,22.529166,88.372134,


In [301]:
# Row 1 of the 'name' column carries garbled text inside parentheses. To remove it, we use '.apply' with a 'lambda'
# that splits each name on "(" and keeps the first element, index '[0]'. Names without "(" are left unchanged.

wb2['name']=wb2['name'].apply(lambda x: x.split("(")[0])

In [302]:
# The 'address' column repeats the same garbled text at the start of that row.
# We again use '.apply' with a 'lambda', this time splitting on ")," so that everything up to the closing parenthesis is cut off,
# and keep the last element, index '[-1]'. Addresses without ")," are left unchanged.

wb2['address']=wb2['address'].apply(lambda x: x.split("),")[-1])

In [303]:
# Check the cleaned data.
wb2.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,The Colour House,"The Colour House, 52/1A, Raja Ram Mohan Sarani...",Kolkata,West Bengal,India,094330 46854,22.578848,88.369772,
1,Gita Paints and Hardwares,"Pottery Rd, Seal Lane, Tangra, Kolkata, West ...",Kolkata,West Bengal,India,,22.554991,88.379876,
2,Sonu Paints,"Sonu Paints, 40, Park Street, Rafi Ahmed Kidwa...",Kolkata,West Bengal,India,,22.553766,88.356726,
3,Indo Paints,"No:37, Indo Paints, Muzaffar Ahmed St, Esplana...",Kolkata,West Bengal,India,033 2229 2656,22.553672,88.357438,
4,New Maa Manasha Paints,"New Maa Manasha Paints, 5/23, Rajendra Prasad ...",Kolkata,West Bengal,India,092316 39272,22.500365,88.357036,
5,Laxmi Hardware Stores & Paints,"Laxmi Hardware Stores & Paints, 152, Dr G S Bo...",Kolkata,West Bengal,India,098317 10071,22.52858,88.375652,http://laxmi-hardware-stores-paints.business.site
6,Hossain Paints Supply,"29, 44C, Hossain Paints Supply, 29, Samsul Hud...",Kolkata,West Bengal,India,097488 09322,22.536678,88.371232,
7,Premium Paints,"Premium Paints, 32, Jhowtalla Road, Circus Ave...",Kolkata,West Bengal,India,098301 60363,22.537888,88.366141,
8,Haque Paints,"Haque Paints, Ground, 85, Fazlul Haque Sarani,...",Kolkata,West Bengal,India,,22.54129,88.365519,
9,Dey's Hardware Stores,"61, Dey's Hardware Stores, 1, Bondel Rd, Bally...",Kolkata,West Bengal,India,092314 39781,22.529166,88.372134,
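
The two cleaning steps can also be written as small named functions that trim the whitespace the split leaves behind. This is a sketch on placeholder strings, not the scraped rows; `clean_name` and `clean_address` are hypothetical helper names.

```python
import pandas as pd


def clean_name(name: str) -> str:
    # Keep the text before the first "(" and trim the trailing space.
    return name.split("(")[0].strip()


def clean_address(address: str) -> str:
    # Keep the text after the last ")," and trim it.
    # If ")," does not occur, the address comes back unchanged (apart from trimming).
    return address.split("),")[-1].strip()


# Placeholder rows: one clean, one with bracketed noise.
stores = pd.DataFrame({
    "name": ["Plain Paints", "Gita Paints ( xx yy )"],
    "address": ["Plain Paints, 1 Park St", "Gita Paints ( xx yy ), 2 Pottery Rd"],
})

stores["name"] = stores["name"].map(clean_name)
stores["address"] = stores["address"].map(clean_address)
print(stores)
```

One caution: the name rule cuts every name at its first "(", so a legitimate bracketed suffix would be removed as well. That is why the notebook applies it only to the dataframe where the garbled text was seen.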


In [304]:
# City Names: Rāmnagar, Bali, and Kesabpur

# Set a browser-like User-Agent header for the request.
headers={"User-Agent":"mozilla/5.0"}

# URL of the state-level page, which lists stores from several cities.
url='https://rentechdigital.com/smartscraper/business-report-details/india/west-bengal/paint-stores'

# Fetch the page with the headers above and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML tables with 'pd.read_html' and take the first one.
df = pd.read_html(rqd)[0]

# Store the table in its own DataFrame variable.
wb3=pd.DataFrame(df)

# Display the first 10 rows with head().
wb3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,De Enterprise & paints,"De Enterprise & paints, Ugardahs, Baruipara, W...",Rāmnagar,West Bengal,India,,22.765042,88.246131,
1,"Mahanad Rangmahal Asian ,Berger and ICI Paints.","Mahanad Rangmahal Asian ,Berger and ICI Paints...",Kesabpur,West Bengal,India,094741 28238,23.00748,88.262515,
2,SHRI HARI RANG MOHAL,"SHRI HARI RANG MOHAL, Bora Hawakhana Ground, R...",Rāmnagar,West Bengal,India,,22.76078,88.274478,
3,Gobindapur Paints And Heardwere,"Gobindapur Paints And Heardwere, Diara, Kolkat...",Rāmnagar,West Bengal,India,,22.795507,88.278842,https://gobindapur-paints-and-heardwere.busine...
4,Barada Paint House,"Barada Paint House, Samabaypalli Saheb bagan b...",Bali,West Bengal,India,,22.6587,88.329962,
5,Berger and asian paints,"Berger and asian paints, 405, Grand Trunk Rd, ...",Bali,West Bengal,India,,22.649431,88.350651,
6,Paint Shop,"Paint Shop, 711227, Industrial Housing Estate,...",Bali,West Bengal,India,,22.640192,88.336932,
7,Prabir Colour Palace,"Prabir Colour Palace, Rd Number 31, Bara, Kama...",Rāmnagar,West Bengal,India,,22.761242,88.273008,
8,Sri Hari Rang Mahal,"Sri Hari Rang Mahal, Nr. Boro Baba Mandir, Bar...",Rāmnagar,West Bengal,India,,22.76142,88.273227,
9,New south prints,"New south prints, 277, M.G. Rd, Arobindo Pally...",Santoshpur,West Bengal,India,,22.476334,88.339033,


In [305]:
# Drop the last row: it is a Santoshpur store that is already present in wb1.
wb3.drop(index=wb3.index[-1],axis=0,inplace=True)

In [306]:
# Check the data set.
wb3.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,De Enterprise & paints,"De Enterprise & paints, Ugardahs, Baruipara, W...",Rāmnagar,West Bengal,India,,22.765042,88.246131,
1,"Mahanad Rangmahal Asian ,Berger and ICI Paints.","Mahanad Rangmahal Asian ,Berger and ICI Paints...",Kesabpur,West Bengal,India,094741 28238,23.00748,88.262515,
2,SHRI HARI RANG MOHAL,"SHRI HARI RANG MOHAL, Bora Hawakhana Ground, R...",Rāmnagar,West Bengal,India,,22.76078,88.274478,
3,Gobindapur Paints And Heardwere,"Gobindapur Paints And Heardwere, Diara, Kolkat...",Rāmnagar,West Bengal,India,,22.795507,88.278842,https://gobindapur-paints-and-heardwere.busine...
4,Barada Paint House,"Barada Paint House, Samabaypalli Saheb bagan b...",Bali,West Bengal,India,,22.6587,88.329962,
5,Berger and asian paints,"Berger and asian paints, 405, Grand Trunk Rd, ...",Bali,West Bengal,India,,22.649431,88.350651,
6,Paint Shop,"Paint Shop, 711227, Industrial Housing Estate,...",Bali,West Bengal,India,,22.640192,88.336932,
7,Prabir Colour Palace,"Prabir Colour Palace, Rd Number 31, Bara, Kama...",Rāmnagar,West Bengal,India,,22.761242,88.273008,
8,Sri Hari Rang Mahal,"Sri Hari Rang Mahal, Nr. Boro Baba Mandir, Bar...",Rāmnagar,West Bengal,India,,22.76142,88.273227,
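
Another way to remove rows that belong to a city we have already scraped is to filter on the 'city' column instead of on row position. The frame and the `already_scraped` list below are placeholders for illustration only.

```python
import pandas as pd

# Placeholder stand-in for a state-level page.
state_page = pd.DataFrame({
    "name": ["Store P", "Store Q", "Store R"],
    "city": ["Bali", "Santoshpur", "Kesabpur"],
})

# Cities that already have their own dataframe (assumed list).
already_scraped = ["Santoshpur", "Kolkata"]

# Keep only rows whose city is NOT in the list, then renumber the rows.
new_rows = state_page[~state_page["city"].isin(already_scraped)].reset_index(drop=True)
print(new_rows)
```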


In [307]:
# The 'pd.concat' function enables the concatenation of dataframes and saving the combined output into a variable.
# Compile the dataframes into a list, then apply 'pd.concat' with 'ignore_index=True' to reset the index.
wbl_data = [wb1,wb2,wb3]
wbl_data = pd.concat(wbl_data,ignore_index=True)
wbl_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Nu Unique Sales Agencies,"Nu Unique Sales Agencies, 18/5 Raja Rammohan R...",Santoshpur,West Bengal,India,,22.487124,88.313756,
1,Dawn Paint House,"Dawn Paint House, James Long Sarani, Jagadish ...",Santoshpur,West Bengal,India,,22.472861,88.312059,
2,South Calcutta Paints,"Shop No. 81/1, South Calcutta Paints, M.G. Rd,...",Santoshpur,West Bengal,India,090515 82339,22.486328,88.340014,
3,Bristi Colour Bank,"Bristi Colour Bank, 83, Charulatha Building, J...",Santoshpur,West Bengal,India,078908 57198,22.462901,88.30964,
4,Allied Resins and Chemical Limited,"Allied Resins and Chemical Limited, Paharpur, ...",Santoshpur,West Bengal,India,,22.521309,88.304263,


#**Chandigarh**

Chandigarh is covered by a single page, so there is nothing to merge. We load that page and assign the result directly to the state-level variable.

In [308]:
# City Name: Chandigarh

# After import all the libraries we need to headers and save it variable.
headers={"User-Agent":"mozilla/5.0"}

# Now we need to save the url in a variable from where the data will download.
url='https://rentechdigital.com/smartscraper/business-report-details/india/chandigarh/paint-stores'

# Now use requests library to get the url with headers and convert into text and save in a variable.
rqd = requests.get(url,headers=headers).text

# Then use 'panda.read' to read the data in html.
df = pd.read_html(rqd)[0]

# Now save the data in to a variable after convert into a dataframe with the help of 'pd.DataFrame'.
cn1=pd.DataFrame(df)

# We used head() function to see the data first 10 rows.
cn1.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Ravinder Traders,"Ravinder Traders, Plot No.153A ,Sector 42B, Ja...",Chandigarh,Chandigarh,India,0172 261 2619,30.726363,76.751098,
1,Ply Home,"Ply Home, S.C.O. 315, Sector 38-D, Chandigarh,...",Chandigarh,Chandigarh,India,0172 500 5440,30.74634,76.743509,
2,Gulzar Paints,"Gulzar Paints, SCF 10, Sector 18-C, Sector 18,...",Chandigarh,Chandigarh,India,0172 508 7903,30.731899,76.789039,
3,Suzuki Paints,"Suzuki Paints, Shop No. 1809, opposite Dev Sam...",Chandigarh,Chandigarh,India,092166 21009,30.707609,76.763289,
4,Naresh Enterprises,"Shop no. 134, Naresh Enterprises, Sector 41D, ...",Chandigarh,Chandigarh,India,098884 08060,30.733023,76.735164,
5,Ganga Painting Works,"Ganga Painting Works, ganga printing, 23 SC-28...",Chandigarh,Chandigarh,India,098144 05823,30.718075,76.804335,
6,R S Enterprises,"R S Enterprises, 89, Press Area, Industrial Ar...",Chandigarh,Chandigarh,India,0172 265 2330,30.700073,76.804561,
7,KARNAL PAINTS & HARDWARE,"KARNAL PAINTS & HARDWARE, Booth No.35 , R1 , C...",Chandigarh,Chandigarh,India,,30.815856,76.723872,
8,Gupta Interiors| Wallpaper store| Texture Pain...,Gupta Interiors| Wallpaper store| Texture Pain...,Chandigarh,Chandigarh,India,098767 66685,30.706687,76.76662,https://guptainteriorswallpaper.business.site
9,SHIV SHAKTI ENTERPRISES(ASIAN PAINTS),"SHIV SHAKTI ENTERPRISES(ASIAN PAINTS), Booth n...",Chandigarh,Chandigarh,India,,30.809985,76.727697,


In [309]:
chn_data = cn1

In [310]:
chn_data.head()

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,Ravinder Traders,"Ravinder Traders, Plot No.153A ,Sector 42B, Ja...",Chandigarh,Chandigarh,India,0172 261 2619,30.726363,76.751098,
1,Ply Home,"Ply Home, S.C.O. 315, Sector 38-D, Chandigarh,...",Chandigarh,Chandigarh,India,0172 500 5440,30.74634,76.743509,
2,Gulzar Paints,"Gulzar Paints, SCF 10, Sector 18-C, Sector 18,...",Chandigarh,Chandigarh,India,0172 508 7903,30.731899,76.789039,
3,Suzuki Paints,"Suzuki Paints, Shop No. 1809, opposite Dev Sam...",Chandigarh,Chandigarh,India,092166 21009,30.707609,76.763289,
4,Naresh Enterprises,"Shop no. 134, Naresh Enterprises, Sector 41D, ...",Chandigarh,Chandigarh,India,098884 08060,30.733023,76.735164,


Now we merge all the state-level data sets into a single variable and export it to a CSV file.

In [311]:
# Collect the state-level dataframes in a list.
psd=[ap_data,asm_data,bih_data,chg_data,guj_data,hya_data,jhk_data,ktk_data,krl_data,mpd_data,mhs_data,
     odi_data,pnb_data,rjt_data,tnd_data,tlg_data,upd_data,ukd_data,wbl_data,chn_data]

In [312]:
# Concatenate all the state dataframes and store the result in a variable. 'ignore_index=True' resets the index.
psd = pd.concat(psd,ignore_index=True)
psd.head(10)

Unnamed: 0,name,address,city,State/Territory,country,phone,latitude,longitude,url
0,K K Paints,"K K Paints, 15/496/1 SURYA NAGAR 80 FEET ROAD ...",Anantapur,Andhra Pradesh,India,093966 87672,14.678225,77.604274,
1,Partha paints and hardware,"Partha paints and hardware, Rajamma Nagar, Bha...",Anantapur,Andhra Pradesh,India,,14.681712,77.611705,
2,Venkatasai Paintings Hardwares,"Venkatasai Paintings Hardwares, D No.12-5-402,...",Anantapur,Andhra Pradesh,India,099852 79850,14.665382,77.601675,
3,Elahi Enterprises,"Elahi Enterprises, opp:kamma bhavan, Main Rd, ...",Anantapur,Andhra Pradesh,India,070137 52806,14.668793,77.58282,
4,Sudhir paints,"Sudhir paints, Ganesha Nagar, Pilligundla, Ana...",Anantapur,Andhra Pradesh,India,,14.683689,77.576337,
5,Hanuman Paints And Hardwares,"Hanuman Paints And Hardwares, Rangaswamy Nagar...",Anantapur,Andhra Pradesh,India,,14.690774,77.595991,
6,Asian Paints,"Asian Paints, Kamalanagar, Anantapur, Andhra P...",Anantapur,Andhra Pradesh,India,,14.68248,77.596748,https://www.asianpaints.com
7,Brundhavan traders,"Brundhavan traders, Tirupati Rd, Sangameswar C...",Anantapur,Andhra Pradesh,India,,14.67355,77.606613,
8,Asian Paints,"Asian Paints, Maruthi Nagar, Anantapur, Andhra...",Anantapur,Andhra Pradesh,India,,14.669713,77.588092,https://www.asianpaints.com
9,S.K. Paints,"6, S.K. Paints, 71, Bellary - Uravakonda - Ana...",Anantapur,Andhra Pradesh,India,099666 81848,14.687115,77.582064,


In [313]:
# For safety, take a backup copy of the data set before modifying it.
pdf=psd.copy(deep=True)
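
A short sketch of why '.copy(deep=True)' is used for the backup: a plain assignment such as `chn_data = cn1` only creates a second name for the same dataframe, while a deep copy has its own data. The values below are placeholders.

```python
import pandas as pd

original = pd.DataFrame({"name": ["Store A"], "phone": ["000"]})

alias = original                    # second name for the same object
backup = original.copy(deep=True)   # independent copy of the data

# Change the original after the backup was taken.
original.loc[0, "phone"] = "111"

print(alias.loc[0, "phone"])   # follows the change
print(backup.loc[0, "phone"])  # still holds the old value
```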

In [314]:
# Check the number of rows and columns.
psd.shape

(765, 9)

In [315]:
# Drop the latitude and longitude columns, which are not needed in the final file.
psd.drop(columns=['latitude','longitude'],axis=1,inplace=True)

In [316]:
# Rename the columns as required.
psd.rename(columns={'name':'Name','address':'Address','city':'City','country':'Country','phone':'Phone','url':'Web Url'},inplace=True)
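
The drop and rename steps can also be chained without 'inplace=True', which leaves the source dataframe untouched. This is a sketch on a one-row placeholder frame, not a replacement for the cells above.

```python
import pandas as pd

# One-row placeholder with a subset of the scraped columns.
raw = pd.DataFrame({
    "name": ["Store A"],
    "city": ["Noida"],
    "latitude": [28.59],
    "longitude": [77.38],
    "url": [None],
})

# Each method returns a new dataframe, so the calls can be chained.
tidy = (
    raw.drop(columns=["latitude", "longitude"])
       .rename(columns={"name": "Name", "city": "City", "url": "Web Url"})
)

print(list(tidy.columns))
print(list(raw.columns))  # the source frame still has all its columns
```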

In [317]:
# Just to check the data set.
psd.head(10)

Unnamed: 0,Name,Address,City,State/Territory,Country,Phone,Web Url
0,K K Paints,"K K Paints, 15/496/1 SURYA NAGAR 80 FEET ROAD ...",Anantapur,Andhra Pradesh,India,093966 87672,
1,Partha paints and hardware,"Partha paints and hardware, Rajamma Nagar, Bha...",Anantapur,Andhra Pradesh,India,,
2,Venkatasai Paintings Hardwares,"Venkatasai Paintings Hardwares, D No.12-5-402,...",Anantapur,Andhra Pradesh,India,099852 79850,
3,Elahi Enterprises,"Elahi Enterprises, opp:kamma bhavan, Main Rd, ...",Anantapur,Andhra Pradesh,India,070137 52806,
4,Sudhir paints,"Sudhir paints, Ganesha Nagar, Pilligundla, Ana...",Anantapur,Andhra Pradesh,India,,
5,Hanuman Paints And Hardwares,"Hanuman Paints And Hardwares, Rangaswamy Nagar...",Anantapur,Andhra Pradesh,India,,
6,Asian Paints,"Asian Paints, Kamalanagar, Anantapur, Andhra P...",Anantapur,Andhra Pradesh,India,,https://www.asianpaints.com
7,Brundhavan traders,"Brundhavan traders, Tirupati Rd, Sangameswar C...",Anantapur,Andhra Pradesh,India,,
8,Asian Paints,"Asian Paints, Maruthi Nagar, Anantapur, Andhra...",Anantapur,Andhra Pradesh,India,,https://www.asianpaints.com
9,S.K. Paints,"6, S.K. Paints, 71, Bellary - Uravakonda - Ana...",Anantapur,Andhra Pradesh,India,099666 81848,


In [318]:
# Count the null values in each column of the data set.
psd.isna().sum()

Name                 0
Address              0
City                 0
State/Territory      0
Country              0
Phone              293
Web Url            675
dtype: int64
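
With 765 rows, these counts mean that about 38.3% of stores have no phone number and about 88.2% have no web URL. The sketch below shows one way to get counts and percentages together, on a small placeholder frame.

```python
import pandas as pd

# Placeholder frame with some missing values.
sample = pd.DataFrame({
    "Name": ["A", "B", "C", "D"],
    "Phone": ["011", None, None, "022"],
    "Web Url": [None, None, None, "https://example.com"],
})

missing = sample.isna().sum()                    # nulls per column
share = sample.isna().mean().mul(100).round(1)   # nulls per column, in percent

summary = pd.DataFrame({"missing": missing, "percent": share})
print(summary)
```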

In [319]:
# Now replace every 'NaN' with the string 'NA', using the 'replace' function with 'np.nan'.
psd = psd.replace(np.nan,'NA',regex=True)
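
An equivalent and shorter way to fill missing values is 'fillna'. The sketch below also shows a point to keep in mind for anyone who loads the exported file later: by default 'pd.read_csv' treats the text "NA" as missing again, unless 'keep_default_na=False' is passed. The frame is a placeholder, and an in-memory buffer stands in for the CSV file.

```python
import io

import pandas as pd

stores = pd.DataFrame({"Name": ["A", "B"], "Phone": ["011", None]})

# Replace missing values with the literal string 'NA'.
filled = stores.fillna("NA")

# Write to an in-memory buffer instead of a file on disk.
buffer = io.StringIO()
filled.to_csv(buffer, index=False)
csv_text = buffer.getvalue()

# Default parsing turns "NA" back into a missing value.
default_read = pd.read_csv(io.StringIO(csv_text), dtype=str)

# keep_default_na=False keeps "NA" as ordinary text.
literal_read = pd.read_csv(io.StringIO(csv_text), dtype=str, keep_default_na=False)

print(default_read["Phone"].isna().tolist())
print(literal_read["Phone"].tolist())
```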

In [320]:
# Check the last 10 rows of the data set.
psd.tail(10)

Unnamed: 0,Name,Address,City,State/Territory,Country,Phone,Web Url
755,Ravinder Traders,"Ravinder Traders, Plot No.153A ,Sector 42B, Ja...",Chandigarh,Chandigarh,India,0172 261 2619,
756,Ply Home,"Ply Home, S.C.O. 315, Sector 38-D, Chandigarh,...",Chandigarh,Chandigarh,India,0172 500 5440,
757,Gulzar Paints,"Gulzar Paints, SCF 10, Sector 18-C, Sector 18,...",Chandigarh,Chandigarh,India,0172 508 7903,
758,Suzuki Paints,"Suzuki Paints, Shop No. 1809, opposite Dev Sam...",Chandigarh,Chandigarh,India,092166 21009,
759,Naresh Enterprises,"Shop no. 134, Naresh Enterprises, Sector 41D, ...",Chandigarh,Chandigarh,India,098884 08060,
760,Ganga Painting Works,"Ganga Painting Works, ganga printing, 23 SC-28...",Chandigarh,Chandigarh,India,098144 05823,
761,R S Enterprises,"R S Enterprises, 89, Press Area, Industrial Ar...",Chandigarh,Chandigarh,India,0172 265 2330,
762,KARNAL PAINTS & HARDWARE,"KARNAL PAINTS & HARDWARE, Booth No.35 , R1 , C...",Chandigarh,Chandigarh,India,,
763,Gupta Interiors| Wallpaper store| Texture Pain...,Gupta Interiors| Wallpaper store| Texture Pain...,Chandigarh,Chandigarh,India,098767 66685,https://guptainteriorswallpaper.business.site
764,SHIV SHAKTI ENTERPRISES(ASIAN PAINTS),"SHIV SHAKTI ENTERPRISES(ASIAN PAINTS), Booth n...",Chandigarh,Chandigarh,India,,


In [321]:
# Check the data type of each column in the data set.
psd.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 765 entries, 0 to 764
Data columns (total 7 columns):
 #   Column           Non-Null Count  Dtype 
---  ------           --------------  ----- 
 0   Name             765 non-null    object
 1   Address          765 non-null    object
 2   City             765 non-null    object
 3   State/Territory  765 non-null    object
 4   Country          765 non-null    object
 5   Phone            765 non-null    object
 6   Web Url          765 non-null    object
dtypes: object(7)
memory usage: 42.0+ KB


In [323]:
# Save into a csv file.
psd.to_csv('Paint Store Details.csv',index=False,header=True)