Tobacco use has long been a significant public health concern in India, affecting the lives of millions of individuals. The National Family Health Survey (NFHS) provides valuable insights into the prevalence and patterns of tobacco use among Indians, offering a comprehensive understanding of this pressing issue.

The NFHS, conducted periodically by the Government of India, collects data on various health indicators, including tobacco consumption. The survey covers a wide range of demographic, social, and economic factors, providing a reliable and representative picture of tobacco use across different regions of the country.

In this exploration, we delve into the findings of the NFHS to gain deeper insights into tobacco use among Indians. We examine key statistics related to tobacco prevalence, age of initiation, gender differences, and socio-economic factors influencing tobacco consumption.

The data from the NFHS allows us to analyze the trends and changes in tobacco use over time, identifying patterns and areas of concern. By understanding the specific demographics and regions where tobacco use is most prevalent, we can develop targeted interventions and policies to address the issue effectively.
Additionally, we explore the health consequences associated with tobacco use, including an increased risk of various diseases, disability, and premature death. Through the NFHS data, we can highlight the impact of tobacco use on the overall well-being of individuals, families, and communities across India.

Furthermore, we examine the initiatives and interventions that have been implemented to combat tobacco use in light of the NFHS findings. Government-led campaigns, tobacco control programs, and awareness initiatives play a crucial role in reducing tobacco consumption and promoting healthier lifestyles.
By utilizing the valuable insights provided by the NFHS data, we aim to raise awareness about the prevalence and consequences of tobacco use among Indians. Together, we can work towards developing evidence-based strategies, policies, and interventions that empower individuals, promote healthier choices, and create a tobacco-free future for the nation.

In [None]:
# Install the required libraries.
!pip install requests
!pip install bs4



In [None]:
# Import the required libraries: requests, pandas, os, BeautifulSoup, csv and NumPy.
import requests
import pandas as pd
import os
from bs4 import BeautifulSoup as bs
import csv
import numpy as np

In [None]:
# After importing the libraries, define the request headers and store them in a variable.
headers={"User-Agent":"mozilla/5.0"}

# Store the URL of the page that holds the data.
url='https://www.findeasy.in/tobacco-use-among-adults-in-india/'

# Fetch the page with requests, passing the headers, and keep the response body as text.
rqd = requests.get(url,headers=headers).text

# Parse the HTML with pd.read_html, which returns a list of tables, and take the first one.
dm = pd.read_html(rqd)[0]

# Store the table in a DataFrame named sm with the help of pd.DataFrame.
sm=pd.DataFrame(dm)

# Use head() to view the first 10 rows.
sm.head(10)

Unnamed: 0,0,1,2,3
0,#,States & Union Territories,Women (15 years & above),Men (15 years & above)
1,1,Andaman & Nicobar,31.3 %,58.7 %
2,2,AP,3.8%,22.6%
3,3,Arunachal,18.8%,50.3%
4,4,Assam,22.1%,51.8%
5,5,Bihar,5.0%,48.8%
6,6,Chandigarh,0.6%,12.1%
7,7,Chattishgarh,17.3%,43.1%
8,8,D&N,2.9%,38.6%
9,9,Delhi,2.2%,26.3%


In [None]:
# Show the last 10 rows of the DataFrame with tail().
sm.tail(10)

Unnamed: 0,0,1,2,3
28,28.0,Punjab,0.4%,12.9%
29,29.0,Rajasthan,6.9%,42.0%
30,30.0,Sikkim,11.7%,41.3%
31,31.0,Tamil Nadu,4.9%,20.1%
32,32.0,Telangana,5.6%,22.3%
33,33.0,Tripura,50.4%,56.9%
34,34.0,Uttar Pradesh,8.4%,44.1%
35,35.0,Uttarakhand,4.6%,33.7%
36,36.0,West Bengal,10.8%,48.1%
37,,India,8.9%,38.0%


In [None]:
# The real header sits in the first row, so assign iloc[0] to the column names.
sm.columns=sm.iloc[0]

In [None]:
# Remove the unwanted rows: the national 'India' summary row (index 37) and the old header row (index 0).
sm.drop(index=sm.index[37],axis=0,inplace=True)
sm.drop(index=sm.index[0],axis=0,inplace=True)

In [None]:
# Also remove the unwanted '#' column from the data set.
sm.drop(['#'],axis=1,inplace=True)

In [None]:
# Rename the columns. Build a dict that maps each current column name (looked up by position) to its new name, then pass it to rename().
sd={sm.columns[0]:'State',sm.columns[1]:'Women',sm.columns[2]:'Men'}
sm=sm.rename(columns=sd)

In [None]:
# Inspect the cleaned data set.
sm.head(40)

Unnamed: 0,State,Women,Men
1,Andaman & Nicobar,31.3 %,58.7 %
2,AP,3.8%,22.6%
3,Arunachal,18.8%,50.3%
4,Assam,22.1%,51.8%
5,Bihar,5.0%,48.8%
6,Chandigarh,0.6%,12.1%
7,Chattishgarh,17.3%,43.1%
8,D&N,2.9%,38.6%
9,Delhi,2.2%,26.3%
10,Goa,2.6%,18.2%


In [None]:
# Dropping rows left gaps in the index, so reset it with reset_index().
sm=sm.reset_index(drop=True)

In [None]:
# Check the data in the DataFrame.
sm.head()

Unnamed: 0,State,Women,Men
0,Andaman & Nicobar,31.3 %,58.7 %
1,AP,3.8%,22.6%
2,Arunachal,18.8%,50.3%
3,Assam,22.1%,51.8%
4,Bihar,5.0%,48.8%
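
The clean-up steps so far (promote the first row to the header, drop the header and summary rows, drop the `#` column, rename, reset the index) can be checked offline on a small hand-made frame. This is only a sketch: `raw`, `tidy` and `demo` are hypothetical names, the three data rows are copied from the table above, and the summary row is assumed to be the last row rather than being addressed by its index 37 as in the notebook.

```python
import pandas as pd

# Hypothetical miniature of the scraped table: the real header sits in row 0
# and the national summary row sits at the end.
raw = pd.DataFrame([
    ["#", "States & Union Territories", "Women (15 years & above)", "Men (15 years & above)"],
    ["1", "Goa", "2.6%", "18.2%"],
    ["2", "Punjab", "0.4%", "12.9%"],
    [None, "India", "8.9%", "38.0%"],
])

def tidy(frame):
    """Apply the same clean-up steps as the notebook to a copy of `frame`."""
    out = frame.copy()
    out.columns = out.iloc[0]                             # promote first row to header
    out = out.drop(index=[out.index[0], out.index[-1]])   # drop header row and summary row
    out = out.drop(columns=["#"])                         # drop the serial-number column
    out.columns = ["State", "Women", "Men"]               # short, explicit column names
    return out.reset_index(drop=True)                     # renumber rows from 0

demo = tidy(raw)
print(demo)
```

Wrapping the steps in a function that works on a copy avoids the `inplace=True` mutations, so the cell can be re-run without dropping further rows by accident.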


In [None]:
# Check how many rows and columns the data set has.
sm.shape

(36, 3)

In [None]:
# Check the data type of each column.
sm.dtypes

0
State    object
Women    object
Men      object
dtype: object
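
Both percentage columns are stored as `object` (strings such as `31.3 %` and `3.8%`), so they cannot be averaged or sorted numerically as they stand. The notebook does not convert them and exports the strings unchanged; the following is a minimal sketch of one way the conversion could be done, shown on a two-row copy of the data. The names `demo` and `percent_to_float` are hypothetical.

```python
import pandas as pd

# Two rows copied from the table above, including the inconsistent "31.3 %" spacing.
demo = pd.DataFrame({
    "State": ["Andaman & Nicobar", "AP"],
    "Women": ["31.3 %", "3.8%"],
    "Men": ["58.7 %", "22.6%"],
})

def percent_to_float(col):
    """Turn strings like '31.3 %' or '3.8%' into floats (31.3, 3.8)."""
    # Remove the literal percent sign, then trim any leftover whitespace.
    cleaned = col.str.replace("%", "", regex=False).str.strip()
    # Values that still fail to parse become NaN instead of raising.
    return pd.to_numeric(cleaned, errors="coerce")

for name in ["Women", "Men"]:
    demo[name] = percent_to_float(demo[name])

print(demo.dtypes)
```

Using `errors="coerce"` means a malformed cell shows up later as a missing value in `isna().sum()` rather than stopping the run.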

In [None]:
# Use info() to see a fuller summary of the data.
sm.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 36 entries, 0 to 35
Data columns (total 3 columns):
 #   Column  Non-Null Count  Dtype 
---  ------  --------------  ----- 
 0   State   36 non-null     object
 1   Women   36 non-null     object
 2   Men     36 non-null     object
dtypes: object(3)
memory usage: 992.0+ bytes


In [None]:
# Check for null values in the data set.
sm.isna().sum()

0
State    0
Women    0
Men      0
dtype: int64

In [None]:
# Export the data in CSV format.
sm.to_csv('Tobacco Consumption India NFHS.csv',index=False,header=True)
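
As a quick sanity check on the export step, the file can be read back and compared with what was written. The sketch below does this on a small hypothetical frame (`demo`, two rows copied from the table above) and writes to a temporary directory, so it leaves the real CSV untouched; the file name `demo.csv` is made up for the example.

```python
import os
import tempfile

import pandas as pd

# Hypothetical two-row frame in the same shape as the cleaned data set.
demo = pd.DataFrame({
    "State": ["Goa", "Punjab"],
    "Women": ["2.6%", "0.4%"],
    "Men": ["18.2%", "12.9%"],
})

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "demo.csv")
    # Same options as the notebook: no index column, header row included.
    demo.to_csv(path, index=False, header=True)
    # Read the file back while the temporary directory still exists.
    restored = pd.read_csv(path)

print(restored)
```

Because `index=False` is used on export, the restored frame gets a fresh 0-based index and no stray `Unnamed: 0` column.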