# **Model Building**
We'll split the data into training and testing sets using `train_test_split`, but first let's convert the 'Country' feature into a form the model can accept.

In [None]:
ad_data.columns

We can't use the 'Country' feature directly because it holds categorical strings, so we need another way to feed it into the model.

One option is to drop the feature, but then we risk losing useful information.

Instead, we can convert the categorical feature into [dummy variables](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.get_dummies.html) using pandas.

In [None]:
countries = pd.get_dummies(ad_data['Country'],drop_first=True)
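
To make the effect of `drop_first=True` concrete, here is a minimal sketch on a made-up three-country column (the `toy` frame and its values are hypothetical, not taken from the ad dataset). The first category in sorted order is dropped and becomes the implicit baseline, so a row with all dummy columns off belongs to that baseline country.

```python
import pandas as pd

# Hypothetical toy data, used only to illustrate the encoding.
toy = pd.DataFrame({"Country": ["Peru", "Chile", "Peru", "Kenya"]})

# Categories are ordered alphabetically: Chile, Kenya, Peru.
# drop_first=True omits "Chile", which becomes the baseline category.
dummies = pd.get_dummies(toy["Country"], drop_first=True)

dummy_columns = list(dummies.columns)
print(dummy_columns)
print(dummies)
```

Dropping one level avoids encoding the same information twice, since the omitted category can always be inferred from the remaining columns.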


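Concatenate the dummy variables with the original dataset, then drop the original 'Country' column along with the other string and timestamp features ('Ad Topic Line', 'City', 'Timestamp'), which the model can't use as-is.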

In [None]:
ad_data = pd.concat([ad_data,countries],axis=1)
ad_data.drop(['Country','Ad Topic Line','City','Timestamp'],axis=1,inplace=True)

Splitting the dataset.

In [None]:
X = ad_data.drop('Clicked on Ad',axis=1)
y = ad_data['Clicked on Ad']

In [None]:
from sklearn.model_selection import train_test_split

In [None]:
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=101)
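
As a quick check of what `test_size` and `random_state` do, the sketch below splits a small made-up frame (the `demo_X` and `demo_y` names and values are hypothetical). With 10 rows and `test_size=0.3`, 3 rows go to the test set and 7 to the training set, and repeating the call with the same `random_state` reproduces the same split.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy features and labels, 10 rows in total.
demo_X = pd.DataFrame({"feature": range(10)})
demo_y = pd.Series([0, 1] * 5, name="label")

# First split with a fixed seed.
a_train, a_test, ya_train, ya_test = train_test_split(
    demo_X, demo_y, test_size=0.3, random_state=101
)

# Second split with the same seed, to show the result is reproducible.
b_train, b_test, yb_train, yb_test = train_test_split(
    demo_X, demo_y, test_size=0.3, random_state=101
)

print(a_train.shape, a_test.shape)
print(list(a_test.index) == list(b_test.index))
```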


Training the model.

In [None]:
from sklearn.linear_model import LogisticRegression
logclf = LogisticRegression()

In [None]:
# Fit on the training split only, so the test split stays unseen for evaluation.
logclf.fit(X_train, y_train)
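
A held-out test set is only useful if the model is fit on the training portion alone and then scored on the rest. The following is a minimal, self-contained sketch of that pattern on synthetic data from `make_classification`; the `demo_` names, the synthetic dataset, and the `max_iter=1000` setting are assumptions for illustration and are not part of this notebook's ad dataset.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (assumption): 200 samples, 5 numeric features.
demo_X, demo_y = make_classification(
    n_samples=200, n_features=5, random_state=101
)

demo_X_train, demo_X_test, demo_y_train, demo_y_test = train_test_split(
    demo_X, demo_y, test_size=0.3, random_state=101
)

# Fit on the training portion only.
demo_clf = LogisticRegression(max_iter=1000)
demo_clf.fit(demo_X_train, demo_y_train)

# Predict on rows the model has never seen and measure accuracy.
demo_pred = demo_clf.predict(demo_X_test)
demo_accuracy = accuracy_score(demo_y_test, demo_pred)
print(f"Test accuracy: {demo_accuracy:.3f}")
```

The same two calls, `predict` on `X_test` followed by a metric against `y_test`, would apply to `logclf` once it has been fit on `X_train` and `y_train`.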