In [None]:
print("""Q1. In order to predict house price based on several characteristics, such as location, square footage,
number of bedrooms, etc., you are developing an SVM regression model. Which regression metric in this
situation would be the best to employ?""")
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.svm import SVR
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
import numpy as np

# Load the dataset
url = "https://drive.google.com/uc?id=1Z9oLpmt6IDRNw7IeNcHYTGeJRYypRSC0"
df = pd.read_csv(url)

# Inspect the dataset
print(df.head())

# Drop rows with missing values
df.dropna(inplace=True)

# Inspect data types
print(df.dtypes)

# Convert non-numeric columns to numeric using one-hot encoding
df = pd.get_dummies(df, drop_first=True)

# Separate features and target variable
X = df.drop('price', axis=1)  # Assuming 'price' is the target column
y = df['price']

# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

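# Train the model. SVR is sensitive to feature scale, so standardize the
# features inside a pipeline (the scaler is fitted on the training data only).
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

svm_regressor = make_pipeline(StandardScaler(), SVR(kernel='rbf'))
svm_regressor.fit(X_train, y_train)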

# Predict on the test set
y_pred = svm_regressor.predict(X_test)

# Calculate regression metrics
mae = mean_absolute_error(y_test, y_pred)
mse = mean_squared_error(y_test, y_pred)
rmse = np.sqrt(mse)
r2 = r2_score(y_test, y_pred)

print(f"Mean Absolute Error (MAE): {mae}")
print(f"Mean Squared Error (MSE): {mse}")
print(f"Root Mean Squared Error (RMSE): {rmse}")
print(f"R-squared (R²): {r2}")



In [7]:
print("""Q2. You have built an SVM regression model and are trying to decide between using MSE or R-squared as
your evaluation metric. Which metric would be more appropriate if your goal is to predict the actual price
of a house as accurately as possible?

Ans. If the goal is to predict the actual price of a house as accurately as possible, Mean Squared Error (MSE) is the more appropriate evaluation metric. MSE measures the average squared difference between predicted and actual prices, so it reports prediction error directly, whereas R-squared only describes the share of the price variance that the model explains. Because MSE is expressed in squared price units, its square root (RMSE) is often reported alongside it to state the error in the same units as the price.""")

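A minimal sketch of the difference between the two metrics, using made-up prices in thousands (the numbers and the helper names `mse` and `r2` are illustrative assumptions, not values from the dataset above). Both sets of houses receive exactly the same prediction errors, but one set has a much wider price spread. MSE depends only on the errors, while R-squared also depends on how much the prices vary.

```python
def mse(y_true, y_pred):
    """Average squared difference between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """1 minus (residual sum of squares / total sum of squares)."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Hypothetical prices (in thousands) and identical prediction errors
errors = [10, -10, 10, -10]
narrow_prices = [200, 210, 220, 230]   # small spread
wide_prices = [100, 300, 500, 700]     # large spread

narrow_pred = [t + e for t, e in zip(narrow_prices, errors)]
wide_pred = [t + e for t, e in zip(wide_prices, errors)]

mse_narrow = mse(narrow_prices, narrow_pred)
mse_wide = mse(wide_prices, wide_pred)
r2_narrow = r2(narrow_prices, narrow_pred)
r2_wide = r2(wide_prices, wide_pred)

print(f"MSE: narrow={mse_narrow}, wide={mse_wide}")          # same error size
print(f"R2:  narrow={r2_narrow:.3f}, wide={r2_wide:.3f}")    # very different
```

The prediction error is the same in both cases, and only MSE reflects that. R-squared changes because the spread of the prices changed, not because the predictions got better or worse.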


In [8]:
print("""Q3. You have a dataset with a significant number of outliers and are trying to select an appropriate
regression metric to use with your SVM model. Which metric would be the most appropriate in this
scenario?

Ans. When a dataset contains a significant number of outliers, Mean Absolute Error (MAE) is the most appropriate regression metric to use with the SVM model. MAE averages the absolute differences between predicted and actual values, so each error contributes in proportion to its size. MSE and RMSE square the errors first, which lets a few very large errors dominate the score. MAE is therefore the more robust choice when outliers are present in the dataset.""")

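The effect of a single outlier can be sketched with a few made-up prices (all numbers and the helper names `mae` and `rmse` are illustrative assumptions). The same predictions are scored twice: once against ordinary prices, and once with the last house replaced by an unusually expensive one that the model misses badly.

```python
import math


def mae(y_true, y_pred):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Square root of the average squared difference."""
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)


# Hypothetical predictions and actual prices (in thousands)
y_pred = [210, 240, 310, 340, 410]
y_clean = [200, 250, 300, 350, 400]     # every error is 10
y_outlier = [200, 250, 300, 350, 900]   # last house is an outlier (error 490)

mae_clean, rmse_clean = mae(y_clean, y_pred), rmse(y_clean, y_pred)
mae_outlier, rmse_outlier = mae(y_outlier, y_pred), rmse(y_outlier, y_pred)

print(f"Clean:   MAE={mae_clean:.1f}, RMSE={rmse_clean:.1f}")
print(f"Outlier: MAE={mae_outlier:.1f}, RMSE={rmse_outlier:.1f}")
```

With one outlier, RMSE rises to roughly twice the MAE, because squaring makes the single large error dominate the average.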


In [9]:
print("""Q4. You have built an SVM regression model using a polynomial kernel and are trying to select the best
metric to evaluate its performance. You have calculated both MSE and RMSE and found that both values
are very close. Which metric should you choose to use in this case?

Ans. MSE and RMSE rank models identically, because RMSE is simply the square root of MSE, so either metric can be used to evaluate the SVM regression model with a polynomial kernel. The two values can only be close when the MSE is near 1 (or near 0). MSE is the average squared difference between predicted and actual values and is expressed in squared units of the target. RMSE is expressed in the same units as the target variable, which makes the scale of the errors easier to read. Either metric works here; RMSE is usually the more convenient one to report.""")

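A short sketch of the relationship between the two metrics. The MSE values and the model names in `model_mse` are hypothetical and chosen only for illustration. It shows when MSE and RMSE are numerically close, and that sorting models by either metric gives the same order.

```python
import math

# RMSE is the square root of MSE, so the two are only close near 1 (or 0)
for mse_value in [0.25, 0.9, 1.0, 1.1, 100.0]:
    rmse_value = math.sqrt(mse_value)
    print(f"MSE={mse_value:<6} RMSE={rmse_value:.3f} gap={abs(mse_value - rmse_value):.3f}")

# Hypothetical test-set MSE for three polynomial-kernel models
model_mse = {"poly_deg2": 1.21, "poly_deg3": 0.81, "poly_deg4": 1.00}

# The square root is monotonic, so both metrics produce the same ranking
rank_by_mse = sorted(model_mse, key=lambda name: model_mse[name])
rank_by_rmse = sorted(model_mse, key=lambda name: math.sqrt(model_mse[name]))

print("Ranking by MSE: ", rank_by_mse)
print("Ranking by RMSE:", rank_by_rmse)
```

Because the ranking never differs, the choice between the two comes down to which units are easier to communicate.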


In [10]:
print("""Q5. You are comparing the performance of different SVM regression models using different kernels (linear,
polynomial, and RBF) and are trying to select the best evaluation metric. Which metric would be most
appropriate if your goal is to measure how well the model explains the variance in the target variable?

Ans. If the goal is to measure how well the SVM regression model explains the variance in the target variable, the most appropriate evaluation metric is R-squared (R²). R² is the proportion of the variance in the dependent variable (target) that is predictable from the independent variables (features) in the model. A value of 1 indicates a perfect fit, and 0 means the model does no better than always predicting the mean of the target; on held-out data R² can even be negative when the model does worse than that baseline. Because R² is unitless, it is well suited to comparing the explanatory power of SVM regression models with different kernels (linear, polynomial, and RBF).""")

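A minimal sketch of how R-squared behaves when comparing models. The predictions below are made-up numbers labelled with kernel names for illustration only; they are not results from fitted SVR models. The helper `r2` spells out the standard formula by hand, and the three cases show a good fit, a model that only predicts the mean, and a model that does worse than the mean.

```python
def r2(y_true, y_pred):
    """R-squared: 1 minus (residual sum of squares / total sum of squares)."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


y_true = [100, 200, 300, 400]

# Hypothetical predictions, one list per kernel (illustrative values only)
predictions = {
    "linear": [120, 190, 310, 380],   # close to the true values
    "poly": [250, 250, 250, 250],     # always predicts the mean
    "rbf": [400, 300, 200, 100],      # worse than predicting the mean
}

scores = {kernel: r2(y_true, y_pred) for kernel, y_pred in predictions.items()}
best_kernel = max(scores, key=scores.get)

for kernel, score in scores.items():
    print(f"{kernel:>6}: R2 = {score:.2f}")
print("Best kernel by R2:", best_kernel)
```

A score near 1 means most of the variance is explained, a score of 0 matches the mean-only baseline, and a negative score means the model is worse than that baseline.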
