GitHub

DRUG_CLASSIFICATION

GOAL

This project aims to classify drugs based on patient characteristics such as age, sex, blood pressure (BP), cholesterol level, and sodium-to-potassium ratio (Na_to_K). We employ various machine learning algorithms and techniques to achieve accurate drug classification.

DATASET

The dataset used for this project is available in CSV format with 200 rows and 6 columns The dataset used in this project contains the following columns:

Age: Age of the patient.
Sex: Gender of the patient.
BP: Blood pressure level.
Cholesterol: Cholesterol level.
Na_to_K: Sodium-to-potassium ratio.
Drug: The target variable representing the prescribed drug.

INTEL DEVELOPERS CLOUD SERVICE ENV

Here , we used IDC service ; That is Intel® Max Series GPU (PVC) on 4th Gen Intel® Xeon® processors - 1100 series (4x) (Batch Processing/Scheduled access) to make our project to run in pytorch_xpu Environment.

Intel oneAPI Data Analytics Library (oneDAL)

Intel® oneAPI Data Analytics Library (oneDAL) is a library that helps speed up big data analysis by providing highly optimized algorithmic building blocks for all stages of data analytics (preprocessing, transformation, analysis, modeling, validation, and decision making) in batch, online, and distributed processing modes of computation.

ML ALGORITHMS

We have implemented and evaluated the following machine learning algorithms for drug classification:

(1)KNN (2)Random Forest (3)Logistic Regression (4)SVM (5)Naives Bayes (6)Adaboost (7)Voting Classifier (8)Multinomial Logistic Regression

LIBRARIES USED

Pandas: for data analysis Numpy: for data analysis Matplotlib: for data visualization Seaborn: for data visualization Scikit-learn: for data analysis Intel oneDAL: for enhanced performance and efficiency

DISTRIBUTIONS

GRID SEARCH

"Grid search" is a hyperparameter optimization technique used in machine learning to systematically search for the best combination of hyperparameter values for a given model have been used. Hyperparameters are parameters that are not learned from the data but are set prior to training a machine learning model.

ACCURACIES

RESULTS

We have evaluated each algorithm's performance and included the results in the Jupyter Notebook files , which we run through IDC Environment and also by the Grid Search we got our best tuning parameters.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
#ONEAPI.png		#ONEAPI.png
BESTMODEL.ipynb		BESTMODEL.ipynb
IDC.jpg		IDC.jpg
LICENSE		LICENSE
NOTONLYONEDAL.png		NOTONLYONEDAL.png
ONEAPI2.png		ONEAPI2.png
ONEAPI3.jpg		ONEAPI3.jpg
ONEDAL1.png		ONEDAL1.png
README.md		README.md
SCAP.jpg		SCAP.jpg
SKLEARNEX.png		SKLEARNEX.png
oneapi.png		oneapi.png
onedal.png		onedal.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DRUG_CLASSIFICATION

GOAL

DATASET

INTEL DEVELOPERS CLOUD SERVICE ENV

Intel oneAPI Data Analytics Library (oneDAL)

ML ALGORITHMS

LIBRARIES USED

DISTRIBUTIONS

GRID SEARCH

ACCURACIES

RESULTS

About

Releases

Packages

Languages

License

harishraaghavdv/DRUG_CLASSIFICATION

Folders and files

Latest commit

History

Repository files navigation

DRUG_CLASSIFICATION

GOAL

DATASET

INTEL DEVELOPERS CLOUD SERVICE ENV

Intel oneAPI Data Analytics Library (oneDAL)

ML ALGORITHMS

LIBRARIES USED

DISTRIBUTIONS

GRID SEARCH

ACCURACIES

RESULTS

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages