Data-Science-with-Ml

Data Science and Machine Learning

PLATFORM USED : DATABRICKS

LIBRARIES USED :

1)Pandas

•Pandas was used to read and organize the data since data representation by pandas is suitable for data analysis.

•Pandas makes it easier to represent the data and helps in performing operations on individual columns when required to filter the data.

•To manage memory pandas helps ease the process to convert a column from one data type to another and also helps in filling up empty parts of the data .

•Coding in pandas is similar to python and helps in learning and implementing things quickly.

2)Spark

•Spark supports various languages like python , scala, java , R and sql.

•Spark supports lots of different

•Spark is supported on lots of platforms and operations performed on Apache Spark are very fast compared to mapreduce.

•The Ml lib library of Spark helps in using various machine learning models like Linear Regression, Logistic Regression, Decision trees and K means Clustering.

3)Mat plotlib

•The matplot lib library was used to represent data in pictorial form.

•Data can be represented in histograms , bar graphs or pi charts etc using this library.

•It is easier to use with the pandas library.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
Kaggle		Kaggle
Regression		Regression
Country gdp.ipynb		Country gdp.ipynb
Customers Prediction.ipynb		Customers Prediction.ipynb
DecisionTrees.ipynb		DecisionTrees.ipynb
LICENSE		LICENSE
Machine project.ipynb		Machine project.ipynb
README.md		README.md
comapny_growth.ipynb		comapny_growth.ipynb
first.ipynb		first.ipynb
realGDPanalysis.ipynb		realGDPanalysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kaggle

Kaggle

Regression

Regression

Country gdp.ipynb

Country gdp.ipynb

Customers Prediction.ipynb

Customers Prediction.ipynb

DecisionTrees.ipynb

DecisionTrees.ipynb

LICENSE

LICENSE

Machine project.ipynb

Machine project.ipynb

README.md

README.md

comapny_growth.ipynb

comapny_growth.ipynb

first.ipynb

first.ipynb

realGDPanalysis.ipynb

realGDPanalysis.ipynb

Repository files navigation

Data-Science-with-Ml

Data Science and Machine Learning

PLATFORM USED : DATABRICKS

LIBRARIES USED :

1)Pandas

2)Spark

3)Mat plotlib

About

Releases

Packages

Languages

License

DiptoChakrabarty/Data-Science-with-Ml

Folders and files

Latest commit

History

Repository files navigation

Data-Science-with-Ml

Data Science and Machine Learning

PLATFORM USED : DATABRICKS

LIBRARIES USED :

1)Pandas

2)Spark

3)Mat plotlib

About

Resources

License

Stars

Watchers

Forks

Languages