Data Science Utilities from Microsoft
This repository contains a set of Data Science Utilities developed for use in the context of the Team Data Science Process (TDSP).
This is an early preview release of the Data Science Utilities for TDSP. We are continuously improving the utilities based on our accumulated experience and customer requests. Stay tuned for future releases.
Currently, the Data Science Utilities released in this repository include:
- Interactive Data Exploration, Analysis, and Reporting (IDEAR) in R.
- Interactive Data Exploration, Analysis, and Reporting (IDEAR) in Microsoft R Server (MRS).
- Interactive Data Exploration, Analysis, and Reporting (IDEAR) in Python.
- Automated Modeling and Reporting in R (AMAR in R).
You can run these utilities on the sample data in the Data/Common directory. If you are using an Azure Data Science Virtual Machine, all of the utilities are ready to run.
We would love to hear from you. If you have questions or suggestions, or want to report a bug, please create an issue at TDSP/Issues.
Help to Enrich the Set of Utilities
We believe that, with the help of the data science community, this set of utilities can become significantly richer and more powerful, and can benefit more enterprises and data scientists. We warmly welcome your contributions to the Data Science Utilities for TDSP.
What Is TDSP
To learn more about TDSP, check out the TDSP documents here.
In the documentation, screenshots of RStudio are from the Open Source Edition.