Skip to content

microsoft/sqlworkshops-pythonfordatapros

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Lab: Python Basics for Data Professionals

A Microsoft Course from the SQL Server team

About this lab
Business Applications of this lab
Technologies used in this lab
Before Taking this lab
lab Details
Related labs
Lab Modules
Next Steps

NOTE: This course is in active re-development. The course files are complete, and located here, but this page is currently being worked on.

Welcome to this Microsoft solutions lab on the architecture on Python Basics for the Data Professional. In this lab, you'll learn basic Python structures, programming and data flow. You'll get resources to go much further in your learning journey, but this short lab will get you up and running quickly.

The focus of this lab is to familiarize the database professional in the basics of Python, while implementing it in SQL Server Stored Procedures using SQL Server's Machine Learning Services. After this basic introduction, the professional can move on to more in-depth training in Python if desired.

You'll start by setting up your system to work with Python, then move to understanding the course itself. From there, you will move though programming basics, working with data, and then on to understanding the concepts of Python environments and how to deploy Python code.

This github README.MD file explains how the workshop is laid out, what you will learn, and the technologies you will use in this solution. To download this Lab to your local computer, click the Clone or Download button you see at the top right side of this page. More about that process is here.

You can view all of the courses and other labs our team has created at this link - open in a new tab to find out more.

Learning Objectives

In this lab you'll learn:

  • How to set up a Python environment for SQL Server using Machine Learning Services
  • The Basics of programming in Python including code syntax, getting help, variables, operators, and functions
  • Working with data structures, and understanding popular data libraries
  • Data Ingestion and access
  • Machine Learning in Python
  • Environments and code deployment

The goal of this lab is to familiarize the data professional with Python environments and programming.

The concepts and skills taught in this lab form the starting points for:

  • Data professionals that wish to include Python code in their data access and programming
  • Security professionals who wish to understand how to securely implement secure Python coding practices
  • Anyone interested in learning more about programming with Python and databases

Businesses require the ability to securely access their data for many workloads, including various programming languages. Python (along with the R language) has merged as a powerful tool for data ingestion, processing and analysis. Previously, Python programmers accessed various databases and retrieved data over a network connection like any application, but this often means pulling large amounts of data over a potentially insecure network to bring multiple copies to each developer to work with locally. The SQL Server Machine Learning Services feature allows Python code to run inside a Stored Procedure in SQL Server, which then accesses data directly. This also allows the Python developer to create code locally, and then send that code on to the Database Administrator for installation on the server - the developer never has to touch the production server or data.

This couse explains how to work with Python, and then how to operationalize the code on a SQL Server.

The solution includes the following technologies - although you are not limited to these, they form the basis of the lab. At the end of the lab you will learn how to extrapolate these components into other solutions. You will cover these at an overview level, with references to much deeper training provided.

Technology Description
Python*An Open-Source, multiple paradigm coding language with extensible packages
Microsoft SQL Server*A complete data platform, including a Relational Database Management System (RDBMS), Data Pipeline, Business Intelligence, Graph Database Processing, and other constructs to work securely with multiple forms of data, including structured, semi-structured and unstructured.

You'll need a local system that you are able to install software on. The lab demonstrations use Microsoft Windows as an operating system and all examples use Windows for the lab. Optionally, you can use a Microsoft Azure Virtual Machine (VM) to install the software on and work with the solution.

This lab expects that you understand data structures and working with SQL Server and computer networks. This lab does not expect you to have any prior data science knowledge, but a basic knowledge of programming and statistics is helpful.

If you are new to these, here are a few references you can complete prior to class:

Setup

A full prerequisites document is located here. These instructions should be completed before the lab starts, since you will not have time to cover these in class. Remember to turn off any Virtual Machines from the Azure Portal when not taking the class so that you do incur charges (shutting down the machine in the VM itself is not sufficient).

This lab uses the Microsoft Windows operating system, although Linux is also supported once you have completed the exercises.

Primary Audience:Data Professionals tasked with implementing Big Data, Machine Learning and AI solutions
Secondary Audience: Security Architects and Developers
Level: 300
Type:In-Person
Length: 8-9 hours

This is a modular lab, and in each section, you'll learn concepts, technologies and processes to help you complete the solution.

ModuleTopics
01 - Overview and Course Setup In this Module you will cover and overview of the Python language and set up your system for the course.
02 - Programming Basics This Module covers the commands and procedures for getting help in Python, code syntax and structure, variables, and operators and functions.
03 - Working with Data In this Module you will learn more about data types, ingestion, inpsection, and graphing, with a brief introduction to Data Science with Python.
04 - Environments and Deployment In this Module you will learn more about Python environments such as Conda, and how to deploy your code using the "pickle" library.

Next Steps

Next, Continue to 00 - Prerequisites

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Legal Notices

License

Microsoft and any contributors grant you a license to the Microsoft documentation and other content in this repository under the Creative Commons Attribution 4.0 International Public License, see the LICENSE file, and grant you a license to any code in the repository under the MIT License, see the LICENSE-CODE file.

Microsoft, Windows, Microsoft Azure and/or other Microsoft products and services referenced in the documentation may be either trademarks or registered trademarks of Microsoft in the United States and/or other countries. The licenses for this project do not grant you rights to use any Microsoft names, logos, or trademarks. Microsoft's general trademark guidelines can be found at http://go.microsoft.com/fwlink/?LinkID=254653.

Privacy information can be found at https://privacy.microsoft.com/en-us/

Microsoft and any contributors reserve all other rights, whether under their respective copyrights, patents, or trademarks, whether by implication, estoppel or otherwise.

About

Python Basics for the Data Professional

Resources

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published