Linear regression Module Review #907

dsbuddy · 2024-03-21T15:16:46Z

No description provided.

rosemm

I put a bunch of comments and suggestions throughout, let me know if anything is unclear or if you'd like further explanation on anything!

rosemm · 2024-04-02T13:17:36Z

python_linear_regression/python_linear_regression.md

+***
+<div class = "answer">
+
+This question is more difficult than the previous one because it requires the test-taker to have a deeper understanding of the characteristics of linear regression. The test-taker must be able to identify which of the answer choices is not a characteristic of linear regression, even though all of the other answer choices are valid characteristics.


One general tip: The quiz questions and answers should be designed for learners to work through independently (not for an instructor to administer, for example). So the follow-up text for a quiz question should do things like provide more context about the correct answer, explain why the other options are incorrect, etc., all with the learner in mind as the audience. It seems like you've written your quiz follow-ups more as notes for an instructor explaining the rationale behind the question.

rosemm · 2024-04-02T13:22:18Z

python_linear_regression/python_linear_regression.md

+## What is linear regression?
+- Linear regression is a supervised machine learning algorithm that learns to predict a continuous target variable based on one or more predictor variables. Linear regression models the relationship between the target variable and the predictor variables using a linear equation.
+- In the case of linear regression, the target variable is a continuous variable. In a supervised learning problem, the machine learning algorithm is given a set of training data and asked to learn a function that can map the input variables to the output variable. The training data consists of pairs of input and output variables. The algorithm learns the function by finding the best fit line to the data. Once the algorithm has learned the function, it can be used to make predictions on new data. To make a prediction, the algorithm simply plugs the values of the input variables into the function.
+- Linear regression is a popular supervised learning algorithm because it is simple to implement and understand. It is also a versatile algorithm that can be used to solve a variety of problems.


I recommend avoiding telling learners a topic is simple, or easy to understand -- it risks making them feel inadequate if they don't feel like it's clicking right away.

Suggested change

-- Linear regression is a popular supervised learning algorithm because it is simple to implement and understand. It is also a versatile algorithm that can be used to solve a variety of problems.
+- Linear regression is a popular supervised learning algorithm because it is computationally simple (even if it's not always simple to interpret!). It is also a versatile algorithm that can be used to solve a variety of problems.

I saw you resolved this without changing anything; was that just a mistake, or do you feel strongly about keeping this language?
By the by, here's a handy resource on this topic (we should probably reference that in our authoring guidelines!)

rosemm · 2024-04-02T13:24:15Z

python_linear_regression/python_linear_regression.md

+-   **Predicting customer churn:**  Linear regression can be used to predict whether a customer is likely to churn based on their past purchase history, demographics, and other factors.
+-   **Predicting the risk of a customer defaulting on a loan:**  Linear regression can be used to predict the risk of a customer defaulting on a loan based on their credit score, income, and other factors.
+-   **Predicting the likelihood of a patient having a particular disease:**  Linear regression can be used to predict the likelihood of a patient having a particular disease based on their medical history, symptoms, and other factors.


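These three are all probably examples of logistic regression rather than linear regression per se, since each one predicts a yes/no outcome (churn or not, default or not, disease or not) rather than a continuous value.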
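
To make the distinction concrete, here is a minimal sketch in plain Python (no scikit-learn). The data and the names `risk_score` and `churned` are made up for illustration and are not from the module. It fits a straight line to a 0/1 outcome and shows that the line's predictions fall outside the 0 to 1 range, which is why they can't be read as probabilities.

```python
import math


def fit_line(xs, ys):
    """Ordinary least squares with one predictor; returns (slope, intercept)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def logistic(z):
    """Logistic (sigmoid) function: maps any real number into (0, 1)."""
    return 1 / (1 + math.exp(-z))


# Hypothetical data: a made-up risk score and whether the customer churned (1) or not (0)
risk_score = [1, 2, 3, 4, 5, 6, 7, 8]
churned = [0, 0, 0, 0, 1, 1, 1, 1]

slope, intercept = fit_line(risk_score, churned)
linear_predictions = [slope * x + intercept for x in risk_score]

# The first prediction is below 0 and the last is above 1,
# so these values are not valid probabilities.
print([round(p, 2) for p in linear_predictions])

# Logistic regression instead passes a linear score through the logistic
# function, which always returns a value strictly between 0 and 1.
print([round(logistic(z), 3) for z in (-5, 0, 5)])
```

This only illustrates why a binary outcome is a poor fit for a straight line; it does not fit a logistic regression model.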

rosemm · 2024-04-02T16:49:19Z

python_linear_regression/python_linear_regression.md

+***
+
+
+### Applications of linear regression in machine learning


I really like the impulse here to show concrete examples and highlight real-world use cases, but I'm concerned that the specific examples here won't necessarily be relevant to our learners.
I think you could make this section much stronger by replacing this list (and the similar one in the next section) with a much shorter but more targeted list of linear regression applications in biomedical research. It would be ideal to find actual published studies using linear regression and link to those.
I know this is a big ask -- I'm happy to help try to find appropriate examples for this!

rosemm · 2024-04-02T16:55:04Z

python_linear_regression/python_linear_regression.md

+
+### Python Implementation of Linear Regression
+
+To implement linear regression in Python using Scikit-learn, we can follow these steps:


I think this is an excellent example, and it would be great to break it up a little more, especially steps 3-5. I think each of the steps here could probably be its own subsection, with a header, and the explanation you're currently providing via comments could be moved out into regular text accompanying each code chunk.

rosemm · 2024-04-02T16:59:36Z

python_linear_regression/python_linear_regression.md

+data.info()
+```
+
+3.  Split the data into training and testing sets:


I love the inclusion of this, and I'm thinking this could be something learners may be encountering for the first time here -- cross-validation and other machine learning techniques are not currently part of the prereqs for this module. There are a few different possible ways to approach this, but I think one thing that may work well is to have a new section before this example that explains, at a high level, some of the implementation steps you then use in this example. I'd recommend short explanations (especially of why we do each one) of splitting data into training and test sets, recoding categorical predictors, scaling continuous predictors, and evaluating model predictions (i.e., what is MSE, conceptually?).
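
As a rough illustration of two of those ideas (holding out a test set, and MSE as the average squared prediction error), here is a minimal sketch in plain Python. The data is randomly generated and the helper names `fit_line` and `mse` are hypothetical; none of this is taken from the module. In scikit-learn the equivalent steps would use `train_test_split` and `mean_squared_error`. Recoding and scaling are not covered here.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares with one predictor; returns (slope, intercept)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mse(actual, predicted):
    """Mean squared error: the average of the squared prediction errors."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


# Made-up data: y is roughly 2 * x + 1, plus random noise
rng = random.Random(0)
xs = [float(i) for i in range(50)]
ys = [2.0 * x + 1.0 + rng.gauss(0, 3) for x in xs]

# Shuffle the row indices, then hold out 20% of the rows as a test set
indices = list(range(len(xs)))
rng.shuffle(indices)
cut = int(0.8 * len(indices))
train_idx, test_idx = indices[:cut], indices[cut:]

x_train = [xs[i] for i in train_idx]
y_train = [ys[i] for i in train_idx]
x_test = [xs[i] for i in test_idx]
y_test = [ys[i] for i in test_idx]

# Fit on the training rows only
slope, intercept = fit_line(x_train, y_train)

# Evaluate on rows the model never saw during fitting
test_predictions = [slope * x + intercept for x in x_test]
test_mse = mse(y_test, test_predictions)

print(f"slope={slope:.2f}, intercept={intercept:.2f}, test MSE={test_mse:.2f}")
```

The point for learners is the "why": the test rows stand in for new data, so the test MSE estimates how far off predictions will be on data the model has not seen.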

rosemm · 2024-04-02T17:05:07Z

python_linear_regression/python_linear_regression.md

+print(diabetes)
+print(diabetes.DESCR)
+
+# Now we will split the data into the independent and independent variable


Suggested change

-# Now we will split the data into the independent and independent variable
+# Now we will split the data into the independent and dependent variable

rosemm · 2024-04-02T17:06:48Z

python_linear_regression/python_linear_regression.md

+
+
+
+### Real World Code Example


I think this is great, and I think you could just do this example or the one above; we probably don't need both.

rosemm · 2024-04-02T17:07:57Z

python_linear_regression/python_linear_regression.md

+
+## Conclusion
+
+At the end of the lesson, students should have a good understanding of the concept of linear regression and how to implement the linear regression algorithm in Python. They should also be able to apply linear regression to real-world datasets to make predictions and insights.


Like all the module text, this should be written with learners as the audience, not instructors.

rosemm · 2024-04-24T16:56:54Z

python_linear_regression/python_linear_regression.md

+-   Understand the concept of linear regression and its applications in machine learning
+-   Learn how to implement the linear regression algorithm in Python


These are both pretty big topics, actually, and I'm wondering now if it might be worthwhile splitting this module into two separate modules: "Intro to Linear Regression for Machine Learning", and "Linear Regression in Python".
That will allow you to focus more attention on actually teaching the python code, which would be great. The prereqs don't list specific experience with scikit learn or anything like that, so we want to write this with a learner in mind who has some python experience but has maybe never done machine learning before. Linear regression is a really natural place to start with modeling, so I love the idea of this module being something someone could work through as their first attempt at machine learning in python.
I'll start a new branch for the intro to linear regression module, and we can keep this one for the python piece.

rosemm · 2024-04-24T18:29:15Z

Hi @dsbuddy ! I took another look and added in some more comments and suggestions, the biggest of which is that we split this into two separate modules (see my comment above). That will give you a lot more space to teach the python piece of it more thoroughly, which I think will be really valuable. I'll start a new branch now for the "intro to regression for ML" module (edit: I did! https://github.com/arcus/education_modules/tree/intro_regression_ml ), and you can update the draft on this linear_regression branch here to just focus on teaching the method in python.
My advice is to read through your "Python Implementation of Linear Regression" section thinking about how a learner would work through that content (Which terms might be new to them? What questions might they have as they go through the code? Which pieces might feel confusing to someone who's never used scikit learn before?). You may also find it helpful to go back to what you have listed as the prereqs for this module and imagine a learner who is brand new to this topic but does (just barely) meet the prereqs as defined. Keep this imaginary learner in mind as you read through -- if the module would go over that person's head, then we need to adjust the module, the prereqs, or both.
As you realize which bits need more explanation and/or links to relevant resources (some learn-more, options, or help boxes might be appropriate!), start filling those in. I think you'll end up wanting to break that section into multiple subsections as you add more explanation, so feel free to put in subheaders as appropriate, too.

rosemm · 2024-04-24T21:15:41Z

FYI: Here is the PR with the new "Intro to Linear Regression for Machine Learning" module: #923

rosemm · 2024-05-03T16:55:58Z

Hi @dsbuddy let me know if you have any questions!

dsbuddy added 6 commits November 1, 2023 13:14

initial draft linear regression module

57d90da

Added data for code exercise

72a1a87

Added python code exercise to module

2d5f77a

Added real world example for python linear regression

5954c8c

Added real-world example to linear regression

0fa3e71

Changed real world data to continuous diabetes example

0130958

dsbuddy requested a review from rosemm March 21, 2024 15:16

rosemm reviewed Apr 2, 2024

dsbuddy added 2 commits April 18, 2024 11:43

Updated module given Rose's comments in PR

a8bc950

Updated answers to quiz questions to be more contextualized

c3d0c93

rosemm reviewed Apr 24, 2024

rosemm added the Quality Assurance label Apr 24, 2024

dsbuddy added 2 commits May 27, 2024 15:34

Updated python implementation of linear regression

74e2707

Updated changes based off Elizabeth's comments

2a1dc6a


