Basic data science project on proving true mean/median and calculating an accurate dividend growth rate.

TeaZea/SAS-Analysis_DividendGrowthRate

SAS Analysis - Dividend Growth Rate

TL;DR

This is a basic data science project on proving true mean/median and calculating an accurate dividend growth rate. It is my first data science project, albeit one built on a relatively easy concept. I noticed that a few individual stocks were giving the impression that the portfolio was performing better than average, so I ran this analysis to determine whether that was actually the case.

This README covers my experience with the project and gives a bit more micro-level insight into the code. The PDF covers the analysis itself.


Setup

Simply download both the .sas file and the dataset, then upload them into SAS Studio.


Dataset

The dataset is fairly simple and was entered manually. There is currently no pipeline in place to collect this data; manual entry is good enough because the portfolio is small and requires minimal work. More about the dataset and key variables can be found in the analysis PDF.

dataset used for analysis


Overview of the code

I cleaned up the data a bit and added a growth average variable (column) by using a DO loop to iterate through the data (rows). This is the variable the analysis is conducted on.

Code for creating a new variable
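
For readers who cannot see the screenshot, here is a minimal sketch of how a growth average column could be built with a DO loop. It is not the project's actual code: the dataset name, the ticker and div2020-div2023 columns, and the sample values are all hypothetical, and it assumes one row per stock with one dividend column per year.

```sas
/* Hypothetical input: one row per stock, one dividend-per-share column per year. */
data work.dividends;
    input ticker $ div2020 div2021 div2022 div2023;
    datalines;
AAA 1.00 1.05 1.10 1.16
BBB 2.00 2.02 2.10 2.12
CCC 0.50 0.60 0.75 0.90
;
run;

data work.dividends_growth;
    set work.dividends;
    /* Group the yearly columns so the DO loop can walk across them. */
    array payout{4} div2020-div2023;
    growth_sum = 0;
    do i = 2 to dim(payout);
        /* Year-over-year growth in percent; assumes the prior year is non-zero. */
        growth_sum = growth_sum + (payout{i} - payout{i-1}) / payout{i-1} * 100;
    end;
    /* Average of the (number of years - 1) yearly growth rates. */
    growth_avg = growth_sum / (dim(payout) - 1);
    drop i growth_sum;
run;

proc print data=work.dividends_growth;
run;
```

The DATA step already processes the rows one at a time, so in this sketch the DO loop only has to move across the yearly columns within each row.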

The analysis examines how the data are distributed. MU and SIGMA are kept at their default values, so the fitted normal curve uses the sample mean and standard deviation. This is important because I want to check whether the data follow a normal distribution. The reference line for the box plot is set to the median (2.08) rather than the mean, because the data are too volatile.

PROC MEANS, UNIVARIATE and SGPLOT
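
As a rough sketch of what these three procedures might look like together (again, not the project's actual code), the example below uses a hypothetical dataset work.growth with made-up growth_avg values. Only the 2.08 reference line comes from the analysis; the sample values were chosen so that their median is also 2.08.

```sas
/* Hypothetical data: one growth average (in percent) per stock. */
data work.growth;
    input ticker $ growth_avg;
    datalines;
AAA 1.20
BBB 1.90
CCC 2.08
DDD 2.60
EEE 9.50
;
run;

/* Summary statistics, including the median and interquartile range. */
proc means data=work.growth n mean median std min max qrange maxdec=2;
    var growth_avg;
run;

/* Normality tests plus a histogram with a fitted normal curve.
   MU=EST and SIGMA=EST are the defaults (sample mean and standard deviation). */
proc univariate data=work.growth normal;
    var growth_avg;
    histogram growth_avg / normal(mu=est sigma=est);
run;

/* Horizontal box plot with a reference line at the median. */
proc sgplot data=work.growth;
    hbox growth_avg;
    refline 2.08 / axis=x label="Median";
run;
```

With a horizontal box plot the analysis variable sits on the X axis, which is why the reference line uses AXIS=X; a VBOX version would use AXIS=Y instead.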

This is just a simple loop that removes the specified outliers from the dataset before the analysis is replicated without them.

Simple observation cleanup
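
A minimal sketch of this kind of cleanup, assuming the outliers are identified by ticker. The dataset, the tickers, and the choice of which rows count as outliers are all hypothetical here; the real list comes from the analysis.

```sas
/* Hypothetical data: one growth average (in percent) per stock. */
data work.growth;
    input ticker $ growth_avg;
    datalines;
AAA 1.20
BBB 1.90
CCC 2.08
DDD 2.60
EEE 9.50
;
run;

data work.growth_no_outliers;
    set work.growth;
    /* Hypothetical list of tickers flagged as outliers. */
    array flagged{2} $ 8 _temporary_ ("AAA" "EEE");
    do i = 1 to dim(flagged);
        /* DELETE drops the current row and moves on to the next one. */
        if ticker = flagged{i} then delete;
    end;
    drop i;
run;

proc print data=work.growth_no_outliers;
run;
```

For a short list, `if ticker in ("AAA", "EEE") then delete;` does the same job without a loop. Writing to a new dataset keeps the original intact, so the analysis can be run on both versions.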

You can find the full analysis here.


Challenges

The most challenging aspect of this analysis was understanding the statistical concepts. I didn't study statistics at an advanced level academically, so grasping things like IQR and STD came a bit slower than I would have liked. I did take an introductory statistical concepts course provided by SAS, which helped me out greatly (some of my certifications can be found on my GitHub home page).

Another challenge was understanding histograms as I don’t have a lot of experience with them.
