Skip to content

Michigan State Bootcamp Module 1: Excel Challenge - Analyze Excel data from 4,000 past projects of from Kickstarter in order to find market trends.

Notifications You must be signed in to change notification settings

molleighH/Module-1-Challenge-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 

Repository files navigation

MSU Data Analysis Bootcamp

Module 1 Challenge: Microsoft Excel

Excel Challange Analytic Report

  1. Given the provided data, what are three conclusions that we can draw about crowdfunding campaigns?

    1. Based on the dataset, I can conclude that the three most successful crowdfunding campaigns come from theater, music, and film & video categories; however, theater, music, and film & video are also the most failed and most canceled categories. Therefore, success is not guaranteed for new crowdfunding campaigns based solely on selecting a campaign within one of these three (parent) categories.
    2. Based on the dataset, I can conclude that for each of the seven countries listed, the sub-category of ‘plays’ typically has the most successes and the most failures within the sub-categories, regardless of which country is being examined. This commonality is especially interesting when considering the varying sizes of these nations, varying languages, varying educational standards, varying political experiences, etc.
    3. Based on the dataset, I can conclude that the most successful time of year for crowdfunding campaigns is in July. In addition, the relationship between the rate of failed campaigns and the rate of canceled campaigns seem to be the most correlated relationship. This suggests that similar factors may be affecting the likelihood of a campaign’s failure or cancellation.
  2. What are some limitations of this dataset?

    1. There are various limitations that can affect the quality and the capabilities of a dataset; for example, this dataset may suffer from sampling bias, which suggests that the data may not be collected randomly or that the data is built with biased samples that may not truly represent the population. These complications may hinder the accuracy or the effectiveness of the dataset.
    2. In addition, any outliers may skew the results, therefore, sabotaging the analysis of the dataset. As a result, any insights or deductions pulled from the dataset may be misleading or inaccurate.
    3. Finally, this dataset may be prone to selection bias; for example, based on the names and descriptions in ‘blurbs’, there is a possibility that these values were carefully chosen based on the specific information they provide.
    4. Therefore, the dataset may be biased, thus narrowing and distorting any conclusions the dataset may generate.In conclusion, there are numerous limitations that may exist in a dataset; sampling bias, outliers, and/or selection bias may be some of the more likely limitations of this dataset, thus skewing any analysis.
  3. What are some other possible tables and/or graphs that we could create, and what additional value would they provide?

    1. Another valuable way to analyze this dataset is to organize the highest ‘percent funded’ by country, and then use the ‘blurbs’ to understand which organizations or groups typically exceed their funding ‘goals’. This table/graph would make it easier to deduce which groups are most likely to exceed their funding goals, in each country. Thus, this analysis would suggest which type of groups/organizations, in each country, are most likely to succeed in their crowdfunding campaign. Who has the best chance?
  4. Use your data to determine whether the mean or the median better summarizes the data.

    1. In this case, the data is more accurately represented by the mean, because there is such a range between the data points. The median for both the successful and failed campaigns falls far below the actual average; therefore, the mean is more instructive with this data.
  5. Use your data to determine if there is more variability with successful or unsuccessful campaigns. Does this make sense? Why or why not?

    1. There is more variability with the successful campaigns than the unsuccessful; perhaps this does make sense, because it seems likely that a more successful campaign would have more backers. The juxtaposition of the variance and standard deviation of both campaigns illustrates this variability.

About

Michigan State Bootcamp Module 1: Excel Challenge - Analyze Excel data from 4,000 past projects of from Kickstarter in order to find market trends.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published