## **Big Idea 5.4 Crowdsourcing**

![image.png](attachment:image.png)

### **What is Crowdsourcing?**
- Crowdsourcing is a method of collecting help, ideas, or input from a large and varied group of people, typically through online platforms. Rather than depending on a few individuals or experts, crowdsourcing draws on the shared knowledge, experiences, and skills of many people to solve problems, collect data, or generate content. Including more diverse perspectives in this way can help limit computer bias.
- The broader the crowdsourcing effort, and the further it reaches beyond your immediate circle, the better your chance of reducing computer bias. Crowdsourcing also lets people share knowledge, access shared resources, and work together across different locations, including by pooling computing power through distributed computing.

![image.png](attachment:image.png)

### **Types of Crowdsourcing**

Crowdsourcing takes different forms depending on what the crowd contributes: money, creative work, opinions, or knowledge. There are four main types of crowdsourcing:

#### 1. Crowdfunding
- Purpose: Raising money by collecting small contributions from a large group of people.  
- Examples: Kickstarter, GoFundMe, Indiegogo.  
- Use Case: A startup company seeking funds to launch a new product.
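
As a rough illustration of the crowdfunding idea, the sketch below adds up many small pledges and checks them against a goal. The pledge amounts, the goal, and the `funding_status` helper are all hypothetical and are not taken from any real platform.

```python
# Hypothetical pledges (in dollars) from individual backers.
pledges = [25, 10, 50, 5, 100, 20, 15]
goal = 200  # hypothetical funding goal


def funding_status(pledges, goal):
    """Return the total raised, the share of the goal reached, and whether the goal is met."""
    total = sum(pledges)
    return {
        "total": total,
        "percent_of_goal": round(100 * total / goal, 1),
        "funded": total >= goal,
    }


status = funding_status(pledges, goal)
print(status)
```

No single pledge in this made-up list comes close to the goal, but together they pass it, which is the core idea behind crowdfunding.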

#### 2. Crowd Creation
- Purpose: Gathering creative input from a crowd, often for content generation or design.  
- Examples: Threadless (design competitions), Wikipedia (content creation).  
- Use Case: A company asking graphic designers to submit logo ideas.

#### 3. Crowd Voting
- Purpose: Collecting public opinion or feedback to make decisions or rank options.  
- Examples: Reddit upvotes, talent show voting systems.  
- Use Case: A company asking users to vote on their next product color.
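
Crowd voting comes down to counting and ranking. The sketch below is one minimal way to tally votes, assuming each vote arrives as a plain string. The vote data and the `tally_votes` helper are hypothetical.

```python
from collections import Counter

# Hypothetical votes for a product color, one entry per voter.
votes = ["blue", "red", "blue", "green", "red", "blue"]


def tally_votes(votes):
    """Count votes and return (option, count) pairs from most to least popular."""
    return Counter(votes).most_common()


ranking = tally_votes(votes)
winner = ranking[0][0]
print(ranking)
print("Winner:", winner)
```

A real voting system would also need to deal with ties, duplicate votes, and spam, which this sketch ignores.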

#### 4. Crowd Wisdom
- Purpose: Using collective intelligence for decision-making or problem-solving.  
- Examples: Prediction markets, Stack Overflow, Quora.  
- Use Case: Combining the independent predictions of many people to forecast stock market trends.
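
One common way to use crowd wisdom is to combine many independent estimates into a single number. The sketch below uses the mean and the median on made-up guesses for a jelly-bean jar. The guesses and the true count are hypothetical, and the approach assumes people guess independently of each other.

```python
from statistics import mean, median

# Hypothetical guesses for the number of jelly beans in a jar.
guesses = [410, 520, 480, 350, 900, 505, 460]
true_count = 500  # hypothetical answer, used only to measure error

crowd_mean = mean(guesses)
crowd_median = median(guesses)

# How far each individual guess and each crowd estimate is from the truth.
individual_errors = [abs(g - true_count) for g in guesses]
mean_error = abs(crowd_mean - true_count)
median_error = abs(crowd_median - true_count)

print("Mean estimate:", round(crowd_mean, 1), "error:", round(mean_error, 1))
print("Median estimate:", crowd_median, "error:", median_error)
print("Largest individual error:", max(individual_errors))
```

The median is often preferred when a few guesses are far off (like the 900 here), because one extreme value shifts the mean more than it shifts the median.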

---

#### Key Benefits of Crowdsourcing
- Access to diverse ideas and skills.  
- Cost-effective for organizations.  
- Encourages community engagement.

#### Challenges of Crowdsourcing
- Ensuring quality control.  
- Managing intellectual property rights.  
- Avoiding exploitation of contributors.

![image.png](attachment:image.png)

### **Introduction to Data Crowdsourcing**

Data crowdsourcing is a method of collecting data from a large group of people, typically through online platforms. It involves the collaboration of many individuals to gather, verify, or analyze information.

#### 1. What is Data Crowdsourcing?
- Definition: Collecting data, ideas, or solutions from a large group of people, often via the internet.
- Purpose: To gather diverse, large-scale, or hard-to-collect data from volunteers or participants.

#### 2. Examples of Data Crowdsourcing
- Wikipedia: Collaborative content creation and editing.
- Google Maps: Users contribute real-time traffic data and location details.
- Amazon Mechanical Turk: Individuals complete microtasks to gather or analyze data.

---

#### Key Benefits of Data Crowdsourcing
- Quick data collection: Access to vast amounts of data in a short time.
- Diverse input: Contributions from a broad and varied group.
- Cost-effective: Collects data without the need for a large paid workforce.
- Scalability: Easily adapts to large-scale data gathering.

#### Challenges of Data Crowdsourcing
- Quality control: Ensuring data accuracy and reliability from diverse contributors.
- Privacy concerns: Protecting personal information when crowdsourcing data.
- Participant motivation: Encouraging continued contributions without direct compensation.
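
One simple approach to the quality-control challenge is to ask several contributors the same question and accept an answer only when enough of them agree. The sketch below shows majority voting with an agreement threshold. The data, the `aggregate_labels` helper, and the 0.6 threshold are all illustrative assumptions.

```python
from collections import Counter

# Hypothetical crowdsourced labels: item -> labels from different contributors.
submissions = {
    "photo_1": ["cat", "cat", "dog"],
    "photo_2": ["dog", "dog", "dog"],
    "photo_3": ["cat", "dog"],
}


def aggregate_labels(labels, min_agreement=0.6):
    """Return (label, agreement) if enough contributors agree, else (None, agreement)."""
    top_label, top_count = Counter(labels).most_common(1)[0]
    agreement = top_count / len(labels)
    if agreement >= min_agreement:
        return top_label, agreement
    return None, agreement  # not enough agreement; flag for review


results = {item: aggregate_labels(labels) for item, labels in submissions.items()}
for item, (label, agreement) in results.items():
    print(item, label, round(agreement, 2))
```

Items that come back as `None` could be sent to more contributors or checked by hand instead of being trusted automatically.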

---

#### Ethical Considerations
- Compensation: Should participants be paid for their time or contributions?
- Data Privacy: How should personal data be handled and protected?
- Exploitation: Are crowdsourcing platforms treating contributors fairly, or profiting from their unpaid work?

---

#### Activity
- Crowdsourcing Simulation:
  - Students contribute ideas or vote on a shared document or poll.
  - Discuss results as a class and reflect on the collective data gathered.

---


![image.png](attachment:image.png)

### **Open Source Development**

- Open source development is a way of building software where the code is shared openly with the public. This means anyone can view, use, improve, and share the code. People from around the world can work together to make the software better. A popular place for open source projects is GitHub, where developers can add code, report problems, and discuss ideas.

Examples of Successful Open Source Projects:
- Linux Operating System – Built and updated by a global group of contributors.
- Apache HTTP Server – A key tool used to run websites on the internet.
- WordPress – A popular platform for creating websites and blogs.

Benefits of Open Source Development:
- Encourages teamwork and shared ideas.
- Allows faster improvements because many people contribute.
- Promotes transparency since anyone can see and change the code.

Challenges:
- It can be hard to manage lots of different contributors.
- Project leaders can get overwhelmed (maintainer burnout).
- It's important to follow legal rules about using and sharing code (licenses).

- Even with these challenges, open source development is a great way to build software that is creative, reliable, and open for everyone to use, which makes it a natural match for crowdsourcing projects.

![image.png](attachment:image.png)

#### **Popcorn Hack 1** 
Open Source Development Popcorn Hack:
1. Go to GitHub and find an open source project.
2. Read the code until you understand how it works.
3. Find a bug or small issue in the code and submit your changes.
4. Take a screenshot or write the new code here, and explain what you changed to improve the code or fix the bug.
5. Comment on an issue, or leave a note saying that you fixed one of the bugs.

### **Evidence of Public Datasets**

Public datasets have become invaluable resources for research, innovation, and development across various industries. Below are key examples that highlight the impact of publicly available data:

#### 1. Kaggle Datasets
- Evidence: Hosts thousands of publicly available datasets for data science and machine learning projects.  
- Impact: Supports data scientists by providing structured data for competitions, research, and model development.  
- Notable Fact: Datasets like the Titanic passenger list and COVID-19 statistics have led to impactful insights.

#### 2. Google Open Images
- Evidence: A collection of over 9 million annotated images for machine learning projects.  
- Impact: Provides high-quality image data to improve computer vision models.  
- Notable Fact: Widely used in academic research and AI development.

#### 3. UCI Machine Learning Repository
- Evidence: Offers hundreds of datasets for ML research, dating back to the 1980s.  
- Impact: Became a foundational resource for learning algorithms and testing models.  
- Notable Fact: The Iris dataset, one of the most famous in data science, is hosted here.

#### 4. NASA Earth Observations (NEO)
- Evidence: Provides global climate, atmospheric, and environmental data.  
- Impact: Used by scientists and researchers for climate change analysis and environmental monitoring.  
- Notable Fact: NEO data has contributed to critical studies on global warming and natural disasters.

#### 5. World Bank Open Data
- Evidence: Offers economic, social, and demographic data from countries worldwide.  
- Impact: Supports policy development, academic research, and global economic analysis.  
- Notable Fact: Data from this platform has helped track global poverty trends.

---

#### Key Insights from Public Datasets
- Accessibility: Public datasets provide free and open resources for learning, research, and development.  
- Innovation Driver: These datasets have fueled advancements in AI, healthcare, economics, and more.  
- Global Impact: Public data empowers individuals and organizations to address real-world challenges.

Public datasets continue to democratize data-driven decision-making, fostering progress across various fields.

![image.png](attachment:image.png)

distributed computing - aarush

![image.png](attachment:image.png)

### **Innovations Through Crowdsourcing**

- Crowdsourcing helps create new and exciting products by bringing together ideas and resources from many people. One great example is Spotify, which uses crowdsourcing to improve music recommendations. When users make collaborative playlists, like for a party, Spotify learns about different music tastes. By looking at what people listen to and share, Spotify suggests new songs that match each person’s preferences. Users also help by sharing details about songs, and this information is checked and approved by the community.

- Another example is Kickstarter, a platform that helps people fund creative ideas. Instead of depending on a few big investors, creators can share their projects with the public and get small donations from many supporters. This gives inventors the money they need to turn their ideas into real products.

- In short, crowdsourcing helps drive innovation by letting people work together, share knowledge, and support projects. It allows companies and creators to develop things like personalized music suggestions and new inventions that better serve the needs of their audience.

![image.png](attachment:image.png)

### **Evidence of Crowdsourcing**

Crowdsourcing has played a significant role in shaping industries, solving problems, and creating impactful projects. Below are key examples that highlight the success and influence of crowdsourcing:

#### 1. Wikipedia
- Evidence: Over 6 million English articles created by volunteers worldwide.  
- Impact: Provides free, accessible knowledge across various subjects.  
- Notable Fact: Wikipedia thrives on crowd contributions for editing, verifying, and improving content.

#### 2. Duolingo
- Evidence: Early language content was sourced from volunteers and users.  
- Impact: Helped the platform expand rapidly with accurate translations and course material.  
- Notable Fact: Community contributions played a key role in growing over 40 language courses.

#### 3. Waze
- Evidence: Real-time traffic updates rely on driver reports and community insights.  
- Impact: Enables more accurate navigation by integrating user-contributed road conditions.  
- Notable Fact: Millions of drivers actively contribute data daily.
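
To get a feel for how many individual reports can be combined into one traffic picture, the sketch below averages reported speeds for each road. This is not how Waze actually works. The reports, the road names, the `average_speed_by_road` helper, and the 15 mph cutoff are all hypothetical.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical driver reports: (road, reported speed in mph).
reports = [
    ("Main St", 12), ("Main St", 8), ("Main St", 10),
    ("Highway 5", 62), ("Highway 5", 58),
]


def average_speed_by_road(reports):
    """Group reports by road and return the average reported speed for each road."""
    speeds = defaultdict(list)
    for road, speed in reports:
        speeds[road].append(speed)
    return {road: mean(values) for road, values in speeds.items()}


averages = average_speed_by_road(reports)
# Flag roads whose average reported speed falls below an assumed cutoff.
slow_roads = [road for road, avg in averages.items() if avg < 15]
print(averages)
print("Possible traffic on:", slow_roads)
```

Averaging several reports means one mistaken report has less effect on the result than it would on its own.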

#### 4. LEGO Ideas
- Evidence: Fans submit designs, and winning ideas are turned into official LEGO sets.  
- Impact: Engages LEGO enthusiasts while inspiring product innovation.  
- Notable Fact: The "Women of NASA" set, created through LEGO Ideas, became a top-selling kit.

#### 5. SETI@home
- Evidence: Used idle computing power on volunteers' computers to analyze radio signals for signs of extraterrestrial life.  
- Impact: Enabled researchers to process vast amounts of data without investing in expensive infrastructure.  
- Notable Fact: Millions of users joined, turning personal computers into part of a global science project.
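
The general pattern behind volunteer computing is to split a large job into small work units, process each unit separately, and then combine the partial results. The sketch below shows that pattern on a tiny made-up list. It is not SETI@home's actual software, the helper names and data are hypothetical, and the units are processed one after another here rather than on separate machines.

```python
def split_into_work_units(data, unit_size):
    """Cut a long list of readings into smaller chunks that can be handed out."""
    return [data[i:i + unit_size] for i in range(0, len(data), unit_size)]


def analyze_unit(unit):
    """Stand-in for one volunteer's computer: report the strongest reading in its unit."""
    return max(unit)


# Hypothetical signal strengths.
signal = [3, 7, 2, 9, 4, 1, 8, 6, 5, 0]

units = split_into_work_units(signal, unit_size=4)
partial_results = [analyze_unit(unit) for unit in units]  # one result per "volunteer"
strongest = max(partial_results)  # combine the partial results

print(units)
print(partial_results)
print("Strongest reading:", strongest)
```

Because each unit can be analyzed without looking at the others, the work could be spread across as many computers as there are units.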

---

#### Key Insights from Crowdsourcing Evidence
- Diverse Contributions: Crowdsourcing taps into global talent and knowledge.  
- Efficiency Gains: Projects like SETI@home demonstrate how distributed efforts save resources.  
- Community Engagement: Platforms like LEGO Ideas foster strong user involvement and creativity.

Crowdsourcing continues to revolutionize industries, proving that collective effort can achieve remarkable outcomes.

![image.png](attachment:image.png)

#### **Popcorn Hack 2**
- Think of an example of crowdsourcing, such as Duolingo or Wikipedia, and describe how that example uses crowdsourcing.

obtaining data via crowdsourcing - aarush

![image.png](attachment:image.png)

Popcorn Hack 3 - aarush

#### **Homework Question 1**
- Explain how crowdsourcing contributes to innovation. Provide three examples of successful products or services that were created through crowdsourcing, and describe how the efforts of many people helped make each one successful.

#### **Homework Question 2**
#### The Collaborative Study Guide Builder
Leverage crowdsourcing methods and public datasets to create a powerful, student-driven study guide.

##### Steps:

##### 1. Form a Study Group (Crowdsourcing Element)
- Divide your classmates into small teams and assign each team a specific topic or chapter.  
- Each team is responsible for gathering key points, examples, and practice questions for its topic.  

##### 2. Use Public Datasets for Evidence and Insights
- Find relevant datasets to add real-world examples to your study guide:
  - History Class? Use **World Bank Open Data** for economic trends.  
  - Science Class? Use **NASA NEO** data for environmental patterns.  
  - Math Class? Try **Kaggle** datasets for number-crunching examples.

Homework Question 3 - aarush