# Research Proposal: Neighbors and Mental Well-Being

## 1. Research Question
Is there a relationship between the number of neighbors someone knows by name and their mental well-being?

**Rationale:** This question explores the potential association between social familiarity (knowing neighbors) and mental health. Knowing neighbors could indicate social connection, which may serve as a buffer against stress or loneliness. Understanding this connection could help guide community programs that foster neighborhood ties, thereby promoting mental well-being.

## 2. Population Parameter of Interest
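**Parameter of Interest:** The population parameter of interest is the slope ($\beta_1$) of the linear relationship between the number of neighbors known by name and the mental well-being score, i.e., the change in the mean mental well-being score for each additional neighbor known.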

**Outcome of Interest:** We aim to explore the association between the number of neighbors known and mental well-being, identifying any linear relationship between the two and examining its direction and strength.

## 3. Variables and Exploration Plan
**Independent Variable:**  
- **`CONNECTION_neighbours_name_num`**: Number of neighbors known by name (quantitative variable).  
  **Label:** "How many of your neighbors do you know by name?"  
  **Description:** This variable records the number of neighbors an individual knows by name, including those who live next door, in the same building, or on the same street.  

**Dependent Variable:**  
- **`WELLNESS_self_rated_mental_health`**: Mental well-being score (self-rated response, treated as a numeric score in this analysis).  
  **Label:** "At the present time, would you say your MENTAL HEALTH is:"  
  **Description:** This variable captures an individual's self-rated mental health status.  

**Justification for Choosing These Variables:** These variables allow us to explore whether there’s a measurable association between community connections and mental well-being, an important topic with practical implications.

## 4. Planned Visualizations
#### For Mental Well-Being Scores
- **Histogram:** A histogram will visualize the distribution of mental well-being scores, allowing us to assess normality and understand the overall mental health landscape.

- **Box Plot:** A box plot will provide summary statistics (median, quartiles) of mental well-being scores, highlighting the central tendency and variability within the data.

#### For Number of Neighbors Known
- **Histogram:** A histogram will display the distribution of the number of neighbors known by name, providing insight into how many neighbors participants typically know.

#### Overall Relationship Between Variables
- **Scatter Plot:** A scatter plot will visualize the relationship between the number of neighbors known and mental well-being scores, revealing any linear or non-linear trends that may exist between the two variables.

- **Box Plot (Grouped by Neighbor Counts):** We will group participants by ranges of neighbors known (e.g., 0-2, 3-5, 6-8, 9+) and create box plots to compare their mental well-being scores. This will highlight any trends in mental well-being based on neighborhood familiarity.
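
The grouping step behind the grouped box plot can be sketched as follows. This is a minimal illustration using only the Python standard library; the function name `neighbour_group` and the toy records are hypothetical, and the bin edges simply follow the example ranges given above.

```python
from statistics import median

def neighbour_group(count):
    """Map a neighbour count to one of the example ranges (0-2, 3-5, 6-8, 9+)."""
    if count <= 2:
        return "0-2"
    if count <= 5:
        return "3-5"
    if count <= 8:
        return "6-8"
    return "9+"

# Hypothetical toy records: (neighbours known by name, mental well-being score).
records = [(0, 2), (1, 3), (4, 3), (5, 4), (7, 3), (8, 5), (12, 4), (20, 5)]

# Collect the well-being scores that fall into each neighbour range.
scores_by_group = {}
for count, score in records:
    scores_by_group.setdefault(neighbour_group(count), []).append(score)

# The median per group is the centre line each box plot would display.
group_medians = {group: median(scores) for group, scores in scores_by_group.items()}
print(group_medians)
```

Each list in `scores_by_group` is what one box in the grouped box plot would summarize.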


## 5. Planned Analysis
### Hypothesis Testing
- **Hypotheses:**
  - **Null Hypothesis (H₀):** $\beta_1 \leq 0$ (there is no positive association between the number of neighbors known and mental well-being).
  - **Alternative Hypothesis (H₁):** $\beta_1 > 0$ (there is a positive association).

### Steps:
1. **Simulating the Null Hypothesis:** We’ll generate a sampling distribution of the slope under the null hypothesis by repeatedly shuffling (permuting) the mental well-being scores across participants, which breaks any relationship between the number of neighbors known and mental well-being.
   
2. **Calculate Test Statistic:** For each simulated dataset, we’ll calculate the slope ($\beta_1$) of the regression line between the permuted mental well-being scores and the number of neighbors known.

3. **Observing the Actual Slope:** We’ll calculate the slope ($\beta_1$) for the actual data (non-permuted). This value will represent the strength and direction of the observed relationship.

4. **Calculate p-value:** The proportion of permuted slopes greater than or equal to the observed slope (for a one-sided test) will be our p-value.

5. **Decision Rule:** If the p-value is below our chosen significance level (e.g., 0.05), we’ll reject the null hypothesis, suggesting a statistically significant positive association.
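
The five steps above can be sketched as a small permutation test. This is an illustrative sketch, not the final analysis code: the helper names (`slope`, `permutation_p_value`), the number of permutations, the seed, and the toy data are all assumptions made for the example.

```python
import random

def slope(x, y):
    """Least-squares slope of y regressed on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx

def permutation_p_value(x, y, n_permutations=2000, seed=0):
    """One-sided p-value: proportion of permuted slopes >= the observed slope."""
    rng = random.Random(seed)   # fixed seed so the sketch is reproducible
    observed = slope(x, y)      # step 3: slope of the actual data
    shuffled = list(y)
    at_least_as_large = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)                  # step 1: simulate the null
        if slope(x, shuffled) >= observed:     # step 2: slope of permuted data
            at_least_as_large += 1
    return observed, at_least_as_large / n_permutations   # step 4

# Hypothetical toy data (not survey values).
neighbours = [0, 1, 2, 3, 4, 5, 6, 8, 10, 12]
wellbeing = [1, 2, 2, 3, 2, 3, 4, 4, 5, 5]

observed_slope, p_value = permutation_p_value(neighbours, wellbeing)
reject_null = p_value < 0.05    # step 5: decision rule at the 0.05 level
print(observed_slope, p_value, reject_null)
```

Shuffling only the well-being scores keeps both marginal distributions intact while removing any link between the two variables.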

### Simple Linear Regression
- **Method:** A simple linear regression will quantify the effect of the number of neighbors known on mental well-being scores.
  
- **Interpretation:** The slope ($\beta_1$) will tell us if there’s an increase in mental well-being with each additional neighbor known by name.
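
For reference, the model behind this slope can be written out as below. The notation is assumed for illustration and is not fixed elsewhere in the proposal: $Y_i$ is the mental well-being score of participant $i$, $x_i$ is the number of neighbors that participant knows by name, and $\epsilon_i$ is the error term.

$$
Y_i = \beta_0 + \beta_1 x_i + \epsilon_i,
\qquad
\hat{\beta}_1 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(Y_i - \bar{Y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}
$$

Under this model, $\hat{\beta}_1$ estimates the change in the mean well-being score associated with knowing one additional neighbor by name.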

### Analysis Method

The selected analysis methods for exploring the relationship between neighbor familiarity and mental well-being include data summarization, visualization, hypothesis testing, and simple linear regression. These methods are appropriate as they allow for a comprehensive examination of the association between the independent variable (number of neighbors known) and the dependent variable (mental well-being score). The assumptions for linear regression include linearity, independence, homoscedasticity, and normality of residuals. Addressing these assumptions ensures the validity of the analysis results. The visualizations (scatter plot, box plot, and histogram) will help in assessing the distribution of the data and the relationship between the variables, while the hypothesis testing will provide insights into the statistical significance of the observed relationships.

### Assumptions for Analysis

1. **Linearity:** The relationship between the number of neighbors known and mental well-being should be linear, which will be checked using scatter plots.

2. **Independence:** The observations should be independent of one another. This means that the mental well-being of one individual should not influence that of another.

3. **Homoscedasticity:** The variance of mental well-being scores should remain constant across all levels of neighbor familiarity. This will be assessed through residual plots.

4. **Normality of Residuals:** The residuals from the linear regression model should be approximately normally distributed, which will be checked using a histogram of the residuals.
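
The residual-based checks (points 3 and 4) both start from the fitted values and residuals of the regression. A minimal sketch of how these could be computed is shown below; the helper name `fit_line` and the toy data are hypothetical.

```python
def fit_line(x, y):
    """Return (intercept, slope) of the least-squares line for y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1

# Hypothetical toy data (not survey values).
neighbours = [0, 1, 2, 3, 4, 5, 6, 8, 10, 12]
wellbeing = [1, 2, 2, 3, 2, 3, 4, 4, 5, 5]

b0, b1 = fit_line(neighbours, wellbeing)
fitted = [b0 + b1 * xi for xi in neighbours]
residuals = [yi - fi for yi, fi in zip(wellbeing, fitted)]

# Residuals vs. fitted values feed the residual plot (homoscedasticity);
# the residuals alone feed the histogram (normality).
for f, r in zip(fitted, residuals):
    print(f"{f:6.2f} {r:6.2f}")
```

A residual plot with roughly constant vertical spread would be consistent with homoscedasticity, and a roughly bell-shaped residual histogram would be consistent with normality.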


## 6. Hypothesis and Expected Results
### Hypothesis:
The hypothesis for this analysis is that knowing a greater number of neighbors by name will be positively associated with better mental well-being.

**Expected Results:** We expect to observe a positive slope in the simple linear regression, indicating that increased neighbor familiarity is associated with higher mental well-being scores. If the p-value falls below the chosen significance level, this would support the idea that social connectedness within one’s neighborhood may contribute positively to mental health.

## 7. Ethical Considerations
- **Privacy and Confidentiality:** Anonymity is essential due to the sensitive nature of mental health data. 

- **Informed Consent and Transparency:** Participants should be fully informed about the use of their data for research purposes.

- **Avoiding Harm and Misinterpretation:** Findings must be reported with caution to prevent misinterpretation and ensure results are used constructively.


# __________________________________________________________

# Research Proposal: Everyday Discrimination and Social Isolation

### 1. Research Question
Does a larger gap between the amount of time people wish to spend with family and the actual time spent correlate with higher levels of everyday discrimination experiences?

**Rationale**: This question investigates the impact of social isolation and unmet social needs on perceived discrimination. It’s interesting because it could highlight the psychosocial effects of discrimination, leading to a better understanding of how to support marginalized communities.

### 2. Population Parameter of Interest
- **Parameter of Interest**: The population parameter of interest is the correlation between the gap in desired and actual social time with family members and the level of everyday discrimination experienced.
  
- **Outcome of Interest**: We want to explore whether a larger gap between desired and actual time spent with family correlates with increased levels of perceived everyday discrimination. This could help identify if unmet social needs are linked to feelings of being discriminated against.

### 3. Variables and Exploration Plan

**Independent Variable**:
- **Variable Name**: `CONNECTION_social_time_family_p7d_grouped` (Actual time spent with family).
- **Label**: "In the PAST WEEK, how many hours in total did you spend socializing with others from the following groups? - Family Members"
- **Description**: This variable captures the actual amount of time individuals spent socializing with family members in the past week.

**Desired Time Variable**:
- **Variable Name**: `CONNECTION_preference_time_family_grouped` (Preferred time with family).
- **Label**: "How much time per week would you like to spend socializing with others from the following groups? - Family Members"
- **Description**: This variable captures the amount of time individuals wish they could spend socializing with family members.

**Gap Variable**:
- **Gap Calculation**: The gap is calculated as the difference between the desired time and actual time spent with family:
  
  $\text{Gap} = \texttt{CONNECTION\_preference\_time\_family\_grouped} - \texttt{CONNECTION\_social\_time\_family\_p7d\_grouped}$
  
- **Description**: A larger gap indicates a higher unmet social need.
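
The gap calculation can be sketched as follows. This assumes that both grouped variables have already been converted to numeric hours per week (for example, by taking a representative value for each group); that conversion, the function name `social_time_gap`, and the toy respondents are assumptions made for illustration.

```python
def social_time_gap(preferred_hours, actual_hours):
    """Gap = preferred time minus actual time; None if either value is missing."""
    if preferred_hours is None or actual_hours is None:
        return None
    return preferred_hours - actual_hours

# Hypothetical respondents with hours per week already in numeric form.
respondents = [
    {"preferred": 10.0, "actual": 4.0},   # wants more family time than they get
    {"preferred": 5.0, "actual": 5.0},    # preference is met
    {"preferred": 2.0, "actual": 6.0},    # spends more time than preferred
    {"preferred": None, "actual": 3.0},   # missing response
]

gaps = [social_time_gap(r["preferred"], r["actual"]) for r in respondents]
print(gaps)
```

A positive gap corresponds to an unmet social need, a gap of zero to a met preference, and a negative gap to more family time than desired.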

**Dependent Variable**:
- **Variable Name**: `LIFECOURSE_everyday_discrimination_respect` (Everyday discrimination experiences).
- **Label**: "In your day-to-day life, how often do you feel you are treated with less respect than other people?"
- **Description**: This variable assesses how often individuals feel they are treated with less respect in their everyday lives, serving as an indicator of perceived discrimination.

**Justification for Choosing These Variables**: These variables are chosen to explore the relationship between unmet social needs and feelings of discrimination. The gap between desired and actual time spent with family may reveal psychological impacts, such as increased feelings of disrespect or marginalization.

### 4. Planned Visualizations

1. **Box Plot (Actual Time Spent with Family)**: A box plot will visualize the distribution of actual time spent socializing with family members. This will help identify the median, quartiles, and any outliers in the data, providing insights into how much time individuals typically spend with their family.

2. **Box Plot (Preferred Time with Family)**: A box plot will display the distribution of preferred time spent with family members. This visualization will similarly illustrate the median, quartiles, and outliers, allowing for a comparison of how much time individuals wish to spend with family versus how much time they actually spend.

3. **Histogram (Gap in Time)**: A histogram will show the distribution of the gap between preferred and actual time spent with family. This will provide a clear visual representation of how significant the unmet social needs are among individuals and identify any outliers in the data.

4. **Bar Chart (Categorical Discrimination Levels)**: A bar chart will visualize the frequency of reported discrimination experiences across different categories. This will help understand how many individuals report feeling respected versus those who feel disrespected in their day-to-day lives.

Once the individual visualizations have provided insights, we can examine the relationship between the gap in social time and discrimination experiences. This first requires converting the discrimination responses to numerical scores.

##### Discrimination Scoring
To facilitate correlation analysis, we will assign numerical scores to the categorical discrimination levels reported by participants. For example:
- "Always respected" = 0
- "Often respected" = 1
- "Sometimes respected" = 2
- "Rarely respected" = 3
- "Never respected" = 4

This scoring system will enable us to perform correlation and regression analyses on the relationship between the gap in social time and perceived discrimination.
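
As a small illustration, the example scoring above could be applied with a lookup table. The labels are the example labels from the list above and may not match the survey's exact response wording; the names `DISCRIMINATION_SCORES` and `score_response` are hypothetical.

```python
# Example scoring from the text; higher scores mean feeling less respected.
DISCRIMINATION_SCORES = {
    "Always respected": 0,
    "Often respected": 1,
    "Sometimes respected": 2,
    "Rarely respected": 3,
    "Never respected": 4,
}

def score_response(label):
    """Return the numeric score for a response label, or None if unrecognized."""
    return DISCRIMINATION_SCORES.get(label)

# Hypothetical responses, including one that is missing from the table.
responses = ["Often respected", "Never respected", "Prefer not to say"]
scores = [score_response(label) for label in responses]
print(scores)
```

Returning `None` for unrecognized labels keeps missing or unexpected responses from being silently scored as 0.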

- **Scatter Plot with Linear Regression Line (Gap and Discrimination Levels)**: A scatter plot will depict the relationship between the gap in time spent with family and the numerical discrimination scores. Including a linear regression line will help illustrate any trends or correlations, allowing for a clearer understanding of how the gap in social time correlates with perceived discrimination levels.

These visualizations are chosen to suit the nature of each variable, ensuring we gain a well-rounded understanding of both individual data and variable relationships.


### 5. Planned Analysis
##### Hypothesis Testing
- **Null Hypothesis (H₀)**: There is no correlation between the gap in time spent with family and everyday discrimination experiences.

- **Alternative Hypothesis (H₁)**: A larger gap between desired and actual time spent with family correlates with higher levels of everyday discrimination experiences.

##### Analysis Process
To test these hypotheses, I will:
1. **Simulate the Null Hypothesis**: Generate a sampling distribution under the null hypothesis.
2. **Calculate Test Statistic**: Calculate the correlation for each simulated dataset.
3. **Observe the Actual Correlation**: Calculate the correlation for the actual data using the numerical scores assigned to discrimination experiences.
4. **Calculate p-value**: Determine the proportion of simulated correlations that are greater than or equal to the observed correlation.
5. **Decision Rule**: If the p-value < 0.05, reject H₀.
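
One way to carry out this process is a permutation test on the correlation, sketched below. Shuffling the discrimination scores is an assumed way of simulating the null hypothesis (the steps above do not fix a method), and the helper names, number of permutations, seed, and toy data are all hypothetical.

```python
import math
import random

def pearson_r(x, y):
    """Pearson correlation coefficient between x and y."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)

def permutation_test_r(x, y, n_permutations=2000, seed=1):
    """Proportion of shuffled correlations >= the observed correlation."""
    rng = random.Random(seed)
    observed = pearson_r(x, y)
    shuffled = list(y)
    at_least_as_large = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)   # break any link between gap and score
        if pearson_r(x, shuffled) >= observed:
            at_least_as_large += 1
    return observed, at_least_as_large / n_permutations

# Hypothetical toy data: gap in hours and numeric discrimination scores.
gap_hours = [-4, -2, 0, 0, 1, 2, 3, 5, 6, 8]
discrimination = [0, 1, 0, 1, 1, 2, 2, 3, 3, 4]

observed_r, p_value = permutation_test_r(gap_hours, discrimination)
print(observed_r, p_value, p_value < 0.05)
```

Counting only correlations greater than or equal to the observed value makes this a one-sided test, matching the direction of the alternative hypothesis.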

##### Simple Linear Regression
- **Interpretation**: A simple linear regression will quantify the relationship between the gap in time spent with family and the numerical discrimination scores. The slope (β₁) will indicate how an increase in the gap correlates with an increase in perceived discrimination experiences, suggesting that unmet social needs may predict higher discrimination levels.

### 6. Hypothesis and Expected Results
If there is a larger gap between the desired time spent with family and the actual time spent, individuals are likely to report higher numerical scores of everyday discrimination experiences. This suggests that unmet social needs contribute to feelings of disrespect and marginalization.

**Relevance of Results**: The results will clarify whether unmet social needs contribute to perceptions of discrimination. A significant correlation would imply that addressing social isolation could be an essential step in reducing discrimination experiences, thereby aiding in the development of supportive community programs.


### 7. Ethical Considerations
**Privacy and Confidentiality**: Protect anonymity due to sensitive data.

**Informed Consent**: Ensure participants understand data use.

**Avoiding Harm and Misinterpretation**: Report findings carefully.

# __________________________________________________________

# Research Proposal: Community Ties and Academic Success

## 1. Research Question
How does knowing a higher number of neighbors by name relate to academic success indicators?

**Rationale:** This question explores how community ties can influence academic performance. Understanding this relationship could inform educational strategies aimed at fostering student support systems within neighborhoods.

## 2. Population Parameter of Interest
**Parameter of Interest:** The population parameter I’m interested in estimating is the correlation between the number of neighbors known by name and various academic success indicators.

**Outcome of Interest:** I want to investigate if there is a positive relationship between knowing a higher number of neighbors and achieving higher levels of educational attainment.

## 3. Variables and Exploration Plan
**Independent Variable:**
- **Variable Name:** `CONNECTION_neighbours_name_num`
- **Label:** "How many of your neighbors do you know by name? Note: By neighbors we mean people who live next door, in your building, and/or on your street."
- **Description:** This variable captures the number of neighbors that individuals are familiar with by name, serving as an indicator of social capital and community engagement.

**Dependent Variable:**
- **Variable Name:** `DEMO_education_diploma`
- **Label:** "Have you received any of the following degrees or certifications? (Check all that apply) - High school diploma or high school equivalency certificate."
- **Categories:**
  - High school diploma or high school equivalency certificate
  - Certificate of Apprentice, Certificate of Qualification (Journeypersons designation) or other trade certificate or diploma
  - College, CEGEP or other non-university certificate or diploma
  - University certificate or diploma below bachelor level
  - Bachelor's degree (e.g., B.A., B.Sc., B.Ed., etc.)
  - University certificate or diploma above bachelor level
  - Degree in medicine, dentistry, veterinary medicine or optometry (M.D., D.D.S., D.M.D., D.V.M., O.D.)
  - Master's degree (e.g., M.A., M.Sc., M.Ed., M.B.A.)
  - Doctorate (e.g., Ph.D.)

**Justification for Choosing These Variables:** These variables are selected to explore the relationship between community engagement, as indicated by the number of neighbors known, and academic success. Understanding this relationship could provide insights into how social networks influence educational outcomes.

**Planned Visualizations:**
- **Scatter Plot:** This will visualize the relationship between the number of neighbors known by name and the highest level of education attained, illustrating trends and correlations.
- **Bar Chart:** A bar chart could display the distribution of education levels across different ranges of neighbors known, helping to see how academic success varies with community ties.
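
Both plots need a single education level per respondent, while the survey item is "check all that apply". One possible way to derive a highest level is sketched below. The ordering follows the order of the category list above, which is an assumption about ranking; the shortened labels and the names `EDUCATION_LEVELS` and `highest_education_rank` are hypothetical.

```python
# Shortened, hypothetical labels in the same order as the category list.
EDUCATION_LEVELS = [
    "High school",
    "Trade certificate",
    "College/CEGEP",
    "University below bachelor",
    "Bachelor's",
    "University above bachelor",
    "Medicine/dentistry/veterinary/optometry",
    "Master's",
    "Doctorate",
]

# Rank 1 is the first listed level, rank 9 the last.
LEVEL_RANK = {label: rank for rank, label in enumerate(EDUCATION_LEVELS, start=1)}

def highest_education_rank(checked_labels):
    """Return the highest rank among the checked levels, or 0 if none apply."""
    ranks = [LEVEL_RANK[label] for label in checked_labels if label in LEVEL_RANK]
    return max(ranks) if ranks else 0

# Hypothetical respondent who checked two boxes.
print(highest_education_rank(["High school", "Bachelor's"]))
```

The resulting rank is ordinal rather than truly numeric, so equal steps between ranks should not be read as equal differences in education.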

## 4. Planned Analysis
### Analysis Method(s):
The analysis will include the following steps:

1. **Data Summarization:** Calculate summary statistics (mean, median, frequency distribution) for the number of neighbors known and the various levels of educational attainment to understand the data distributions.
   
2. **Visualization:** Create scatter plots and bar charts to inspect the relationships and distributions visually.
   
3. **Hypothesis Testing:** 
   - **Null Hypothesis (H₀):** There is no positive correlation between the number of neighbors known by name and academic success indicators (i.e., the correlation is less than or equal to zero).
   - **Alternative Hypothesis (H₁):** Knowing a higher number of neighbors by name is positively correlated with higher academic success indicators (i.e., the correlation is greater than zero).

   To test these hypotheses, I will:
   - **Simulate a Sampling Distribution:** Using the data, I will create a sampling distribution of the correlation under the null hypothesis, showing which correlation values would be expected if the number of known neighbors were unrelated to academic success indicators.
   - **Compute a p-value:** From this sampling distribution, I will compute a one-sided p-value to assess the strength of evidence against the null hypothesis. A low p-value would suggest a significant positive relationship, supporting the alternative hypothesis.

4. **Simple Linear Regression:** A simple linear regression analysis will be performed to quantify the relationship between the number of neighbors known and academic success indicators. This will allow for estimating the slope (β₁) and testing its significance to assess if there’s a meaningful correlation.

### Assumptions:
- **Linearity:** The relationship between the independent and dependent variables is linear.
- **Normality:** The residuals of the regression model are normally distributed.
- **Homoscedasticity:** The variance of residuals is constant across all levels of the independent variable.
- **Independence:** The observations are independent of one another.

## 5. Hypothesis and Expected Results
**Hypothesis Statement:** If individuals know a higher number of neighbors by name, then they are likely to attain higher levels of educational qualifications, as social ties may facilitate support and resources that enhance academic performance.

**Relevance of Results:** The results will help clarify whether community engagement contributes to academic success. A significant correlation would suggest that fostering community ties could be an essential strategy in educational interventions aimed at improving student outcomes.

## 6. Ethical Considerations
When conducting this research, the following ethical considerations will be prioritized:

- **Informed Consent:** Participants will be fully informed about the study's purpose, procedures, potential risks, and benefits before giving their consent to participate.
- **Confidentiality:** All data collected will be kept confidential and anonymized to protect participants' identities and personal information.
- **Sensitive Topics:** Participants may feel discomfort discussing their community ties or academic performance. Researchers will provide resources for support and allow participants to withdraw from the study at any time without consequences.
- **Fair Treatment:** Participants from diverse backgrounds should be treated fairly and equitably throughout the research process, ensuring that their voices are heard and respected.

By addressing these ethical considerations, the research will prioritize participant well-being and integrity, fostering a respectful and supportive environment.


# __________________________________________________________

pls pair me with smart people i work hard pls