# EXPLORING FINANCIAL BARRIERS TO SCIENTIFIC CONFERENCES IN NIGERIA: An Analysis of Registration Costs Among Medical Students

## Sample size estimation

In [44]:
import math

We want to determine the representative sample size needed to estimate the proportion of medical students who attend scientific conferences.

We are working with:
- a 95% confidence level
- a 5% margin of error
- an estimated population of 75000 medical students in Nigeria
- an estimated 50% of these students attending scientific conferences
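These inputs feed the usual sample-size formula for estimating a proportion. As a worked check (the symbols $n_0$, $z$, $p$ and $e$ are notation introduced here for illustration, with $z = 1.96$ being the standard value for a 95% confidence level):

$$
n_0 = \frac{z^2 \, p \, (1 - p)}{e^2} = \frac{1.96^2 \times 0.5 \times 0.5}{0.05^2} = \frac{0.9604}{0.0025} = 384.16
$$

Rounding up gives 385. The population size of 75000 does not enter this uncorrected form of the formula.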

In [45]:
def calculate_sample_size(z_score, proportion, margin_of_error, population_size):
    """Minimum sample size for estimating a proportion.

    Note: population_size is not used; no finite population correction is applied.
    """
    # Calculate the sample size without finite population correction
    conference_sample_size = ((z_score**2) * proportion * (1 - proportion)) / (margin_of_error**2)

    return math.ceil(conference_sample_size)  # Round up to the next whole number

confidence_level = 0.95  # 95% confidence level
z_score = 1.96  # Z-score for 95% confidence
estimated_proportion = 0.5  # 50% of Nigerian medical students attend scientific conferences (estimated)
margin_of_error = 0.05  # 5% margin of error
total_population = 75000  # Total population size

conference_sample_size = calculate_sample_size(z_score, estimated_proportion, margin_of_error, total_population)

In [46]:
print(conference_sample_size)

385
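The function above accepts `population_size` but does not use it. As a minimal sketch of what a finite population correction could look like, the hypothetical helper `sample_size_with_fpc` below (not part of the original notebook) applies the standard adjustment n = n0 / (1 + (n0 - 1) / N). With the same inputs it yields 383, which coincides with the minimum quoted in the next section, although the source does not state how that figure was derived.

```python
import math

def sample_size_with_fpc(z_score, proportion, margin_of_error, population_size):
    """Sample size for a proportion, adjusted with a finite population correction.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    # Unadjusted sample size (same formula as calculate_sample_size, before rounding)
    n0 = (z_score**2) * proportion * (1 - proportion) / (margin_of_error**2)
    # Finite population correction: n = n0 / (1 + (n0 - 1) / N)
    adjusted = n0 / (1 + (n0 - 1) / population_size)
    return math.ceil(adjusted)

corrected_sample_size = sample_size_with_fpc(1.96, 0.5, 0.05, 75000)
print(corrected_sample_size)  # 383
```

Because 75000 is large relative to the sample, the correction only moves the result from 385 to 383. Using the uncorrected 385 in later cells is therefore slightly conservative.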


## Proportional sample size per geo-political zone

We are working with:
- an estimated 75000 medical students in Nigeria
- a minimum of 383 medical students as a representative sample
- a total of 73 NiMSA-accredited medical schools in Nigeria
- 16 medical schools in the South-South, 16 in the South-West, 15 in the South-East, 8 in the North-Central, 9 in the North-West, and 7 in the North-East
- an average graduation quota of 200 students per school for the South-West, 180 for the South-South and South-East, and 150 for the North-Central, North-West and North-East
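The zone populations used in the next cell can be derived from these figures rather than typed in by hand. The sketch below multiplies schools by quota by six academic levels, using the numbers exactly as listed above; the names `zones`, `ACADEMIC_LEVELS` and `zone_population` are illustrative and not from the original notebook. Two things are worth checking against the source: the per-zone school counts sum to 71 rather than 73, and with the quotas as listed the South-West comes to 19200 and the South-South to 17280, whereas the hard-coded values in the next cell assign those two figures the other way round. The source does not say which is intended.

```python
ACADEMIC_LEVELS = 6  # six academic levels per school, as stated in the notebook

# (number of schools, average graduation quota per school), as listed above
zones = {
    "South-South": (16, 180),
    "South-West": (16, 200),
    "South-East": (15, 180),
    "North-Central": (8, 150),
    "North-West": (9, 150),
    "North-East": (7, 150),
}

# Estimated students per zone = schools * quota * academic levels
zone_population = {
    zone: schools * quota * ACADEMIC_LEVELS
    for zone, (schools, quota) in zones.items()
}
total_schools = sum(schools for schools, _ in zones.values())
total_students = sum(zone_population.values())

for zone, students in zone_population.items():
    print(f"{zone}: {students}")
print(f"Schools listed: {total_schools}")
print(f"Estimated students: {total_students}")
```

The estimated total of 74280 is the same under either assignment of the two quotas, since both zones have 16 schools.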

In [47]:
# Number of students per zone: number of schools x estimated quota per school x 6
# (for the 6 academic levels).
# The actual numbers are considerably higher, as most schools admit more than their quota.

s_s = 19200
s_w = 17280
s_e = 16200
n_c = 7200
n_w = 8100
n_e = 6300
total = s_s + s_w + s_e + n_c + n_w + n_e  # 74280

south_south = math.ceil((s_s * conference_sample_size)/total)
south_west = math.ceil((s_w * conference_sample_size)/total)
south_east = math.ceil((s_e * conference_sample_size)/total)
north_central = math.ceil((n_c * conference_sample_size)/total)
north_west = math.ceil((n_w * conference_sample_size)/total)
north_east = math.ceil((n_e * conference_sample_size)/total)

print(f"{south_south} students required from the South-South")
print(f"{south_west} students required from the South-West")
print(f"{south_east} students required from the South-East")
print(f"{north_central} students required from the North-Central")
print(f"{north_west} students required from the North-West")
print(f"{north_east} students required from the North-East")

100 students required from the South-South
90 students required from the South-West
84 students required from the South-East
38 students required from the North-Central
42 students required from the North-West
33 students required from the North-East


It is important to note that these are the minimum numbers, based on official statistics, required for a representative sample. However, the actual population is believed to be considerably larger than the calculated figure (reliable data on it are unavailable), so the number of respondents per zone should be well above these minimums.
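Because each zone is rounded up separately with `math.ceil`, the per-zone figures can sum to slightly more than the overall sample size. If an allocation that adds up to exactly 385 is wanted, one option is the largest-remainder method. The sketch below is an illustration under that assumption, not the notebook's method; `allocate_proportionally` is a hypothetical helper, and the zone populations are the hard-coded values from the cell above.

```python
def allocate_proportionally(strata_sizes, sample_size):
    """Split sample_size across strata in proportion to size (largest-remainder method).

    Hypothetical helper for illustration; not part of the original notebook.
    """
    total = sum(strata_sizes.values())
    shares = {}
    remainders = {}
    for name, size in strata_sizes.items():
        # Whole-number share and leftover, using exact integer arithmetic
        shares[name], remainders[name] = divmod(size * sample_size, total)
    # Hand out the units lost to rounding down, largest remainder first
    leftover = sample_size - sum(shares.values())
    for name in sorted(remainders, key=remainders.get, reverse=True)[:leftover]:
        shares[name] += 1
    return shares

zone_students = {
    "South-South": 19200,
    "South-West": 17280,
    "South-East": 16200,
    "North-Central": 7200,
    "North-West": 8100,
    "North-East": 6300,
}

allocation = allocate_proportionally(zone_students, 385)
for zone, n in allocation.items():
    print(f"{n} students required from the {zone}")
print(f"Total: {sum(allocation.values())}")  # Total: 385
```

Rounding every zone up is the more conservative choice and fits the aim of exceeding the minimum; the largest-remainder version is only useful when the total has to match the target exactly.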