## HSI school stats (physical sciences)

In [1]:
import pandas as pd
import numpy as np
import glob
import os
import matplotlib.pyplot as plt
import completions_module as cmodule

This notebook tabulates `Physical Sciences` degree completions at Hispanic-Serving Institutions (HSIs) for 2009-2019. Each table ranks institutions and shows the top 30.

- Table 1: Bachelor's / Hispanic + Latinx (total sum for year range)
- Table 2: Bachelor's / Hispanic + Latinx (annual mean for year range)
- Table 3: Bachelor's / Grand total (total sum for year range)
- Table 4: Bachelor's / Grand total (annual mean for year range)

- Table 5: Master's / Hispanic + Latinx (total sum for year range)
- Table 6: Master's / Hispanic + Latinx (annual mean for year range)
- Table 7: Master's / Grand total (total sum for year range)
- Table 8: Master's / Grand total (annual mean for year range)

- Table 9: Doctorate / Hispanic + Latinx (total sum for year range)
- Table 10: Doctorate / Hispanic + Latinx (annual mean for year range)
- Table 11: Doctorate / Grand total (total sum for year range)
- Table 12: Doctorate / Grand total (annual mean for year range)
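
All twelve tables follow one pattern: group the filtered rows by institution, aggregate a single count column, sort in descending order, and keep the top rows. The sketch below shows that pattern on toy data. `top_institutions` is a hypothetical helper, not part of `completions_module`, and the toy values are invented for illustration; the degree-level and field filters from `completions_module` are not reproduced here.

```python
import pandas as pd


def top_institutions(df, column, how="sum", n=30):
    """Rank institutions by an aggregated count column (hypothetical helper)."""
    # Select the column before aggregating so only that column is processed.
    ranked = df.groupby("institution name")[column].agg(how)
    return ranked.sort_values(ascending=False).head(n)


# Toy data, invented for illustration only.
toy = pd.DataFrame({
    "institution name": ["A", "A", "B", "B", "C"],
    "Grand total": [10, 20, 5, 5, 40],
})

totals = top_institutions(toy, "Grand total", how="sum", n=2)
means = top_institutions(toy, "Grand total", how="mean", n=2)
print(totals)
print(means)
```

Passing `how="sum"` or `how="mean"` switches between the two table types, and `column` switches between `'Hispanic or Latino total'` and `'Grand total'`.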


_Author: Maria J. Molina (molina@ucar.edu)_

**First, select the HSIs that appear in the UCAR-provided list of institutions**

In [21]:
hsi = pd.read_excel('../data/HACU-2019-20-Hispanic-Serving-Institutions.xlsx', engine='openpyxl')
hsi_uni_list = hsi['Name'].unique()

all_files = glob.glob("../data/completions_*.csv")
all_files = sorted(all_files)
df = cmodule.open_and_concat(all_files)
ucar_uni_list = df['institution name'].unique()

ucar_list_hsi = ucar_uni_list[np.isin(ucar_uni_list, hsi_uni_list)]
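
`np.isin` matches names exactly, so an institution is missed if its name differs between the two sources in case or spacing. The sketch below compares exact matching with a looser match on toy names. The `normalize` rule is an assumption chosen for illustration, and the names are invented; whether the real files need this cleanup is not known from this notebook.

```python
import numpy as np


def normalize(name):
    """Lowercase, trim, and collapse internal whitespace (an assumed cleanup rule)."""
    return " ".join(str(name).lower().split())


# Toy name lists, invented for illustration only.
completions_names = np.array(
    ["University of Houston", "Texas Tech University", "Some College"]
)
hsi_names = np.array(
    ["University of Houston", "texas tech  university", "Another College"]
)

# Exact matching, as in the cell above.
exact = completions_names[np.isin(completions_names, hsi_names)].tolist()

# Looser matching on normalized names.
normalized_hsi = {normalize(n) for n in hsi_names.tolist()}
loose = [n for n in completions_names.tolist() if normalize(n) in normalized_hsi]

# Names that only the looser comparison finds.
missed_by_exact = sorted(set(loose) - set(exact))
print(missed_by_exact)
```

An empty `missed_by_exact` on the real lists would suggest that exact matching is sufficient.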

In [26]:
df_ucar_hsi = df.loc[df['institution name'].isin(ucar_list_hsi)]

**Next, load the HSIs that are not in the UCAR-provided list**

In [28]:
all_files = glob.glob("../data/hsi_completions_*.csv")  # HSI list
all_files = sorted(all_files)
df_hsi = cmodule.open_and_concat(all_files)

**Finally, concatenate the two sets of institutions**

In [36]:
all_hsi_unis = pd.concat([df_ucar_hsi, df_hsi])
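
`pd.concat` stacks the two frames without checking for overlap, so any institution present in both sources would be counted twice in the sums. The sketch below shows one way to check for that on toy frames. The `year` column name is an assumption, and the values are invented for illustration.

```python
import pandas as pd

# Toy frames, invented for illustration; "year" is an assumed column name.
df_a = pd.DataFrame({
    "institution name": ["A", "A", "B"],
    "year": [2018, 2019, 2019],
    "Grand total": [10, 12, 7],
})
df_b = pd.DataFrame({
    "institution name": ["B", "C"],
    "year": [2019, 2019],
    "Grand total": [7, 3],
})

combined = pd.concat([df_a, df_b], ignore_index=True)

# Institutions that appear in both sources.
overlap = sorted(set(df_a["institution name"]) & set(df_b["institution name"]))

# Rows that are exact copies of an earlier row.
n_duplicates = int(combined.duplicated().sum())

# Keep the first copy of each repeated row.
deduplicated = combined.drop_duplicates(ignore_index=True)
print(overlap, n_duplicates, len(deduplicated))
```

If `overlap` is empty for the real data, the concatenation is safe as written.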

### Top 30 US institutions: Hispanic/Latinx bachelor's degree recipients in the Physical Sciences (2009-2019)

In [39]:
df = cmodule.bs_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Hispanic or Latino total'].sum().sort_values(ascending=False).head(30)  # total sum (2009-2019)

institution name
Florida International University                       876
University of Puerto Rico-Mayaguez                     692
University of Puerto Rico-Rio Piedras                  563
The University of Texas at El Paso                     365
The University of Texas Rio Grande Valley              350
University of Puerto Rico-Humacao                      330
University of California-Irvine                        323
University of California-Santa Barbara                 311
University of California-Santa Cruz                    310
University of Houston                                  302
California State University-San Bernardino             289
University of New Mexico-Main Campus                   272
The University of Texas at San Antonio                 265
University of Arizona                                  238
California State Polytechnic University-Pomona         230
University of Puerto Rico-Cayey                        225
University of California-Riverside     

In [40]:
df = cmodule.bs_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Hispanic or Latino total'].mean().sort_values(ascending=False).head(30)  # annual mean (2009-2019)

institution name
University of Puerto Rico-Mayaguez                     62.909091
Florida International University                       46.105263
University of Puerto Rico-Rio Piedras                  40.214286
University of Puerto Rico-Humacao                      30.000000
University of Houston                                  27.454545
University of Puerto Rico-Cayey                        20.454545
The University of Texas at El Paso                     18.250000
Pontifical Catholic University of Puerto Rico-Ponce    17.636364
The University of Texas Rio Grande Valley              15.909091
University of California-Santa Cruz                    15.500000
University of California-Irvine                        14.681818
University of California-Santa Barbara                 14.136364
Inter American University of Puerto Rico-San German    14.090909
The University of Texas at San Antonio                 13.947368
California State University-San Bernardino             13.136364
Universi
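
`groupby(...).mean()` averages over rows, so it equals an annual mean only when each institution contributes exactly one row per year. Dividing a total by the corresponding mean recovers the row count: 876 / 46.105263 gives 19 rows for Florida International University, while 692 / 62.909091 gives 11 for University of Puerto Rico-Mayaguez. The sketch below shows one way to compute a mean over years instead, by summing within each year first. The `year` column name is an assumption, and the toy values are invented for illustration.

```python
import pandas as pd

# Toy data, invented for illustration; "year" is an assumed column name.
toy = pd.DataFrame({
    "institution name": ["A", "A", "A", "B", "B"],
    "year": [2018, 2018, 2019, 2018, 2019],
    "Hispanic or Latino total": [4, 6, 10, 3, 5],
})
col = "Hispanic or Latino total"

# Mean over rows: A has three rows, so its mean is (4 + 6 + 10) / 3.
row_mean = toy.groupby("institution name")[col].mean()

# Mean over years: sum within each year first, then average the yearly totals.
yearly = toy.groupby(["institution name", "year"])[col].sum()
annual_mean = yearly.groupby(level="institution name").mean()
print(row_mean)
print(annual_mean)
```

The two results agree for institution B, which has one row per year, and differ for institution A, which has two rows in 2018.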

### Top 30 US institutions: Grand total bachelor's degree recipients in the Physical Sciences (2009-2019)

In [41]:
df = cmodule.bs_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Grand total'].sum().sort_values(ascending=False).head(30)  # total sum (2009-2019)

institution name
University of California-Irvine                   2174
University of California-Santa Barbara            2090
CUNY Hunter College                               1772
University of Arizona                             1727
University of California-Santa Cruz               1666
University of Houston                             1413
Northern Arizona University                       1274
Florida International University                  1273
San Diego State University                         890
California State Polytechnic University-Pomona     857
University of California-Riverside                 822
San Jose State University                          816
Texas Tech University                              815
University of New Mexico-Main Campus               815
California State University-Sacramento             770
University of Illinois Chicago                     767
California State University-San Bernardino         734
California State University-Long Beach          

In [42]:
df = cmodule.bs_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Grand total'].mean().sort_values(ascending=False).head(30)  # annual mean (2009-2019)

institution name
University of Houston                             128.454545
University of California-Irvine                    98.818182
University of California-Santa Barbara             95.000000
CUNY Hunter College                                84.380952
University of California-Santa Cruz                83.300000
University of Arizona                              78.500000
Florida International University                   67.000000
University of Puerto Rico-Mayaguez                 64.363636
CUNY Graduate School and University Center         60.250000
Northern Arizona University                        57.909091
The University of Texas at Arlington               51.307692
San Jose State University                          45.333333
Texas Tech University                              45.277778
University of Puerto Rico-Rio Piedras              45.142857
University of Central Florida                      45.090909
San Diego State University                         44.500000
Univers

### Top 30 US institutions: Hispanic/Latinx master's degree recipients in the Physical Sciences (2009-2019)

In [43]:
df = cmodule.ms_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Hispanic or Latino total'].sum().sort_values(ascending=False).head(30)  # total sum (2009-2019)

institution name
California State University-Los Angeles          89
University of Puerto Rico-Mayaguez               87
The University of Texas at El Paso               81
University of Arizona                            70
CUNY Graduate School and University Center       58
Florida International University                 57
University of Houston                            54
The University of Texas Rio Grande Valley        50
University of California-Riverside               48
California State University-Long Beach           43
California State University-Fresno               40
University of New Mexico-Main Campus             36
The University of Texas at San Antonio           35
University of Puerto Rico-Rio Piedras            34
University of California-Santa Cruz              34
San Diego State University                       33
University of California-Irvine                  33
California State University-Northridge           30
University of California-Santa Barbara         

In [44]:
df = cmodule.ms_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Hispanic or Latino total'].mean().sort_values(ascending=False).head(30)  # annual mean (2009-2019)

institution name
California State University-Los Angeles          8.090909
University of Puerto Rico-Mayaguez               7.909091
The University of Texas at El Paso               7.363636
University of Arizona                            6.363636
CUNY Graduate School and University Center       5.800000
Florida International University                 5.181818
University of Houston                            4.909091
The University of Texas Rio Grande Valley        4.545455
University of California-Riverside               4.363636
California State University-Fresno               3.636364
University of New Mexico-Main Campus             3.272727
The University of Texas at San Antonio           3.181818
University of Puerto Rico-Rio Piedras            3.090909
University of California-Santa Cruz              3.090909
San Diego State University                       3.000000
University of California-Irvine                  3.000000
California State University-Long Beach           2.8666

### Top 30 US institutions: Grand total master's degree recipients in the Physical Sciences (2009-2019)

In [45]:
df = cmodule.ms_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Grand total'].sum().sort_values(ascending=False).head(30)  # total sum (2009-2019)

institution name
University of Arizona                            1065
University of Houston                             777
University of California-Irvine                   492
University of California-Riverside                444
University of California-Santa Barbara            436
University of New Mexico-Main Campus              415
CUNY Graduate School and University Center        353
San Diego State University                        352
University of California-Santa Cruz               348
Texas Tech University                             326
California State University-Los Angeles           287
California State University-Long Beach            278
San Francisco State University                    274
New Mexico State University-Main Campus           247
The University of Texas at El Paso                244
University of Illinois Chicago                    238
Northern Arizona University                       220
The University of Texas at San Antonio            213
San Jose St

In [46]:
df = cmodule.ms_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Grand total'].mean().sort_values(ascending=False).head(30)  # annual mean (2009-2019)

institution name
University of Arizona                            96.818182
University of Houston                            70.636364
University of California-Irvine                  44.727273
University of California-Riverside               40.363636
University of California-Santa Barbara           39.636364
University of New Mexico-Main Campus             37.727273
CUNY Graduate School and University Center       35.300000
San Diego State University                       32.000000
University of California-Santa Cruz              31.636364
Texas Tech University                            29.636364
California State University-Los Angeles          26.090909
San Francisco State University                   24.909091
New Mexico State University-Main Campus          22.454545
The University of Texas at El Paso               22.181818
University of Illinois Chicago                   21.636364
Northern Arizona University                      20.000000
The University of Texas at San Antonio 

### Top 30 US institutions: Hispanic/Latinx doctoral degree recipients in the Physical Sciences (2009-2019)

In [47]:
df = cmodule.phd_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Hispanic or Latino total'].sum().sort_values(ascending=False).head(30)  # total sum (2009-2019)

institution name
University of Puerto Rico-Rio Piedras                           101
Florida International University                                 46
CUNY Graduate School and University Center                       44
University of Puerto Rico-Mayaguez                               44
University of California-Santa Cruz                              40
University of California-Irvine                                  34
University of Arizona                                            28
University of California-Riverside                               27
University of California-Santa Barbara                           23
The University of Texas at El Paso                               22
University of Central Florida                                    21
University of New Mexico-Main Campus                             15
University of Houston                                            12
University of North Texas                                        12
The University of Texas at San 

In [48]:
df = cmodule.phd_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Hispanic or Latino total'].mean().sort_values(ascending=False).head(30)  # annual mean (2009-2019)

institution name
University of Puerto Rico-Rio Piedras                           8.416667
Florida International University                                4.181818
CUNY Graduate School and University Center                      4.000000
University of Puerto Rico-Mayaguez                              4.000000
University of California-Santa Cruz                             3.636364
University of California-Irvine                                 3.090909
University of Arizona                                           2.545455
University of California-Riverside                              2.454545
University of California-Santa Barbara                          2.090909
The University of Texas at El Paso                              2.000000
University of Central Florida                                   1.909091
University of New Mexico-Main Campus                            1.363636
University of Houston                                           1.090909
University of North Texas         

### Top 30 US institutions: Grand total doctoral degree recipients in the Physical Sciences (2009-2019)

In [49]:
df = cmodule.phd_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Grand total'].sum().sort_values(ascending=False).head(30)  # total sum (2009-2019)

institution name
University of Arizona                                           912
University of California-Santa Barbara                          741
University of California-Irvine                                 697
University of Houston                                           482
CUNY Graduate School and University Center                      452
University of California-Riverside                              443
University of California-Santa Cruz                             437
University of New Mexico-Main Campus                            326
University of Illinois Chicago                                  272
Florida International University                                248
University of Central Florida                                   247
University of North Texas                                       193
Texas Tech University                                           188
The University of Texas at Arlington                            169
New Mexico State University-Mai

In [50]:
df = cmodule.phd_degrees(all_hsi_unis)
df = cmodule.physical_sciences(df)
df.groupby('institution name')['Grand total'].mean().sort_values(ascending=False).head(30)  # annual mean (2009-2019)

institution name
University of Arizona                                           82.909091
University of California-Santa Barbara                          67.363636
University of California-Irvine                                 63.363636
University of Houston                                           43.818182
CUNY Graduate School and University Center                      41.090909
University of California-Riverside                              40.272727
University of California-Santa Cruz                             39.727273
University of New Mexico-Main Campus                            29.636364
University of Illinois Chicago                                  24.727273
Florida International University                                22.545455
University of Central Florida                                   22.454545
University of North Texas                                       17.545455
Texas Tech University                                           17.090909
The University of Tex