# Texas Cosmetologist Violations

Texas has a system for [searching for license violations](https://www.tdlr.texas.gov/cimsfo/fosearch.asp). You're going to search for cosmetologists!

## Setup: Import what you'll need to scrape the page

We'll be using Selenium for this, *not* BeautifulSoup and requests.

In [1]:
from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.support.ui import Select
from selenium.webdriver.support.ui import WebDriverWait
import pandas as pd

## Starting your search

Starting from [here](https://www.tdlr.texas.gov/cimsfo/fosearch.asp), search for cosmetologist violations for people with the last name **Nguyen**.

In [2]:
driver = webdriver.Chrome()

In [3]:
driver.get('https://www.tdlr.texas.gov/cimsfo/fosearch.asp')

In [4]:
dropdown = Select(driver.find_element_by_name('pht_status'))
dropdown.select_by_visible_text('Cosmetologists')

In [5]:
last_name = driver.find_element_by_name('pht_lnm')
last_name.send_keys('Nguyen')

In [6]:
search = driver.find_element_by_name('B1')
driver.execute_script('arguments[0].scrollIntoView(true)', search)
search.click()

## Scraping

Once you are on the results page, work through the steps below.

### Loop through each result and print the entire row

Okay wait, that's a heck of a lot. Only print the first ten: `listname[:10]` gives you the first ten items of a list.
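As a quick illustration of slicing, here is a minimal sketch with a made-up list (`rows` and `short` are hypothetical, not the scraped data):

```python
# Hypothetical stand-in for a long list of scraped rows.
rows = [f"row {i}" for i in range(25)]

first_ten = rows[:10]          # items at positions 0 through 9
print(len(first_ten))
print(first_ten[0], first_ten[-1])

# Slicing does not raise an error when the list is shorter than the slice.
short = ["a", "b", "c"]
print(short[:10])
```

Because a slice never fails on a short list, `results[:10]` is safe even when a search returns fewer than ten rows.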

In [7]:
results = driver.find_elements_by_tag_name('tr')[1:]
for result in results[:10]:
    print('-------')
    print(result.text)

-------
NGUYEN, TOAN HUU
City: SAN ANTONIO
County: BEXAR
Zip Code: 78217


License #(s): 780948, 1706491, 1699123

Complaint # COS20180004289 Date: 5/30/2018

Respondent is assessed an administrative penalty in the amount of $500. Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
NGUYEN, HANH CONG
City: EL PASO
County: EL PASO
Zip Code: 79934


License #: 737708

Complaint # COS20180006594 Date: 5/30/2018

Respondent is assessed an administrative penalty in the amount of $1,000. Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to use items subject to possible cross contamination in a manner that does not contaminate the remaining product.
-------
NGUYEN, KHIEM VAN
City: LONGVIEW
County: GREGG
Zip Code: 75604


License #: 731665

Complaint # COS20180000257 Date: 5/17/2018

Respondent is assessed an administrative penalty in the amount of $1,250. Respondent failed to fol

### Loop through each result and print each person's name

You'll get an error because the first one doesn't have a name. How do you make that not happen?! If you want to ignore an error, you use code like this:

```python
try:
    # try to do something
    ...
except:
    print("It didn't work")
```

It should help you out. If you don't want to print anything, you can type `pass` instead of the `print` statement.
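A minimal sketch of that pattern, using plain lists in place of Selenium elements (the `rows` data here is made up for illustration). It catches `IndexError` specifically rather than every error, which is an assumption about what goes wrong; a bare `except:` would also work but can hide unrelated bugs.

```python
# Hypothetical rows: the first one is a header with nothing to index into.
rows = [
    [],                       # header row, no name cell
    ["NGUYEN, TOAN HUU"],
    ["NGUYEN, HANH CONG"],
]

names = []
for row in rows:
    try:
        names.append(row[0])  # raises IndexError on the empty header row
    except IndexError:
        pass                  # quietly skip rows that have no name

print(names)
```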

**Why doesn't the first one have a name?**

In [8]:
# Some cells contain info for more than one person.
# This checks how many span tags the first cell of each row holds.

# lengths = []
# for result in results:
#     cells = result.find_elements_by_tag_name('td')
#     spans = cells[0].find_elements_by_tag_name('span')
#     lengths.append(len(spans))
# print(list(set(lengths)))

# The output is [18, 11, 13] as of Jun 11, 2018.
# After a bit of digging, it turns out only one row has 13 spans. That row is shown below:

```
NGUYEN, KENNY KHANH

Company: CK NAILS LIC 745308  
City: LEANDER  
County: WILLIAMSON  
Zip Code: 78641  


License #: 745308

Complaint # COS20150021110 Date: 12/16/2015

Respondent is assessed an administrative penalty in the amount of $500. Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
```

In [9]:
# Apparently its first cell has 13 spans because of the extra 'Company' line.
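To see how common each layout is, rather than just which lengths occur, `collections.Counter` can tally the span counts. This is a minimal sketch, assuming a `lengths` list like the one built in the commented-out loop above; the numbers below are invented, not the real results.

```python
from collections import Counter

# Invented span counts, one per result row.
lengths = [11, 11, 18, 11, 13, 18, 11]

counts = Counter(lengths)
for n_spans, n_rows in counts.most_common():
    print(f"{n_spans} spans: {n_rows} row(s)")

# Layouts that appear only once are the ones worth inspecting by hand.
rare = [n for n, c in counts.items() if c == 1]
print(rare)
```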

In [10]:
for result in results:
    print('-------')
    cells = result.find_elements_by_tag_name('td')
    spans = cells[0].find_elements_by_tag_name('span')
    if len(spans) == 11 or len(spans) == 13:
        name = spans[0]
        print(name.text)
    elif len(spans) == 18:
        name1 = spans[0]
        name2 = spans[7]
        print(name1.text)
        print(name2.text)

-------
NGUYEN, TOAN HUU
-------
NGUYEN, HANH CONG
-------
NGUYEN, KHIEM VAN
-------
NGUYEN, DIEP THI NGOC
-------
NGUYEN, LAN T-THUY
NGUYEN, SAMLOI
-------
NGUYEN, TUAN A
NGUYEN, TUAN VAN
-------
NGUYEN, THAO B
-------
NGUYEN, BETH MARIA
-------
NGUYEN, TRUNG N
-------
NGUYEN, NGAT THI
-------
NGUYEN, KELLY PHUONG N
-------
NGUYEN, CHAU THI
-------
NGUYEN, XUAN T
-------
NGUYEN, THANH C
-------
NGUYEN, HAI
-------
NGUYEN, JENNIFER T
-------
NGUYEN, TONY VAN
-------
NGUYEN, HANH THAO TRAN
-------
NGUYEN, QUYEN THI MAI
-------
NGUYEN, OANH THI
-------
NGUYEN, THU NHU
-------
NGUYEN, PHUNG THI
-------
NGUYEN, TUAN
-------
NGUYEN, PHUOC BA
-------
NGUYEN, THAI VAN
-------
NGUYEN, JIMMY
-------
NGUYEN, QUI VAN
-------
NGUYEN, KIM LIEN TRAN
-------
NGUYEN, THUY HONG
-------
NGUYEN, TRANG YEN
-------
NGUYEN, BINH THANH
-------
NGUYEN, MUA THI
-------
NGUYEN, TRUNG H
-------
NGUYEN, TRANG N
-------
NGUYEN, PHUONG TUYET TH
-------
NGUYEN, KIM AN THI
-------
NGUYEN, DUC V
-------
NGUYEN, DUC VA

NGUYEN, HANH THI
-------
NGUYEN, THAO NGOC
-------
NGUYEN, ANTHONY V
-------
NGUYEN, QUYEN V
-------
NGUYEN, BILL
-------
NGUYEN, THAO P
-------
NGUYEN, THUY T
-------
NGUYEN, LIEN T
-------
NGUYEN, LOAN THI
-------
NGUYEN, MINH P
-------
NGUYEN, BINH VAN
-------
NGUYEN, HOA T
-------
NGUYEN, MONG THUY
-------
NGUYEN, KHUYEN THI
-------
NGUYEN, THUY KIM THI
-------
NGUYEN, THANH DUONG
-------
NGUYEN, NAM QUANG
-------
NGUYEN, MY THI
-------
NGUYEN, DANNY
-------
NGUYEN, ANH THI-TUYET
-------
NGUYEN, QUY KIM
-------
NGUYEN, KEVIN
-------
NGUYEN, ANDY
-------
NGUYEN, MARY VU
-------
NGUYEN, HIEU
-------
NGUYEN, CHINH THI
-------
NGUYEN, PHU XUAN
-------
NGUYEN, THU ANH T
-------
NGUYEN, SCOTT
-------
NGUYEN, DAM D
-------
NGUYEN, NGOC HIEU THI
-------
NGUYEN, HAU V
-------
NGUYEN, KHANH
-------
NGUYEN, DIEM THUY THI
-------
NGUYEN, HUY CAN MINH
-------
NGUYEN, VAN THI
-------
NGUYEN, KEVIN
-------
NGUYEN, TRAM B
-------
NGUYEN, OANH
-------
NGUYEN, LINH T
-------
NGUYEN, SON HOANG
------

### Loop through each result and print each violation description ("Basis for order")

> - *Tip: You'll get an error even if you're ALMOST right - which row is causing the problem?*
> - *Tip: You can get the HTML of something by doing `.get_attribute('innerHTML')` - it might help you diagnose your issue.*
> - *Tip: Or I guess you could just skip the one with the problem...*
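One way to find the row that causes the problem is to record the index of every row where the lookup fails. This is a minimal sketch with plain lists standing in for each row's `td` cells; the data is hypothetical.

```python
# Each inner list stands in for the cells of one result row.
rows = [
    ["name", "order", "basis A"],
    ["name", "order"],            # this row is missing its third cell
    ["name", "order", "basis C"],
]

problem_rows = []
for i, cells in enumerate(rows):
    try:
        basis = cells[2]
    except IndexError:
        problem_rows.append(i)    # remember which row broke
        continue
    print(basis)

print("Problem rows:", problem_rows)
```

Once you know the index, you can look at just that row (for example with `.get_attribute('innerHTML')`) instead of scrolling through all of them.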

In [11]:
for result in results:
    print('-------')
    basis = result.find_elements_by_tag_name('td')[2]
    print(basis.text)

-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to use items subject to possible cross contamination in a manner that does not contaminate the remaining product.
-------
Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required; Respondent failed to clean, disinfect, and sterilize manicure and pedicure implements after each use; Respondent failed to clean and disinfect all wax pots.
-------
Respondent failed to disinfect tools, implements, and supplies with an EPA-registered disinfectant solution; Respondent failed to disinfect multi-use equipment, implements, and tools prior to use on each client.
-------
Respondent failed to clean, disinfect, and sterilize manicure and pedicure implements after each use.
-------
Respondent failed to clean and disinfect all wax pots; Resp

Respondent failed to disinfect tools, implements, and supplies with an EPA-registered disinfectant solution.
-------
Respondent left applicators standing in the wax.
-------
Respondent performed or attempted to perform a practice of cosmetology without a license.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to clean and disinfect all wax pots.
-------
Respondent leased space in a salon to an individual who engaged in the practice of cosmetology but had not obtained a cosmetology license.
-------
Respondent performed or attempted to perform a practice of cosmetology without a license.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to clean, disinfect, and sterilize metal instruments with a Department-approved sterilizer.
-------
Respon

Respondent left applicators standing in the wax.
-------
Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required.
-------
Respondent practiced cosmetology services in an unlicensed beauty salon.
-------
Respondent performed or attempted to perform a practice of cosmetology without a license.
-------
Respondent engaged in fraud or deceit in obtaining a certificate, license, or permit.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if

Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
Respondent failed to comply with an order previously issued by the Executive Director.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent failed to keep floors, walls, ceilings, shelves, furniture, furnishings, and fixtures clean and in good repair.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent offered specialty cosmetology services outside the scope of Respondent's specialty salon license.
-------
Respondent failed to keep a record of the da

Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required; Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required bi-weekly; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to c

Respondent leased space in a salon to an individual who engaged in the practice of cosmetology but had not obtained a cosmetology license; Respondent failed to wash towels in hot water and chlorine bleach; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required bi-weekly.
-------
Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required bi-weekly; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent failed to wash towels in hot water and chlorine bleach.
-------
Respondent left applicators standing in the wax.
-------
Respondent performed or attempted to perform a practice of cosmetology with an expired license.
-------
Respondent failed to comply with an order previously issued by t

Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent offered specialty cosmetology services outside the scope of Respondent's specialty salon license.
-------
Respondent practiced cosmetology services in an unlicensed beauty salon.
-------
Respondent performed or attempted to perform a practice of cosmetology without a license.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to comply with an order previously issued by the Executive Director.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent possessed an electric drill other 

Respondent failed to clean diamond, carbide, natural and metal bits after each use with a brush or ultrasonic cleaner, or by immersing in acetone; Respondent failed to store in a clean, dry, debris-free environment, separate from soiled implements and materials, all cleaned and disinfected implements and materials when not in use.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent failed to clean and disinfect soiled waxing implements.
-------
Respondent performed cosmetology services outside the scope of their esthetician license.
-------
Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required bi-weekly; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required before use by each patron; R

Respondent failed to cooperate with the inspector in the performance of an inspection.
-------
Respondent operated a cosmetology salon with an expired license; Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required bi-weekly.
-------
Respondent failed to keep a record of the date and time of each 

Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent failed to clean and sanitize whirlpool foot spas as required before use by each patron.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used; Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required bi-weekly.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning and if the foot spa was not used.
-------
Respondent failed to prepare fresh disinfectant solution daily or more often as needed, for immersion of implements; Responde

Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning; Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day.
-------
Respondent left applicators standing in the wax.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning.
-------
Respondent failed to make cleaning and disinfectant records available upon request.
-------
Respondent leased space in a salon to an individual who engaged in the practice of cosmetology but had not obtained a cosmetology license; Respondent failed to follow whirlpool foot spas cleaning and sanitization procedures as required bi-weekly; Respondent failed to maintain the required cleaning and disinfecting records.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning; Respondent offered specialty cosmetology services outside the scope of Respondent's specialty salon 

Respondent failed to clean and sanitize whirlpool foot spas as required before use by each patron; Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning.
-------
Respondent operated a cosmetology salon without the appropriate license.
-------
Respondent operated a cosmetology salon without the appropriate license.
-------
Respondent leased space in a salon to an individual who engaged in the practice of cosmetology but had not obtained a cosmetology license.
-------
Respondent leased space in a salon to an individual who engaged in the practice of cosmetology but had not obtained a cosmetology license.
-------
Respondent failed to keep a record of the date and time of each foot spa daily or bi-weekly cleaning; Respondent failed to properly ventilate the salon to eliminate strong odors away from the public area.
-------
Respondent failed to clean and sanitize whirlpool foot spas as required at the end of each day; Respondent failed to keep 

### Loop through each result and print the complaint number

- Tip: Think about the order of the elements.
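Counting from the end of a list means you don't have to care how many items come before the one you want. This is a small sketch with simplified, hypothetical span texts (the real cells hold more spans than this; the values are taken from the output earlier in this notebook):

```python
# Simplified span texts for two cells of different lengths.
short_cell = ["NGUYEN, HANH CONG", "City:", "EL PASO",
              "License #:", "737708", "Complaint #", "COS20180006594"]
long_cell = ["NGUYEN, KENNY KHANH", "Company:", "CK NAILS LIC 745308",
             "City:", "LEANDER", "License #:", "745308",
             "Complaint #", "COS20150021110"]

for spans in (short_cell, long_cell):
    # The complaint number is the last item, whatever the list length.
    print(spans[-1])
```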

In [12]:
for result in results:
    complaint_num = result.find_elements_by_tag_name('td')[0].find_elements_by_tag_name('span')[-1]
    print(complaint_num.text)

COS20180004289
COS20180006594
COS20180000257
COS20180004915
COS20180009255
COS20140018343
COS20180008846
COS20180000897
COS20170023893
COS20180004076
COS20180004498
COS20180008220
COS20170009055
COS20180002334
COS20170019449
COS20170021681
COS20180004089
COS20180004300
COS20180004340
COS20180004475
COS20180004720
COS20180004864
COS20180006279
COS20180004329
COS20170020336
COS20180000630
COS20180003692
COS20180002266
COS20180003857
COS20180004081
COS20180005797
COS20170018997
COS20180002614
COS20180003845
COS20170008359
COS20180004075
COS20170021316
COS20170022035
COS20180004639
COS20170009421
COS20180003532
COS20180004016
COS20170022895
COS20180001881
COS20170017965
COS20180001081
COS20180001141
COS20180003707
COS20170014082
COS20180001604
COS20180000225
COS20170022848
COS20180002313
COS20180002216
COS20180000650
COS20180001594
COS20180002227
COS20170018593
COS20170019077
COS20170022385
COS20170022737
COS20180000914
COS20170008502
COS20170015324
COS20170020893
COS20170022810
COS2018000

## Saving the results

### Loop through each result to create a list of dictionaries

Each dictionary must contain:

- Person's name
- Violation description
- Violation number
- License Numbers
- Zip Code
- County
- City

Create a new dictionary for each result (except the header).

> *Tip: If you want to ask for the "next sibling," you can't use `find_next_sibling` in Selenium. Instead, use `element.find_element_by_xpath("following-sibling::div")` to find the next div, or `element.find_element_by_xpath("following-sibling::*")` to find the next anything.*
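If navigating by sibling or by position gets fiddly, another option is to parse the labelled lines of `result.text` instead. This is a swapped-in technique, not the XPath approach from the tip. The sketch assumes each row's text looks like the printed output earlier in this notebook and describes a single person; `parse_row_text` is a hypothetical helper, not part of Selenium.

```python
import re

def parse_row_text(text):
    """Pull labelled fields out of the text of a single-person result row."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    record = {"Name": lines[0]}  # assumes the name is the first non-empty line
    for line in lines[1:]:
        # Lines such as "City: SAN ANTONIO" or "License #(s): 780948, ..."
        field = re.match(r"(City|County|Zip Code|License #(?:\(s\))?):\s*(.*)", line)
        if field:
            label = field.group(1)
            if label.startswith("License"):
                label = "License numbers"
            record[label] = field.group(2)
        # Lines such as "Complaint # COS20180004289 Date: 5/30/2018"
        complaint = re.match(r"Complaint #\s*(\S+)", line)
        if complaint:
            record["Violation number"] = complaint.group(1)
    return record

sample = """NGUYEN, TOAN HUU
City: SAN ANTONIO
County: BEXAR
Zip Code: 78217

License #(s): 780948, 1706491, 1699123

Complaint # COS20180004289 Date: 5/30/2018
"""

record = parse_row_text(sample)
print(record)
```

Rows that list two people would need extra handling, since this sketch keeps only the last value it sees for each label.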

In [13]:
data = []

for result in results:
    cells = result.find_elements_by_tag_name('td')
    spans = cells[0].find_elements_by_tag_name('span')

    # Whatever kind of cell it is, these positions won't be affected:
    license_num = spans[-3].text
    complaint_num = spans[-1].text
    basis = cells[2].text

    # (name index, city index) for each person in the cell.
    # County and zip code sit 2 and 4 spans after the city.
    if len(spans) == 11:
        people = [(0, 2)]
    elif len(spans) == 13:
        people = [(0, 4)]          # the extra 'Company' spans shift the city by 2
    elif len(spans) == 18:
        people = [(0, 2), (7, 9)]  # two people share one cell
    else:
        people = []                # unknown layout: skip the row

    for name_idx, city_idx in people:
        data.append({
            'Name': spans[name_idx].text,
            'Violation description': basis,
            'Violation number': complaint_num,
            'License numbers': license_num,
            'Zip code': spans[city_idx + 4].text,
            'County': spans[city_idx + 2].text,
            'City': spans[city_idx].text
        })

### Save that to a CSV

- Tip: You'll want to use pandas here

In [14]:
df = pd.DataFrame(data)

In [15]:
df.to_csv('cosmetology-violations.csv', index=False)
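The `index=False` argument is what keeps the extra column out of the file. Here is a minimal sketch of the difference, using in-memory buffers and a made-up two-row table rather than the real data:

```python
import io
import pandas as pd

# Made-up two-row table standing in for the scraped data.
sample_df = pd.DataFrame([
    {"Name": "NGUYEN, TOAN HUU", "City": "SAN ANTONIO"},
    {"Name": "NGUYEN, HANH CONG", "City": "EL PASO"},
])

# Default: the row index is written as a first column with no name.
with_index = io.StringIO()
sample_df.to_csv(with_index)
with_index.seek(0)
cols_with_index = pd.read_csv(with_index).columns.tolist()
print(cols_with_index)

# index=False: only the real columns are written.
without_index = io.StringIO()
sample_df.to_csv(without_index, index=False)
without_index.seek(0)
cols_without_index = pd.read_csv(without_index).columns.tolist()
print(cols_without_index)
```

When the index is saved, pandas reads it back as a column called `Unnamed: 0`, which is the "weird unnamed column" to watch for.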

### Open the CSV file and examine the first few. Make sure you didn't save an extra weird unnamed column.

In [16]:
pd.read_csv('cosmetology-violations.csv').head(30)

| | City | County | License numbers | Name | Violation description | Violation number | Zip code |
|---|---|---|---|---|---|---|---|
| 0 | SAN ANTONIO | BEXAR | 780948, 1706491, 1699123 | NGUYEN, TOAN HUU | Respondent failed to clean and sanitize whirlp... | COS20180004289 | 78217 |
| 1 | EL PASO | EL PASO | 737708 | NGUYEN, HANH CONG | Respondent failed to clean and sanitize whirlp... | COS20180006594 | 79934 |
| 2 | LONGVIEW | GREGG | 731665 | NGUYEN, KHIEM VAN | Respondent failed to follow whirlpool foot spa... | COS20180000257 | 75604 |
| 3 | HOUSTON | HARRIS | 1347649, 760528 | NGUYEN, DIEP THI NGOC | Respondent failed to disinfect tools, implemen... | COS20180004915 | 77014 |
| 4 | SAN ANTONIO | BEXAR | 767339 | NGUYEN, LAN T-THUY | Respondent failed to clean, disinfect, and ste... | COS20180009255 | 78255 |
| 5 | SAN ANTONIO | BEXAR | 767339 | NGUYEN, SAMLOI | Respondent failed to clean, disinfect, and ste... | COS20180009255 | 78255 |
| 6 | AUSTIN | TRAVIS | 681274 | NGUYEN, TUAN A | Respondent failed to clean and disinfect all w... | COS20140018343 | 78723 |
| 7 | ARLINGTON | TARRANT | 681274 | NGUYEN, TUAN VAN | Respondent failed to clean and disinfect all w... | COS20140018343 | 76011 |
| 8 | EULESS | TARRANT | 721373, 1142884 | NGUYEN, THAO B | Respondent failed to clean and sanitize whirlp... | COS20180008846 | 76039 |
| 9 | HOUSTON | HARRIS | 1470271 | NGUYEN, BETH MARIA | The Respondent's license was revoked upon Resp... | COS20180000897 | 77083 |
