# Sample Workflow with quickerstats
---
Aaron Anderson <br>
https://github.com/anderaa/quickerstats

---

In this example, we demonstrate how to use the quickerstats package to search for and download data from the NASS Quick Stats database. Our goal is to produce a map that shows discrepancies between acres of corn harvested and acres of corn planted at the county level.

### 1. Installation and setup

Install the package from the `development` branch of the GitHub repo, then load it.

In [1]:
devtools::install_github('anderaa/quickerstats', force=TRUE, build_vignettes=TRUE, ref='development')
library('quickerstats')

Downloading GitHub repo anderaa/quickerstats@development

✔  checking for file ‘/private/var/folders/xb/3_n3p00j4_x83n2s7tpn95gm0000gn/T/RtmpPjEY9I/remotes2fa778a66e95/anderaa-quickerstats-1578aab/DESCRIPTION’
─  preparing ‘quickerstats’:
✔  checking DESCRIPTION meta-information
─  installing the package to build vignettes
✔  creating vignettes (1m 7.7s)
─  checking for LF line-endings in source and make files and shell scripts
─  checking for empty or unneeded directories
─  building ‘quickerstats_0.0.0.9002.tar.gz’


Now set up your NASS key. Go to https://quickstats.nass.usda.gov/api and request a key. I recommend storing your key as an environment variable. In RStudio, run `file.edit("~/.Renviron")` and add `NASS_KEY='your_nass_key'` to the file. Alternatively, open a terminal (macOS/Linux), run `nano ~/.Renviron`, and add the same line. Save with Ctrl+O followed by Enter, then exit with Ctrl+X. Restart R so that the new variable is picked up.

Now load the key into the current R session:

In [2]:
key <- Sys.getenv('NASS_KEY')
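
`Sys.getenv` returns an empty string when the variable is not set, so a missing key would only surface later as a failed request. The sketch below shows one way to fail early. The helper `load_nass_key` and the variable `NASS_KEY_DEMO` are hypothetical names used for illustration and are not part of quickerstats; the demonstration uses a temporary file and a dummy value rather than your real `~/.Renviron`.

```r
# Hypothetical helper (not part of quickerstats): read an API key from an
# environment variable and stop with a clear message if it is missing.
load_nass_key <- function(var = "NASS_KEY") {
  key <- Sys.getenv(var)
  if (!nzchar(key)) {
    stop("Environment variable ", var, " is not set. Add it to ~/.Renviron, ",
         "then restart R or call readRenviron('~/.Renviron').")
  }
  key
}

# Demonstration with a throwaway .Renviron-style file and a dummy value.
tmp <- tempfile()
writeLines("NASS_KEY_DEMO=not-a-real-key", tmp)
readRenviron(tmp)  # loads the variables in the file into the current session
demo_key <- load_nass_key("NASS_KEY_DEMO")
```

In a real session, `key <- load_nass_key()` would replace the plain `Sys.getenv` call.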

### 2. Searching for data 
We need to find data series that indicate the amounts of corn planted and harvested. To do this, we use the `search_data_items` function, passing the key and a vector of search terms.

In [12]:
r <- search_data_items(key, c('corn', 'harvested'))
print(r)

 [1] "CORN - ACRES HARVESTED"                                                           
 [2] "CORN - OPERATIONS WITH AREA HARVESTED"                                            
 [3] "CORN, FORAGE - ACRES HARVESTED"                                                   
 [4] "CORN, GRAIN - ACRES HARVESTED"                                                    
 [5] "CORN, GRAIN - OPERATIONS WITH AREA HARVESTED"                                     
 [6] "CORN, GRAIN - PROGRESS, 5 YEAR AVG, MEASURED IN PCT HARVESTED"                    
 [7] "CORN, GRAIN - PROGRESS, MEASURED IN PCT HARVESTED"                                
 [8] "CORN, GRAIN - PROGRESS, PREVIOUS YEAR, MEASURED IN PCT HARVESTED"                 
 [9] "CORN, GRAIN, IRRIGATED - ACRES HARVESTED"                                         
[10] "CORN, GRAIN, IRRIGATED - AREA HARVESTED, MEASURED IN PCT BY METHOD"               
[11] "CORN, GRAIN, IRRIGATED - AREA HARVESTED, MEASURED IN PCT OF OPERATIONS BY METHOD" 
[12] "CORN, GRAIN, IR

It looks like the fourth result, "CORN, GRAIN - ACRES HARVESTED", is the one we want.

In [15]:
data_item <- r[4]
print(data_item)

[1] "CORN, GRAIN - ACRES HARVESTED"
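
Selecting a result by position depends on the order in which the search returns items. As a minimal sketch, the item can instead be looked up by name, or the results narrowed with a pattern. The vector `r_demo` below is a hand-copied excerpt of the output above, used so the example runs without an API key; the names `r_demo`, `target`, and `acres_items` are illustrative.

```r
# First five entries of the search output shown above, copied by hand.
r_demo <- c(
  "CORN - ACRES HARVESTED",
  "CORN - OPERATIONS WITH AREA HARVESTED",
  "CORN, FORAGE - ACRES HARVESTED",
  "CORN, GRAIN - ACRES HARVESTED",
  "CORN, GRAIN - OPERATIONS WITH AREA HARVESTED"
)

# Look up the item by its exact name instead of a hard-coded index.
target <- "CORN, GRAIN - ACRES HARVESTED"
idx <- match(target, r_demo)
if (is.na(idx)) stop("Data item not found: ", target)
data_item_demo <- r_demo[idx]

# Alternatively, keep only the items whose name ends in "- ACRES HARVESTED".
acres_items <- grep("- ACRES HARVESTED$", r_demo, value = TRUE)
```

With the real search results, `r` would take the place of `r_demo`.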


Now, let's get the options associated with this data item. We need to make sure that it is available for a recent year and at the county level.

In [16]:
df <- get_options(key, data_item)

Retrieving options...this may take a minute...


In [17]:
df

| source_desc | year | agg_level_desc | domain_desc |
|---|---|---|---|
| `<chr>` | `<chr>` | `<chr>` | `<chr>` |
| CENSUS | 1997 | COUNTY | TOTAL |
| CENSUS | 1997 | STATE | AREA HARVESTED |
| CENSUS | 1997 | STATE | TOTAL |
| CENSUS | 2002 | COUNTY | TOTAL |
| CENSUS | 2002 | STATE | AREA HARVESTED |
| CENSUS | 2002 | STATE | TOTAL |
| CENSUS | 2007 | COUNTY | TOTAL |
| CENSUS | 2007 | STATE | AREA HARVESTED |
| CENSUS | 2007 | STATE | TOTAL |
| CENSUS | 2012 | COUNTY | OPERATORS |
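
To check county-level availability programmatically rather than by eye, the options table can be filtered with base R. The sketch below rebuilds the ten rows shown above as `opts_demo` (an illustrative name) so that it runs without an API key; it assumes the real result has the same four character columns.

```r
# The ten rows of the options output shown above, rebuilt by hand.
opts_demo <- data.frame(
  source_desc    = rep("CENSUS", 10),
  year           = c(rep(c("1997", "2002", "2007"), each = 3), "2012"),
  agg_level_desc = c(rep(c("COUNTY", "STATE", "STATE"), 3), "COUNTY"),
  domain_desc    = c(rep(c("TOTAL", "AREA HARVESTED", "TOTAL"), 3), "OPERATORS"),
  stringsAsFactors = FALSE
)

# Keep county-level rows; year is stored as character, so convert before
# sorting or taking the maximum.
county_rows <- opts_demo[opts_demo$agg_level_desc == "COUNTY", ]
county_years <- sort(unique(as.integer(county_rows$year)))
latest_county_year <- max(county_years)
```

Applied to the real `df`, the same filter lists every year for which county-level data exist.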


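We repeat the process for acres planted. The cell below was run earlier in the session (note the cell numbers), when `r` presumably held the results of a search for planted-acreage items, so index 22 refers to that earlier result vector rather than to the harvested search above.

In [8]:
data_item <- r[22]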
print(data_item)

[1] "CORN, IRRIGATED - ACRES PLANTED"


In [9]:
get_options(key, data_item)

Retrieving options...this may take a minute...


| source_desc | year | agg_level_desc | domain_desc |
|---|---|---|---|
| `<chr>` | `<chr>` | `<chr>` | `<chr>` |
| SURVEY | 1997 | COUNTY | TOTAL |
| SURVEY | 1997 | STATE | TOTAL |
| SURVEY | 1998 | COUNTY | TOTAL |
| SURVEY | 1998 | STATE | TOTAL |
| SURVEY | 1999 | COUNTY | TOTAL |
| SURVEY | 1999 | STATE | TOTAL |
| SURVEY | 2000 | COUNTY | TOTAL |
| SURVEY | 2000 | STATE | TOTAL |
| SURVEY | 2001 | COUNTY | TOTAL |
| SURVEY | 2001 | STATE | TOTAL |

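
Because the eventual comparison needs both series for the same year, it can help to compute the county-level years that two option tables share. The following is a rough sketch under stated assumptions: `county_years_of` is a hypothetical helper, and the two data frames contain only the rows visible in the excerpts above (which cover different data items and show only ten rows each), so the overlap in the full tables may differ.

```r
# Hypothetical helper: county-level years available in an options table.
county_years_of <- function(opts) {
  years <- opts$year[opts$agg_level_desc == "COUNTY"]
  sort(unique(as.integer(years)))
}

# Year and aggregation level from the harvested excerpt shown earlier.
harvested_demo <- data.frame(
  year           = c(rep(c("1997", "2002", "2007"), each = 3), "2012"),
  agg_level_desc = c(rep(c("COUNTY", "STATE", "STATE"), 3), "COUNTY"),
  stringsAsFactors = FALSE
)

# Year and aggregation level from the planted excerpt shown above.
planted_demo <- data.frame(
  year           = rep(as.character(1997:2001), each = 2),
  agg_level_desc = rep(c("COUNTY", "STATE"), 5),
  stringsAsFactors = FALSE
)

# Years with county-level rows in both excerpts.
common_years <- intersect(county_years_of(harvested_demo),
                          county_years_of(planted_demo))
```

With the full outputs of `get_options` for both items, `common_years` would list the candidate years for the map.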