# Searching datasets


erddapy wraps ERDDAP's form-like search through the *search_for* keyword of `get_search_url`.

In [1]:
from erddapy import ERDDAP


e = ERDDAP(
    server="https://upwell.pfeg.noaa.gov/erddap",
    protocol="griddap"
)

Single word search.

In [2]:
import pandas as pd

search_for = "HFRadar"

url = e.get_search_url(search_for=search_for, response="csv")

pd.read_csv(url)["Dataset ID"]

0     erdC125h_LonPM180
1              erdC125h
2              erdC1nma
3              erdC1sfc
4              erdCA25h
5              erdCAnma
6              erdCAsfc
7              erdCB25h
8              erdCBsfc
9              erdCM25h
10             erdCMnma
11             erdCMsfc
12             erdCS25h
13             erdCSsfc
14    erdC1nma_LonPM180
15    erdC1sfc_LonPM180
16    erdCA25h_LonPM180
17    erdCAsfc_LonPM180
18    erdCM25h_LonPM180
19    erdCMsfc_LonPM180
20    erdCS25h_LonPM180
21    erdCSsfc_LonPM180
22    erdCAnma_LonPM180
23    erdCMnma_LonPM180
24    erdCB25h_LonPM180
25    erdCBsfc_LonPM180
26            ucsdHfrA6
27            ucsdHfrH1
28            ucsdHfrP2
29            ucsdHfrW1
30            ucsdHfrW2
31          ucsdHfrW500
32            ucsdHfrW6
33            ucsdHfrE1
34            ucsdHfrE2
35            ucsdHfrE6
36            ucsdHfrP6
Name: Dataset ID, dtype: object
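
To make the role of *search_for* more concrete, here is a minimal sketch of how a search string can be turned into a query URL using only the standard library. The helper `simple_search_url` is hypothetical and not part of erddapy, and the `search/index` path and the `page`, `itemsPerPage`, and `searchFor` parameter names are assumptions about ERDDAP's simple search endpoint. The URL that `get_search_url` actually returns may use a different endpoint and carry more parameters.

```python
from urllib.parse import quote, urlencode


def simple_search_url(server, search_for, response="csv", items_per_page=1000):
    """Build a full-text search URL (hypothetical helper, not erddapy's API)."""
    # Assumed parameter names: page, itemsPerPage, searchFor.
    query = urlencode(
        {"page": 1, "itemsPerPage": items_per_page, "searchFor": search_for},
        quote_via=quote,  # encode spaces as %20 instead of "+"
    )
    return f"{server}/search/index.{response}?{query}"


url = simple_search_url("https://upwell.pfeg.noaa.gov/erddap", "HFRadar")
print(url)
```

The point of the sketch is only that the search string is percent-encoded and passed as a single query parameter, so spaces, quotes, and a leading `-` all travel to the server inside that one value.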

Filtering the search with extra words.

In [3]:
search_for = "HFRadar 2km"

url = e.get_search_url(search_for=search_for, response="csv")

pd.read_csv(url)["Dataset ID"]

0    ucsdHfrP2
1    ucsdHfrW2
2    ucsdHfrE2
Name: Dataset ID, dtype: object

Filtering the search with words that should **not** be found.

In [4]:
search_for = "HFRadar -EXPERIMENTAL"

url = e.get_search_url(search_for=search_for, response="csv")

pd.read_csv(url)["Dataset ID"]

0       ucsdHfrA6
1       ucsdHfrH1
2       ucsdHfrP2
3       ucsdHfrW1
4       ucsdHfrW2
5     ucsdHfrW500
6       ucsdHfrW6
7       ucsdHfrE1
8       ucsdHfrE2
9       ucsdHfrE6
10      ucsdHfrP6
Name: Dataset ID, dtype: object
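
To see which datasets an exclusion removed, one option is to compare the ID lists from two searches. The sketch below uses a hypothetical helper, `dropped_ids`, and a handful of IDs copied from the outputs above rather than a live query. The same function should also work on the `pd.read_csv(url)["Dataset ID"]` columns directly, since it only iterates over its arguments.

```python
def dropped_ids(before, after):
    """Return the IDs present in `before` but missing from `after`, in order."""
    kept = set(after)
    return [dataset_id for dataset_id in before if dataset_id not in kept]


# A few IDs taken from the "HFRadar" and "HFRadar -EXPERIMENTAL" outputs above.
before = ["erdC125h", "erdCA25h", "ucsdHfrA6", "ucsdHfrP2"]
after = ["ucsdHfrA6", "ucsdHfrP2"]

removed = dropped_ids(before, after)
print(removed)
```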

Quoted search, or "phrase search". First, let us try the unquoted search.

In [5]:
search_for = "wind speed"

url = e.get_search_url(search_for=search_for, response="csv")

len(pd.read_csv(url)["Dataset ID"])

504

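Too many datasets are returned because the unquoted search treats wind and speed as separate words, so any dataset that mentions both matches, not only those containing the phrase wind speed.
Now let's use the quoted search to restrict the results to the exact phrase wind speed.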

In [6]:
search_for = '"wind speed"'

url = e.get_search_url(search_for=search_for, response="csv")

len(pd.read_csv(url)["Dataset ID"])

483
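
The three behaviours shown in this notebook (extra words narrow the search, a leading `-` excludes a word, and double quotes force a phrase match) can be mimicked locally with a toy matcher. This is only an illustrative model under stated assumptions, not ERDDAP's implementation: the function `matches` is hypothetical, it assumes case-insensitive substring matching, and the two sample descriptions are made up for the example.

```python
import shlex


def matches(search_for, text):
    """Toy model: every term must appear in `text`; '-term' must not appear."""
    haystack = text.lower()
    # shlex.split keeps a double-quoted phrase together as one term.
    for term in shlex.split(search_for):
        excluded = term.startswith("-")
        needle = (term[1:] if excluded else term).lower()
        if not needle:
            continue  # ignore a bare "-"
        found = needle in haystack
        if found == excluded:
            # Either a required term is missing or an excluded term is present.
            return False
    return True


# Hypothetical descriptions, not taken from any real dataset.
phrase_text = "Wind speed at 10 m"
scattered_text = "Speed of the wind-driven current"

print(matches("wind speed", phrase_text), matches("wind speed", scattered_text))
print(matches('"wind speed"', phrase_text), matches('"wind speed"', scattered_text))
print(matches("wind -current", scattered_text))
```

In this model both descriptions satisfy the unquoted search, while only the first satisfies the quoted one, which mirrors the drop from 504 to 483 results seen above.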