# Searching datasets


erddapy wraps ERDDAP's form-like search through the *search_for* keyword of `get_search_url`.

In [1]:
from erddapy import ERDDAP


e = ERDDAP(
    server="https://upwell.pfeg.noaa.gov/erddap",
    protocol="griddap"
)

Single word search.

In [2]:
import pandas as pd

search_for = "HFRadar"

url = e.get_search_url(search_for=search_for, response="csv")

pd.read_csv(url)["Dataset ID"]

0     erdC1nma_LonPM180
1     erdCAsfc_LonPM180
2     erdCS25h_LonPM180
3     erdCM25h_LonPM180
4              erdC125h
5              erdC1nma
6              erdC1sfc
7              erdCA25h
8              erdCAnma
9              erdCAsfc
10             erdCB25h
11             erdCBsfc
12             erdCM25h
13             erdCMnma
14             erdCMsfc
15             erdCS25h
16             erdCSsfc
17    erdC125h_LonPM180
18    erdC1sfc_LonPM180
19    erdCA25h_LonPM180
20    erdCAnma_LonPM180
21    erdCB25h_LonPM180
22    erdCBsfc_LonPM180
23    erdCMnma_LonPM180
24    erdCMsfc_LonPM180
25    erdCSsfc_LonPM180
26            ucsdHfrA6
27            ucsdHfrH1
28            ucsdHfrE2
29            ucsdHfrE6
30          ucsdHfrW500
31            ucsdHfrW1
32            ucsdHfrW2
33            ucsdHfrE1
34            ucsdHfrP2
35            ucsdHfrW6
36            ucsdHfrP6
Name: Dataset ID, dtype: object

Filtering the search with extra words: each added word narrows the results.

In [3]:
search_for = "HFRadar 2km"

url = e.get_search_url(search_for=search_for, response="csv")

pd.read_csv(url)["Dataset ID"]

0    ucsdHfrE2
1    ucsdHfrW2
2    ucsdHfrP2
Name: Dataset ID, dtype: object
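The search string ends up as a query parameter in the URL, so characters such as spaces have to be percent-encoded. The sketch below uses only the standard library to show what that encoding looks like for the two search strings used so far. It is an illustration of URL encoding in general; whether erddapy uses this exact function internally is an assumption.

```python
from urllib.parse import quote_plus

# Search strings taken from the cells above.
for search_for in ("HFRadar", "HFRadar 2km"):
    # quote_plus replaces spaces with "+" and escapes reserved characters.
    print(f"{search_for!r} -> {quote_plus(search_for)}")
```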

Filtering the search with words that should **not** be found: prefix each unwanted word with `-`.

In [4]:
search_for = "HFRadar -EXPERIMENTAL"

url = e.get_search_url(search_for=search_for, response="csv")

pd.read_csv(url)["Dataset ID"]

0       ucsdHfrA6
1       ucsdHfrH1
2       ucsdHfrE2
3       ucsdHfrE6
4     ucsdHfrW500
5       ucsdHfrW1
6       ucsdHfrW2
7       ucsdHfrE1
8       ucsdHfrP2
9       ucsdHfrW6
10      ucsdHfrP6
Name: Dataset ID, dtype: object
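To see which datasets an exclusion removed, the two result lists can be compared. The following is a minimal sketch: `dropped_ids` is a hypothetical helper, not part of erddapy, and the two lists are a small sample of the IDs shown in the outputs above rather than the full listings.

```python
def dropped_ids(before, after):
    """Return the IDs present in `before` but missing from `after`, in order."""
    after_set = set(after)
    return [dataset_id for dataset_id in before if dataset_id not in after_set]


# Sample of the IDs returned for "HFRadar" (not the full listing).
all_hits = ["erdC1nma_LonPM180", "erdC125h", "ucsdHfrA6", "ucsdHfrE2", "ucsdHfrW500"]
# Sample of the IDs returned for "HFRadar -EXPERIMENTAL".
kept = ["ucsdHfrA6", "ucsdHfrE2", "ucsdHfrW500"]

removed = dropped_ids(all_hits, kept)
print(removed)
```

In the notebook itself, the two `pd.read_csv(url)["Dataset ID"]` results could be passed in place of the sample lists.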

Quoted search, or "phrase search". First, let us try the unquoted search.

In [5]:
search_for = "wind speed"

url = e.get_search_url(search_for=search_for, response="csv")

len(pd.read_csv(url)["Dataset ID"])

487

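That is more datasets than we want: the unquoted search matches the words *wind* and *speed* separately, wherever they occur in a dataset's metadata.
Now let's quote the phrase to restrict the results to datasets that contain the exact phrase *wind speed*.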

In [6]:
search_for = '"wind speed"'

url = e.get_search_url(search_for=search_for, response="csv")

len(pd.read_csv(url)["Dataset ID"])

469
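The three patterns shown in this notebook (extra words, excluded words, and quoted phrases) can be combined in one search string. Below is a minimal sketch of a helper that assembles such a string; `build_search_for` and its parameter names are hypothetical and are not part of erddapy.

```python
def build_search_for(include=(), exclude=(), phrases=()):
    """Compose a search string from plain words, excluded words, and phrases."""
    terms = list(include)
    # Wrap each phrase in double quotes so its words are matched together.
    terms += [f'"{phrase}"' for phrase in phrases]
    # Prefix each unwanted word with "-" to exclude datasets containing it.
    terms += [f"-{word}" for word in exclude]
    return " ".join(terms)


print(build_search_for(include=["HFRadar"], exclude=["EXPERIMENTAL"]))
print(build_search_for(phrases=["wind speed"]))
```

The returned string can be passed as the `search_for` argument of `e.get_search_url`, exactly like the hand-written strings in the cells above.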