# Search Terms

This project starts from curated collections of terms: blue-health factors and related association terms covering psychological, social, environmental, and policy aspects. Literature is then collected automatically for those terms using [LISC](https://lisc-tools.github.io/).

Current analysis takes two forms:
- `Words` analyses: analyze text data from articles that discuss blue-health factors and wellbeing outcomes
    - This approach collects text and metadata from papers and builds data-driven profiles for each factor
- `Count` analyses: search for co-occurrences between blue-health factors and association terms
    - This approach identifies patterns based on how often terms appear together
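
To make the idea behind the `Count` analyses concrete, here is a minimal local sketch of co-occurrence counting. It is an illustration only: LISC collects its counts by querying a literature database rather than scanning local text, and the function name `count_cooccurrences` and the toy abstracts are hypothetical.

```python
from itertools import product


def count_cooccurrences(documents, terms_a, terms_b):
    """Count the documents in which a term from each list appears together."""
    # One counter per (term_a, term_b) pair
    counts = {(a, b): 0 for a, b in product(terms_a, terms_b)}
    for doc in documents:
        text = doc.lower()
        for a, b in counts:
            # Simple case-insensitive substring match (an assumption of this sketch)
            if a.lower() in text and b.lower() in text:
                counts[(a, b)] += 1
    return counts


# Hypothetical toy abstracts, for illustration only
docs = [
    "Beach walking was associated with lower perceived stress.",
    "Coastal residence and depression in older adults.",
    "Perceived stress among students who go beach walking.",
]

counts = count_cooccurrences(
    docs,
    ["perceived stress", "depression"],
    ["beach walking", "coastal residence"],
)
print(counts[("perceived stress", "beach walking")])  # 2
```

Pairs that appear together in many documents are the ones a co-occurrence analysis would flag as candidate associations.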

This notebook introduces the terms that are used in the project.


In [1]:
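# Check the installed LISC version
import lisc
print(lisc.__version__)

# Force a reinstall of LISC, upgrading to the newest available release
!pip install --upgrade --force-reinstall lisc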




0.4.0

In [2]:
# List the names exposed by the lisc.io module
import lisc.io
print(dir(lisc.io))


['SCDB', '__builtins__', '__cached__', '__doc__', '__file__', '__loader__', '__name__', '__package__', '__path__', '__spec__', 'create_file_structure', 'db', 'io', 'load_api_key', 'load_meta_data', 'load_object', 'load_time_results', 'load_txt_file', 'save_meta_data', 'save_object', 'save_time_results', 'utils']


In [3]:
from collections import Counter

# Import Base LISC object to load and check search terms
from lisc.objects.base import Base
from lisc.io import load_txt_file


In [4]:
# Import seaborn and set the plotting context
import seaborn as sns
sns.set_context('talk')

In [5]:
# Import custom project code
import sys
sys.path.append('../code')
from plts import plot_latencies

In [6]:
# Set the location of the terms
term_dir = '../terms/'

In [7]:
# Create a Base object to hold and check the terms
risk_factors = Base()

## Risk Factor Terms

First, we can check the list of search terms used to find articles about blue-health factors and wellbeing outcomes.



In [10]:
# Load risk factor terms and labels from file
risk_factors.add_terms('blue_health_factors.txt', directory=term_dir)
risk_factors.add_labels('blue_health_factor_labels.txt', directory=term_dir)

Unloading terms.
Unloading labels.


In [11]:
# Check the number of risk factor terms
print('Number of risk factor terms: {}'.format(risk_factors.n_terms))

Number of risk factor terms: 60


### Risk Factor Term Formatting

In the list below, the term to the left of the colon is the label of the search term (not necessarily used as a search term itself), and the terms to the right of the colon are the search terms that were used. Synonyms are separated by commas and were combined in searches with an OR operator.
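
As a rough illustration of how a list of synonyms turns into a single OR-joined search expression, consider the sketch below. The helper `build_or_query` is hypothetical and the exact quoting is an assumption; LISC builds its own query strings internally, and its formatting may differ.

```python
def build_or_query(synonyms):
    """Join a list of synonyms into one quoted, OR-separated expression."""
    quoted = ['"{}"'.format(term) for term in synonyms]
    return "(" + " OR ".join(quoted) + ")"


# Synonyms taken from the 'Rumination' entry in the term list
query = build_or_query(["rumination", "worry", "brooding"])
print(query)  # ("rumination" OR "worry" OR "brooding")
```

An article matches the expression if it contains any one of the synonyms, which is why a label such as "Rumination" can retrieve papers that only mention "worry".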


In [12]:
# Check list of search terms for the risk factor categories
risk_factors.check_terms()

List of terms used: 

Mental health               : mental health, psychological health, mental wellbeing, psychological wellbeing, subjective wellbeing, SWB, quality of life, QoL, life satisfaction, happiness
Affect balance              : affect balance, positive affect, negative affect, mood, hedonic tone, eudaimonic wellbeing, vitality
Psychological distress      : psychological distress, distress, general distress, K10, GHQ-12
Perceived stress            : perceived stress, stress, PSS, tension, strain
Anxiety                     : anxiety, generalized anxiety, GAD, GAD-7, state anxiety, trait anxiety, STAI
Depression                  : depression, depressive symptoms, PHQ-9, CES-D, BDI, low mood, anhedonia
Rumination                  : rumination, worry, brooding, repetitive negative thinking
Post-traumatic stress       : post-traumatic stress, PTSD symptoms, IES-R, PCL, trauma symptoms
Fear of water               : fear of water, aquaphobia, water-related anxiety
Resilience      

### Risk Factor Exclusion Terms

To exclude articles that use unrelated meanings of our search terms (for example, 'SWB' referring to soil water balance rather than subjective wellbeing), we use exclusion terms to remove irrelevant papers.

These terms are added to the overall search query with a NOT operator, which filters out articles that contain them.
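
The sketch below shows one way inclusion and exclusion terms could be combined into a single Boolean expression. It assumes a generic `(A OR B) NOT (C OR D)` layout; the helper names `or_group` and `build_query` are hypothetical, and the query that LISC actually sends may be formatted differently.

```python
def or_group(terms):
    """Quote each term and join the terms with OR inside parentheses."""
    return "(" + " OR ".join('"{}"'.format(term) for term in terms) + ")"


def build_query(inclusions, exclusions=()):
    """Combine search terms with optional exclusion terms using NOT."""
    query = or_group(inclusions)
    if exclusions:
        query += " NOT " + or_group(exclusions)
    return query


# Terms taken from the 'Affect balance' entries in the term and exclusion lists
query = build_query(["affect balance", "vitality"], ["in vitro", "cell line"])
print(query)  # ("affect balance" OR "vitality") NOT ("in vitro" OR "cell line")
```

With this layout, an article that mentions "vitality" in a cell-culture context is dropped because it also matches one of the exclusion terms.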


In [13]:
# Add exclusion words
risk_factors.add_terms('erps_exclude.txt', term_type='exclusions', directory=term_dir)

In [14]:
# Check the risk factor exclusion terms used
risk_factors.check_terms('exclusions')

List of exclusions used: 

Mental health               : SWB soil water balance, SWB surface water body, SWB salt water battery, QoL lighting, happiness economics, rat, rats, mouse, mice, murine, rodent, zebrafish, drosophila, xenopus, canine, feline, equine, bovine, porcine, ovine, primate, veterinary, livestock, cattle, swine, poultry, avian, in vitro, cell line, cell culture, organoid, HeLa, A549, U937, Caco-2, PC12, fibroblast, astrocyte, microglia, yeast, arabidopsis, maize, rice, plant, crop, fish, fisheries, aquaculture, coral, algae, seaweed, phytoplankton, zooplankton, cyanobacteria, algal bloom, microcystin, saxitoxin, domoic acid, oyster, mussel
Affect balance              : affect heuristic (economics), vitality assay (cell), rat, rats, mouse, mice, murine, rodent, zebrafish, drosophila, xenopus, canine, feline, equine, bovine, porcine, ovine, primate, veterinary, livestock, cattle, swine, poultry, avian, in vitro, cell line, cell culture, organoid, HeLa, A549, U937, Caco-2

## Association Terms

As well as search terms for blue-health factors, we collected lists of potential association terms.

Groups of association terms include:
- blue-health-related terms
- wellbeing-related terms



### Blue-Health-Related Terms

First, we curated a list of blue-health-related association terms, to investigate research on blue-health connections and their associated factors.



In [15]:
# Load blue-health-related terms from file
blue_health_terms = Base()
blue_health_terms.add_terms('blue_health_activities.txt', directory=term_dir)


In [16]:
# Check the number of blue-health-related terms
print('Number of blue-health-related terms: {}'.format(blue_health_terms.n_terms))


Number of blue-health-related terms: 28


In [17]:
# Check the blue-health-related terms used
blue_health_terms.check_terms()


List of terms used: 

# ACTIVITIES (ordered from most passive → very active)   : # ACTIVITIES (ordered from most passive → very active)
contemplation of water                                   : contemplation of water, viewing water, listening to waves, waterscape viewing, ocean viewing, sea breeze appreciation, sunset watching at the coast
sitting by the sea                                       : sitting by the sea, sitting on the beach, deckchair resting, quiet stay on promenade
mindfulness by the sea                                   : mindfulness by the sea, beach meditation, breathing by the sea, yoga on the beach (gentle)
coastal strolling                                        : coastal strolling, coastal walking, beach walking, promenade walking
birdwatching (coastal)                                   : birdwatching (coastal), shorebird watching, seabird watching
rockpooling                                              : rockpooling, tidepool exploration, beachcombing (non-co
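
The first entry in the listing above starts with `#` and has itself as its only search term, which suggests that a header comment in the terms file is being read as a term (and is therefore included in the term count). A minimal sketch of one way to avoid this is to drop comment and blank lines before the terms are added. The helper `parse_term_lines` and the sample lines are hypothetical, and the one-line-per-term, comma-separated layout is an assumption based on the listing.

```python
def parse_term_lines(lines, comment_prefix="#"):
    """Split term-file lines into synonym lists, skipping comments and blanks."""
    parsed = []
    for line in lines:
        line = line.strip()
        # Skip empty lines and header comments such as '# ACTIVITIES ...'
        if not line or line.startswith(comment_prefix):
            continue
        # The first item is the canonical term, the rest are its synonyms
        parsed.append([term.strip() for term in line.split(",") if term.strip()])
    return parsed


# Hypothetical file contents, modelled on the listing above
sample_lines = [
    "# header comment line",
    "coastal strolling, coastal walking, beach walking, promenade walking",
    "",
    "birdwatching (coastal), shorebird watching, seabird watching",
]

terms = parse_term_lines(sample_lines)
print(len(terms))   # 2
print(terms[0][0])  # coastal strolling
```

The resulting list of lists could then be passed to `add_terms` instead of a file name, assuming the installed LISC version accepts list input for terms; this is worth checking against the LISC documentation.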

### Wellbeing-Related Terms

Finally, we curated a list of wellbeing-related terms to search for research related to holistic health and wellbeing outcomes.



In [18]:
# Load wellbeing-related terms from file
wellbeing_terms = Base()
wellbeing_terms.add_terms('blue_health_exposure_metrics.txt', directory=term_dir)


In [19]:
# Check the number of wellbeing-related terms
print('Number of wellbeing-related terms: {}'.format(wellbeing_terms.n_terms))


Number of wellbeing-related terms: 18


In [20]:
# Check the wellbeing-related terms used
wellbeing_terms.check_terms()


List of terms used: 

# EXPOSURE METRICS (not ordered by intensity; 1 line = canonical term + synonyms)          : # EXPOSURE METRICS (not ordered by intensity; 1 line = canonical term + synonyms)
coastal residence                                                                          : coastal residence, coastal proximity, distance to coast, distance to shoreline, distance to beach, coastal living
blue space proximity                                                                       : blue space proximity, blue-space exposure, perceived proximity, euclidean distance to water
amount of blue space                                                                       : amount of blue space, % blue space, blue space coverage, NDWI, normalized difference water index
street-view blueness                                                                       : street-view blueness, blue view index, view of water, window view of water, waterscape view, VVI
access to blue space   