# 1_get_radx_mentions
This notebook queries Europe PMC, which includes the full-text articles of PubMed Central, for publications and preprints containing the keyword **"RADx"**. The retrieved publications include those related to the **RADx Initiative** as well as false positives, such as articles mentioning the [**RADX gene**](https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:25486).

The publications that mention RADx projects are used as a test set to validate the selection of RADx-related publications (see `3_analyze_publications_likert.ipynb`).

**Author:** Peter W. Rose ([pwrose@ucsd.edu](mailto:pwrose@ucsd.edu))  
**Date:** 2025-03-13

In [1]:
import requests
import json
import pandas as pd

In [2]:
RADX_RAD_MENTION = "../derived_data/publications_radx_mentions.csv"

In [3]:
project = "RADx"
start_date = "2020-01-01" # RADx projects started in 2020

In [4]:
# Query publications and preprints that mention the RADx project (source: Europe PMC)
url = f"https://www.ebi.ac.uk/europepmc/webservices/rest/search?query=%22{project}%22%20AND%20(FIRST_PDATE:{start_date})%20AND%20sort_date:y&resultType=core&pageSize=1000&format=json&cursorMark=*"

response = requests.get(url, timeout=60)  # fail instead of hanging if the service does not respond
response.raise_for_status()
data = response.json()

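# Flatten the nested JSON records into one row per publication
df = pd.json_normalize(data["resultList"]["result"])
# Store every column as text and blank out missing values
# (depending on the pandas version, a missing value is either kept as NaN or becomes the string "nan")
df = df.astype(str)
df = df.fillna("")
df = df.replace("nan", "")

# Use consistent identifier column names
df = df.rename(columns={"fullTextIdList.fullTextId": "fullTextId", "pmid": "pm_id", "pmcid": "pmc_id"})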
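
The request above fetches a single page of at most 1000 records, which is enough for the 845 hits reported below but would silently truncate a larger result set. The following is a minimal sketch of cursor-based paging, not part of the original notebook. The helper names `fetch_page` and `iter_results` are hypothetical, the sketch uses the standard library's `urllib` instead of `requests` to stay dependency-free, and the stop condition (an empty page, or a missing or unchanged `nextCursorMark`) is an assumption about how the service signals the last page.

```python
import json
from typing import Callable, Iterator
from urllib.parse import urlencode
from urllib.request import urlopen

EUROPE_PMC_SEARCH = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def fetch_page(query: str, cursor_mark: str, page_size: int = 1000) -> dict:
    """Fetch one page of search results and return the parsed JSON (hypothetical helper)."""
    params = urlencode({
        "query": query,
        "resultType": "core",
        "pageSize": page_size,
        "format": "json",
        "cursorMark": cursor_mark,
    })
    with urlopen(f"{EUROPE_PMC_SEARCH}?{params}", timeout=60) as response:
        return json.load(response)


def iter_results(query: str, fetch: Callable[[str, str], dict] = fetch_page) -> Iterator[dict]:
    """Yield every result record, following the cursor from page to page."""
    cursor_mark = "*"  # "*" requests the first page
    while True:
        data = fetch(query, cursor_mark)
        results = data.get("resultList", {}).get("result", [])
        yield from results
        next_mark = data.get("nextCursorMark")
        # Assumed end-of-results signals: empty page, no cursor, or a cursor that did not advance
        if not results or not next_mark or next_mark == cursor_mark:
            break
        cursor_mark = next_mark
```

With this in place, the DataFrame could be built from all pages with `pd.json_normalize(list(iter_results(query)))`, where `query` is the unencoded search string, for example `f'"{project}" AND (FIRST_PDATE:{start_date}) AND sort_date:y'`. Passing `fetch` as a parameter keeps the paging logic separate from the network call, so it can be exercised without contacting the service.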

In [5]:
print(f"Number of publications mentioning {project}: {df.shape[0]}")

Number of publications mentioning RADx: 845


In [6]:
df.head()

Unnamed: 0,id,source,pm_id,pmc_id,doi,title,authorString,pubYear,pageInfo,abstractText,...,tmAccessionTypeList.accessionType,commentCorrectionList.commentCorrection,manuscriptId,embargoDate,investigatorList.investigator,dbCrossReferenceList.dbName,versionNumber,bookOrReportDetails.publisher,bookOrReportDetails.yearOfPublication,versionList.version
0,40470163,MED,40470163,PMC12136053,10.1093/jamiaopen/ooaf046,A community-engaged approach to developing com...,"Pike Welch HL, Guest G, Garba H, Carrillo GA, ...",2025,ooaf046,<h4>Objectives</h4>In response to requests fro...,...,,,,,,,,,,
1,40452232,MED,40452232,,10.1021/acsinfecdis.5c00249,Multifunctional Silk Fibroin Nanocarriers Load...,"Xu F, Cui T, Long Z, Shen H, Xie H, Liu H, Xie...",2025,1707-1718,The development of drug-resistant biofilms has...,...,,,,,,,,,,
2,40442774,MED,40442774,PMC12123805,10.1186/s12943-025-02355-1,Clinical approaches to overcome PARP inhibitor...,"Zou Y, Zhang H, Chen P, Tang J, Yang S, Nicot ...",2025,156,PARP inhibitors have profoundly changed treatm...,...,['nct'],,,,,,,,,
3,40432125,MED,40432125,PMC12116134,10.3390/vaccines13050517,SARS-CoV-2 Antibodies in Response to COVID-19 ...,"Huang Y, Fonseca HR, Acuna L, Wu W, Wang X, Go...",2025,517,<b>Background</b>: Understanding immune respon...,...,,,,,,,,,,
4,40359415,MED,40359415,PMC12074607,10.1371/journal.pone.0323651,Differences in COVID-19 testing perceptions am...,"Singh-Verdeflor KD, Kelly MM, DeMuri GP, Warne...",2025,e0323651,<h4>Background</h4>COVID-19 testing safeguards...,...,,,,,,,,,,


In [7]:
df.to_csv(RADX_RAD_MENTION, index=False)