# User prompt
The user defines the research objective as a prompt. I use the OpenAI API to identify relevant keywords for the research.
Documentation:
* https://platform.openai.com/docs/guides/text?api-mode=responses&lang=python

In [59]:
import os
from dotenv import load_dotenv
from openai import OpenAI
from openai.types.responses import Response

# Load the API key
load_dotenv()

client: OpenAI = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

# Research objective
research_objective: str = """
At present, one of our clients is looking to speak with professionals who have insights about the emerging technologies 
in soft contact lens manufacturing, particularly non-injection moulded methods. They would broadly like to understand how 
these technologies are reshaping the industry—from on-demand manufacturing to smart, drug-delivery-enabled lenses.
"""

prompt: str = f"""
You are assisting a researcher in generating targeted search terms for academic and patent literature related to the research topic described below.

Return a JSON object with the following structure:
- "main_topic": a concise phrase (2–5 words) that reflects the core technological focus of the research
- "openalex": exactly 5 academic search terms, each a single word
- "patentview": exactly 5 patent-related keywords, each a single word
- "cpc_codes": exactly 5 valid CPC classification codes relevant to the topic

Instructions:
- Output must be valid JSON only — no markdown, comments, or extra text
- All terms in "openalex" and "patentview" must be single words (no spaces, no hyphens)
- Do NOT include any words or close variants from "main_topic" in "openalex" or "patentview"
- All terms across the fields must be unique — no repetition or synonyms
- Each term in "openalex" and "patentview" must be conceptually compatible with the "main_topic" so that combining them (e.g. "main_topic" AND "keyword") produces a realistic and meaningful research query
- Use language and terminology commonly found in scientific publications and patent documents

Research Objective:
\"\"\" 
{research_objective} 
\"\"\"
"""

# Selected model
# model="gpt-3.5-turbo",
# model="gpt-4-turbo",
# model="gpt-4o", 
GPT_MODEL = "gpt-4o-mini"

response: Response = client.responses.create(
    model=GPT_MODEL,
    input=prompt,
)

# print(type(response))
print(response.output_text)

{
  "main_topic": "soft contact lenses",
  "openalex": ["manufacturing", "technology", "design", "materials", "optics"],
  "patentview": ["innovation", "delivery", "smart", "process", "application"],
  "cpc_codes": ["A61F2/16", "A61F2/20", "B29C45/00", "B29D11/00", "C08J5/18"]
}


In [66]:
import os
from dotenv import load_dotenv
from openai import OpenAI
from openai.types.responses import Response

# Load the API key
load_dotenv()

client: OpenAI = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

# Research objective
research_objective: str = """
At present, one of our clients is looking to speak with professionals who have insights about the emerging technologies 
in soft contact lens manufacturing, particularly non-injection moulded methods. They would broadly like to understand how 
these technologies are reshaping the industry—from on-demand manufacturing to smart, drug-delivery-enabled lenses.
"""

prompt: str = f"""
You are assisting a researcher in generating targeted search terms for academic and patent literature related to the research topic described below.

Return a JSON object with the following structure:
- "main_topic": a concise phrase (2–5 words) that reflects the core technological focus of the research
- "openalex": exactly 5 academic search terms, each a single word
- "patentview": exactly 5 patent-related keywords, each a single word
- "cpc_codes": exactly 5 valid CPC classification codes relevant to the topic

Instructions:
- Output must be valid JSON only — no markdown, comments, or extra text
- All terms in "openalex" and "patentview" must be single words (no spaces, no hyphens)
- Do NOT include any words or close variants from "main_topic" in "openalex" or "patentview"
- All terms across the fields must be unique — no repetition or synonyms
- Each term in "openalex" and "patentview" must be conceptually compatible with the "main_topic" so that combining them (e.g. "main_topic" AND "keyword") produces a realistic and meaningful research query
- Use language and terminology commonly found in scientific publications and patent documents

Research Objective:
\"\"\" 
{research_objective} 
\"\"\"
"""

# Selected model
GPT_MODEL = "gpt-4o-mini"

response: Response = client.responses.create(
    model=GPT_MODEL,
    input=prompt,
)

# print(type(response))
print(response.output_text)

{
  "main_topic": "soft contact lenses",
  "openalex": [
    "manufacturing",
    "technology",
    "innovation",
    "materials",
    "design"
  ],
  "patentview": [
    "optics",
    "biocompatible",
    "hydrogels",
    "coatings",
    "sustainability"
  ],
  "cpc_codes": [
    "G02C7/04",
    "A61F2/34",
    "B29C45/15",
    "B29D99/00",
    "G01N33/53"
  ]
}
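
The prompt asks for exactly five single-word terms per list, with no repeats and no words taken from `main_topic`, but nothing in the notebook checks that the model complied, and the two runs above already return different keyword sets. A minimal sketch of such a check, assuming the reply has already been parsed into a dict. `validate_keywords` is a hypothetical helper, not part of the notebook, and it only checks the shape of the payload: it cannot tell whether a CPC code actually exists.

```python
def validate_keywords(payload: dict) -> list[str]:
    """Return a list of human-readable problems; an empty list means the payload passes."""
    problems: list[str] = []
    topic_words = {w.lower() for w in str(payload.get("main_topic", "")).split()}

    seen: set[str] = set()
    for field in ("openalex", "patentview"):
        terms = payload.get(field, [])
        if len(terms) != 5:
            problems.append(f"{field}: expected 5 terms, got {len(terms)}")
        for term in terms:
            key = term.lower()
            # isalpha() rejects spaces, hyphens, digits and empty strings
            if not key.isalpha():
                problems.append(f"{field}: '{term}' is not a single word")
            if key in topic_words:
                problems.append(f"{field}: '{term}' repeats a main_topic word")
            if key in seen:
                problems.append(f"{field}: '{term}' is duplicated")
            seen.add(key)

    if len(payload.get("cpc_codes", [])) != 5:
        problems.append("cpc_codes: expected 5 codes")
    return problems


# Payload copied from the first run above
first_run = {
    "main_topic": "soft contact lenses",
    "openalex": ["manufacturing", "technology", "design", "materials", "optics"],
    "patentview": ["innovation", "delivery", "smart", "process", "application"],
    "cpc_codes": ["A61F2/16", "A61F2/20", "B29C45/00", "B29D11/00", "C08J5/18"],
}
print(validate_keywords(first_run))  # prints []
```

If the list is not empty, one option is to call the model again with the problems appended to the prompt.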


In [None]:
import json

def load_json(response_output: str) -> dict[str, str | list[str]]:
    # Parse the JSON text returned by the model into a dictionary
    json_dict: dict[str, str | list[str]] = json.loads(response_output)
    # print("Parsed dict:", json_dict)
    return json_dict

# Print the main topic
research_key_words = load_json(response.output_text)
print(research_key_words["main_topic"])
# print(research_key_words["openalex"][0])

soft contact lenses
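
`json.loads` raises if the model wraps its answer in a markdown code fence despite the instruction not to. A hedged sketch of a more tolerant parser follows. `parse_model_json` is a hypothetical name, and the fence-stripping rule is an assumption about what such replies usually look like, not documented OpenAI behaviour.

```python
import json
import re

FENCE = "`" * 3  # built this way to avoid writing a literal fence inside the example


def parse_model_json(raw: str) -> dict:
    """Parse a model reply as JSON, tolerating an optional surrounding code fence."""
    text = raw.strip()
    if text.startswith(FENCE):
        # Drop the opening fence (with optional language tag) and the closing fence
        text = re.sub(r"^`{3}[a-zA-Z]*\s*", "", text)
        text = re.sub(r"\s*`{3}$", "", text)
    parsed = json.loads(text)
    if not isinstance(parsed, dict):
        raise ValueError("expected a JSON object at the top level")
    return parsed


reply = FENCE + 'json\n{"main_topic": "soft contact lenses"}\n' + FENCE
print(parse_model_json(reply)["main_topic"])  # prints: soft contact lenses
```

Replies that are not JSON at all still raise `json.JSONDecodeError`, so the caller can decide whether to retry or skip.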


# OpenAlex API
https://docs.openalex.org/

Valid query parameters are:
* apc_sum
* cited_by_count_sum
* cursor
* filter
* format
* group_by
* group-by
* group_bys
* group-bys
* mailto
* page
* per_page
* per-page
* q
* sample
* seed
* search
* select
* sort
* warm
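
Several of the parameters listed above can be combined into one request per keyword. The sketch below only builds the parameter dicts and sends nothing. `build_queries` is a hypothetical helper and `you@example.com` is a placeholder. The use of `cursor="*"` to start cursor paging and of `select` to limit the returned fields reflects my reading of the OpenAlex docs, so treat both as assumptions to verify there.

```python
def build_queries(keywords: dict, mailto: str, per_page: int = 15) -> list[dict]:
    """Build one parameter dict per OpenAlex keyword, ready for requests.get(url, params=...)."""
    main_topic = keywords["main_topic"]
    queries = []
    for term in keywords["openalex"]:
        queries.append({
            "search": f"({main_topic} AND {term})",
            "per_page": per_page,
            "cursor": "*",  # assumption: "*" requests the first cursor page
            "select": "id,title,publication_year,cited_by_count,abstract_inverted_index",
            "mailto": mailto,
        })
    return queries


# Keywords copied from the model output shown earlier
keywords = {
    "main_topic": "soft contact lenses",
    "openalex": ["manufacturing", "technology", "innovation", "materials", "design"],
}
for query in build_queries(keywords, mailto="you@example.com"):
    print(query["search"])
```

Each dict can then be passed as `params` to `requests.get("https://api.openalex.org/works", ...)`.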

In [None]:
import requests
import pandas as pd
from typing import Any

def reconstruct_abstract(abstract_inverted_index: dict[str, list[int]] | None) -> str:
    '''
    Reconstruct the abstract from abstract_inverted_index
    '''

    # Some works don't have an abstract
    if not abstract_inverted_index:
        return ""

    # Find the highest position across all words to size the output list
    max_value: int = max(
        (
            position
            for positions in abstract_inverted_index.values()
            for position in positions
        ),
        default=-1,
    )

    # Create a list of empty strings with the abstract size
    abstract: list[str] = [""] * (max_value + 1)

    # Loop through each word in the abstract_inverted_index
    for word, positions in abstract_inverted_index.items():
        # Insert the word at every position where it appears
        for position in positions:
            abstract[position] = word

    # Join the words with spaces, skipping any position the index left empty
    abstract_text: str = " ".join(word for word in abstract if word)
    # print("\n", abstract_text)

    return abstract_text

url: str = "https://api.openalex.org/works"

# TODO: create a loop to make the search on all the identified key words.
main_topic: str = research_key_words["main_topic"]
research_key_word: str = research_key_words["openalex"][0]

search_terms: str = f"({main_topic} AND {research_key_word})"
mailto: str = "adyl.elguamra@gmail.com" #For best performance, add your email to all API requests

per_page: int = 15 # By default there are 25 results per page
page: int = 1 # Get the result from page number

params: dict = {
    "search": search_terms, # searches across titles, abstracts, and fulltext.
    "per_page": per_page, 
    "page": page, # if needed I can loop over pages. for example from page 1 to 5 with a for loop
    "sort": "relevance_score:desc",
    "mailto": mailto   
}
response: requests.Response = requests.get(url, params=params)

if response.status_code == 200:
    
    data: dict[str, Any] = response.json()

    works: list[dict[str, Any]] = data.get("results", []) # access a key in dictionary

    # Extract info into list of dicts
    records: list = []
    # loop in the list
    for work in works:
        
        abstract_inverted_index: dict[str, list[int]] = work.get("abstract_inverted_index") or {}
        # print(type(abstract_inverted_index))
        abstract = reconstruct_abstract(abstract_inverted_index)

        record: dict[str, Any] = {
            "title": work.get("title"),
            "abstract": abstract,
            "publication_date": work.get("publication_date"),
            "year": work.get("publication_year"),
            "citations": work.get("cited_by_count"),
            "authors": [auth["author"]["display_name"] for auth in work.get("authorships", [])],
            "openAlex id": work.get("id"),
        }
        records.append(record)

    print("Final URL:", response.url)
    print("JSON Data:", response.json())

    # Convert to DataFrame
    df: pd.DataFrame = pd.DataFrame(records)
    display(df)

else:
    print(f"Failed to fetch data. Status code: {response.status_code}")
    print(response.text)

Final URL: https://api.openalex.org/works?search=%28soft+contact+lenses+AND+manufacturing%29&per_page=15&page=1&sort=relevance_score%3Adesc&mailto=adyl.elguamra%40gmail.com
JSON Data: {'meta': {'count': 26400, 'db_response_time_ms': 282, 'page': 1, 'per_page': 15, 'groups_count': None}, 'results': [{'id': 'https://openalex.org/W2044512869', 'doi': 'https://doi.org/10.1016/j.eurpolymj.2014.11.024', 'title': 'Biomedical applications of hydrogels: A review of patents and commercial products', 'display_name': 'Biomedical applications of hydrogels: A review of patents and commercial products', 'relevance_score': 639.4715, 'publication_year': 2014, 'publication_date': '2014-11-29', 'ids': {'openalex': 'https://openalex.org/W2044512869', 'doi': 'https://doi.org/10.1016/j.eurpolymj.2014.11.024', 'mag': '2044512869'}, 'language': 'en', 'primary_location': {'is_oa': True, 'landing_page_url': 'https://doi.org/10.1016/j.eurpolymj.2014.11.024', 'pdf_url': None, 'source': {'id': 'https://openalex.or

Unnamed: 0,title,abstract,publication_date,year,citations,authors,openAlex id
0,Biomedical applications of hydrogels: A review of patents and commercial products,"Hydrogels have become very popular due to their unique properties such as high water content, softness, flexibility and biocompatibility. Natural and synthetic hydrophilic polymers can be physically or chemically cross-linked in order to produce hydrogels. Their resemblance to living tissue opens up many opportunities for applications in biomedical areas. Currently, hydrogels are used for manufacturing contact lenses, hygiene products, tissue engineering scaffolds, drug delivery systems and wound dressings. This review provides an analysis of their main characteristics and biomedical applications. From Wichterle's pioneering work to the most recent hydrogel-based inventions and products on the market, it provides the reader with a detailed introduction to the topic and perspective on further potential developments.",2014-11-29,2014,2291,"[Enrica Caló, Vitaliy V. Khutoryanskiy]",https://openalex.org/W2044512869
1,Multistate Outbreak of Fusarium Keratitis Associated With Use of a Contact Lens Solution,"Fusarium keratitis is a serious corneal infection, most commonly associated with corneal injury. Beginning in March 2006, the Centers for Disease Control and Prevention received multiple reports of Fusarium keratitis among contact lens wearers.To define the specific activities, contact lens hygiene practices, or products associated with this outbreak.Epidemiological investigation of Fusarium keratitis occurring in the United States. A confirmed case was defined as keratitis with illness onset after June 1, 2005, with no history of recent ocular trauma and a corneal culture growing Fusarium species. Data were obtained by patient and ophthalmologist interviews for case patients and neighborhood-matched controls by trained personnel. Available Fusarium isolates from patients' clinical and environmental specimens were genotyped by multilocus sequence typing. Environmental sampling for Fusarium was conducted at a contact lens solution manufacturing plant.Keratitis infection with Fusarium species.As of June 30, 2006, we identified 164 confirmed case patients in 33 states and 1 US territory. Median age was 41 years (range, 12-83 years). Corneal transplantation was required or planned in 55 (34%). One hundred fifty-four (94%) of the confirmed case patients wore soft contact lenses. Forty-five case patients and 78 controls were included in the case-control study. Case patients were significantly more likely than controls to report using a specific contact lens solution, ReNu with MoistureLoc (69% vs 15%; odds ratio, 13.3; 95% confidence interval, 3.1-119.5). The prevalence of reported use of ReNu MultiPlus solution was similar between case patients and controls (18% vs 20%; odds ratio, 0.7; 95% confidence interval, 0.2-2.8). Fusarium was not recovered from the factory, warehouse, solution filtrate, or unopened solution bottles; production of implicated lots was not clustered in time. 
Among 39 isolates tested, at least 10 different Fusarium species were identified, comprising 19 unique multilocus genotypes.The findings from this investigation indicate that this outbreak of Fusarium keratitis was associated with use of ReNu with MoistureLoc contact lens solution. Contact lens users should not use ReNu with MoistureLoc.",2006-08-22,2006,592,"[Douglas C. Chang, Gavin B. Grant, Kerry O’Donnell, Kathleen Wannemuehler, Judith Noble‐Wang, Carol Y. Rao, Lara M. Jacobson, Claudia S. Crowell, Rodlescia Sneed, Felicia M.T. Lewis, Joshua K. Schaffzin, Marion Kainer, Carol A. Genese, Eduardo C. Alfonso, Dan B. Jones, Arjun Srinivasan, Scott K. Fridkin, Benjamin J. Park, for the Fusarium Keratitis Investigation Team]",https://openalex.org/W2116582646
2,Contact lens practice,Part 1 Introduction Historical perspective. The anterior eye Visual optics Clinical instruments Part 2 Soft contact lenses Soft lens materials Soft lens manufacture Soft lens optics Soft lens measurement Soft lens design and fitting Soft toric lens design and fitting Soft lens care systems Part 3 Rigid contact lenses Rigid lens materials Rigid lens manufacture Rigid lens optics Rigid lens measurement Rigid lens design and fitting Rigid toric lens design and fitting Rigid lens care systems Part 4 Lens replacement modalities Unplanned lens replacement Daily soft lens replacement Planned soft lens replacement Planned rigid lens replacement Part 5 Special lenses and fitting considerations Scleral lenses Tinted lenses Presbyopia Continuous wear Sport Keratoconus High ametropia Paediatric fitting Therapeutic applications Post-refractive Surgery Post-keratoplasty Orthokeratology Diabetes Part 6 Patient examination and management History taking Preliminary examination Patient education Aftercare Complications Digital imaging Compliance Practice management Appendices Index,2002-01-01,2002,169,[Nathan Efron],https://openalex.org/W1746299914
3,Therapeutic Contact Lenses with Polymeric Vehicles for Ocular Drug Delivery: A Review,"The eye has many barriers with specific anatomies that make it difficult to deliver drugs to targeted ocular tissues, and topical administration using eye drops or ointments usually needs multiple instillations to maintain the drugs’ therapeutic concentration because of their low bioavailability. A drug-eluting contact lens is one of the more promising platforms for controllable ocular drug delivery, and, among various manufacturing methods for drug-eluting contact lenses, incorporation of novel polymeric vehicles with versatile features makes it possible to deliver the drugs in a sustained and extended manner. Using the diverse physicochemical properties of polymers for nanoparticles or implants that are selected according to the characteristics of drugs, enhancement of encapsulation efficiency and prolonged drug release are possible. Even though therapeutic contact lenses with polymeric vehicles allow us to achieve sustained ocular drug delivery, drug leaching during storage and distribution and the possibility of problems related to surface roughness due to the incorporated vehicles still need to be discussed before application in a real clinic. This review highlights the overall trends in methodology to develop therapeutic contact lenses with polymeric vehicles and discusses the limitations including comparison to cosmetically tinted soft contact lenses.",2018-07-01,2018,96,"[Seung Woo Choi, Jaeyun Kim]",https://openalex.org/W2810764086
4,Poly(vinyl alcohol) Hydrogels Reinforced with Nanocellulose for Ophthalmic Applications: General Characteristics and Optical Properties,"Globally, uncorrected refractive errors are one of the main causes of visual impairment, and contact lenses form an important part of modern day eye care and culture. Several hydrogels with varying physicochemical properties are in use to manufacture soft contact lenses. Hydrogels are generally too soft and reinforcement with appropriate materials is desirable to achieve high water content without compromising mechanical properties. In this study, we have developed a highly transparent macroporous hydrogel with water content >90%, by combining poly(vinyl alcohol) with nanocellulose. Furthermore, the results show that the composite hydrogel has refractive index close to that of water and very good UV-blocking properties.",2016-12-01,2016,73,"[Gopi Krishna Tummala, Ramiro Rojas, Albert Mihranyan]",https://openalex.org/W2558993627
5,Impact of Manufacturing Technology and Material Composition on the Clinical Performance of Hydrogel Lenses,"Purpose. To establish the clinical impact of three different methods of manufacture used to produce soft contact lenses. Methods. Clinical performance of five lens types was investigated by undertaking a prospective, double-masked, randomized, crossover study. Three of the lenses were made from poly(hydroxyethyl methacrylate) (pHEMA) by three different manufacturing processes (lathing, spin casting, and cast molding), and the remaining two lenses were cast molded from different materials—hydroxyethyl methacrylate/methacrylic acid and hydroxyethyl methacrylate/glycerol methacrylate (HEMA/GMA). All lenses were specially fabricated for this work at the same manufacturing plant. Thirty-four soft contact lens wearers wore each lens for 1 month on a daily-wear basis. Several clinical variables, such as ocular response, visual acuity, lens fitting, prelens tear film, lens surface dehydration, subjective response, and protein deposition, were measured. Results. In general, the spun-cast pHEMA lens performed inferiorly compared with the other pHEMA lenses. This lens induced significantly more limbal and conjunctival hyperemia than the cast-molded lens and provided poorer low contrast visual acuity (LCVA) than the other two lenses. It dehydrated more and had the least on-eye movement. However, the spun-cast lens deposited the least protein of the pHEMA lenses. In general, the HEMA/GMA lens performed inferiorly compared with the other cast-molded lenses. LCVA was worse with this lens, and subjective responses showed that this lens was thought to give the worst visual performance of the cast-molded lenses. It was also thought to be the most difficult lens to handle. Significantly more breakages occurred with this lens than any other. Conclusions. 
Overall, this work has shown that manufacturing method and material composition have a fundamental effect on many clinical properties of a lens. Therefore, method of manufacture is also an important consideration in the overall production of a soft lens.",2004-06-01,2004,61,"[Carole Maldonado‐Codina, Nathan Efron]",https://openalex.org/W1979623023
6,Development of Contact Lenses and Their Worldwide Use,"The concept of applying a lens to the cornea as a refractive appliance was first proposed in the early 19th century. By 1888, glass scleral lenses for the correction of optical defects and irregularities were manufactured and used. New materials, especially soft hydrogel lenses and rigid gas-permeable lenses, became available in the 20th century and allowed comfortable contact lenses to be made in any design needed. By the 21st century, the increasing use of silicone hydrogel lenses to address the oxygen need of the cornea has led to increased worldwide use. Of the 125 million global contact lens wearers, most are female and relatively young. Soft lenses are by far the dominant modality used, with silicone hydrogel lenses taking an increasing share of new fittings, particularly for overnight wear. Microbial keratitis, although relatively uncommon, remains the most serious potential complication for these lens wearers. Ongoing basic research, more powerful antimicrobial agents, and the development of safer lens materials are helping to alleviate this problem.",2007-11-01,2007,61,[James E. Key],https://openalex.org/W2058037352
7,Initial in Vivo Tear Protein Deposition on Individual Hydrogel Contact Lenses,"We investigated and compared the initial composition, morphology, and time course of deposits on individual soft contact lenses of different water contents and surface charges in order to evaluate the potential for antigenic reactions and to predict the optimal frequency of lens replacement. Newly manufactured lenses were worn for graduated periods of time from 1 min to 8 h by subjects who were first adapted to daily wear soft lenses. The morphology and composition of the deposits were analyzed by histological staining, light microscopy, scanning electron microscopy (SEM), sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDSPAGE) with silver nitrate staining, and immunofluorescence microscopy. The protein bands of the acrylamide gels were divided according to their molecular weights into six groups which have been defined in the literature from tear analyses by electrophoretic techniques and include lysozyme, proteins migrating faster than albumin (PMFA), protein G, albumin, lactoferrin, and other proteins heavier than albumin such as Ig-G and secretory Ig-A. Specific proteins (lysozyme, PMFA, and protein G) were detected on individual lenses after as little as 1 min of wear. There was an increasing amount of protein deposited as the wearing time increased. Differences in the rates and amounts of deposition were more dependent on lens water content and ionic characteristics than on intersubject differences. Such early significant protein deposition may occur in wearers of disposable lenses as well as in those subject to complications due to accumulation of protein.",1990-07-01,1990,94,"[Charles Leahy, Robert B. Mandell, S. Lin]",https://openalex.org/W2033044886
8,Overnight Corneal Reshaping versus Soft Disposable Contact Lenses: Vision-Related Quality-of-Life Differences From a Randomized Clinical Trial,"Purpose. The purpose of this article is to evaluate patients' visual acuity, symptoms, and perceptions of vision-related quality of life in a randomized crossover clinical trial of overnight corneal reshaping (OCR) and daily wear soft lenses (SCL). Methods. Qualified subjects were randomly assigned to wear one mode of contact lens for 8 weeks and then, after a washout period, they wore the alternate mode for 8 weeks. On concluding each contact lens wear mode, subjects completed the NEI-RQL42 questionnaire. During the SCL mode, subjects wore lenses during their waking hours. During the OCR mode, subjects wore lenses only while sleeping. Soft lenses were Biomedics 55 2-week disposable lenses. OCR lenses were CRT lenses by Paragon. (Three subjects were fit with custom-designed OCR lenses in Boston XO material, manufactured by Art Optical.) LogMAR acuity was measured and slit lamp evaluation was performed at specified intervals during follow up. After completing both phases of the study, patients chose which mode they preferred. Results. Of 81 enrolled patients, 65 completed both phases and 16 dropped out during the study. Significant differences (p < 0.01) favoring SCL wear included better visual acuity and less trouble with glare. Significant differences (p < 0.01) favoring OCR wear included less activity limitations, less trouble with symptoms, and less dependence on refractive correction. Of 65 completing both phases, 44 preferred the OCR lenses and 21 preferred the soft lenses. Subjects who preferred the OCR lenses were less myopic and had steeper K readings at baseline, and showed less difference between visual acuity during OCR wear and visual acuity with SCL. Conclusion. 
In subjects with mild myopia who experienced both SCL and OCR, better visual acuity and less glare resulted from SCL wear, whereas activity limitations, symptoms, and dependence on refractive correction were less troublesome with OCR wear. When the study was completed, 67.7% chose OCR lenses worn only while sleeping, whereas 32.3% preferred 2-week disposable soft lenses worn during the day as their preferred correction.",2005-10-01,2005,61,"[Michael Lipson, Alan Sugar, David C. Musch]",https://openalex.org/W2018650273
9,A Wirelessly Powered Smart Contact Lens with Reconfigurable Wide Range and Tunable Sensitivity Sensor Readout Circuitry,"This study presented a wireless smart contact lens system that was composed of a reconfigurable capacitive sensor interface circuitry and wirelessly powered radio-frequency identification (RFID) addressable system for sensor control and data communication. In order to improve compliance and reduce user discomfort, a capacitive sensor was embedded on a soft contact lens of 200 μm thickness using commercially available bio-compatible lens material and a standard manufacturing process. The results indicated that the reconfigurable sensor interface achieved sensitivity and baseline tuning up to 120 pF while consuming only 110 μW power. The range and sensitivity tuning of the readout circuitry ensured a reliable operation with respect to sensor fabrication variations and independent calibration of the sensor baseline for individuals. The on-chip voltage scaling allowed the further extension of the detection range and prevented the implementation of large on-chip elements. The on-lens system enabled the detection of capacitive variation caused by pressure changes in the range of 2.25 to 30 mmHg and hydration level variation from a distance of 1 cm using incident power from an RFID reader at 26.5 dBm.",2017-01-07,2017,61,"[Jin‐Chern Chiou, Shun-Hsi Hsu, Yu-Chieh Huang, Guan-Ting Yeh, Wei-Ting Liou, Cheng-Kai Kuei]",https://openalex.org/W2568988523


## OpenAlex: Scoring and justification with OpenAI

Documentation:
* https://platform.openai.com/docs/guides/embeddings

In [None]:
import pandas as pd
from sklearn.metrics.pairwise import cosine_similarity
from openai import OpenAI
import time

def get_embedding(text, model="text-embedding-ada-002"):
    text = text.replace("\n", " ")
    response = client.embeddings.create(input=[text], model=model)
    return response.data[0].embedding


print("Embedding query...")
query_embedding = get_embedding(research_objective)

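# Some works have no title or abstract; treat missing values as empty strings
df["text"] = df["title"].fillna("") + ". " + df["abstract"].fillna("")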


print("Embedding papers...")
df["embedding"] = df["text"].apply(get_embedding)

df["similarity"] = df["embedding"].apply(
    lambda x: cosine_similarity([query_embedding], [x])[0][0]
)


def get_justification(title, abstract, objective):
    prompt = f"""
    You are the CEO, as well as a scientific and regulatory analyst, evaluating academic research for a company exploring new technologies in soft contact lenses.

    Below is a paper's title and abstract, followed by the company's research objective. Assess the paper across eight dimensions critical to industry adoption. Your evaluation should reflect both business and technical perspectives.

    Title:  
    {title}

    Abstract:  
    {abstract}

    Research Objective:  
    {objective}

    For each dimension below, start with **Yes** or **No**, followed by a **1–2 sentence explanation** based only on the title and abstract. Be **concise**, **specific**, and **fact-based**. Avoid speculation or vague generalizations.

    1. **Technical Relevance** – Does the core technology directly relate to soft contact lenses or relevant materials/devices?  
    2. **Innovation** – Is there a clear, specific novelty or improvement over existing technologies?  
    3. **Feasibility** – Is the approach practical for real-world use or scalable manufacturing?  
    4. **Regulatory Fit** – Does it show potential to meet medical device or material safety regulations?  
    5. **Commercial Potential** – Is there a clear path to productization, monetization, or licensing?  
    6. **Research Credibility** – Are the science, methods, or authors/institutions reputable?  
    7. **IP / Competition** – Is the idea likely protectable or positioned ahead of competitors?  
    8. **Overall Relevance** – Is the paper relevant overall? Start with Yes or No, then briefly explain why.

    Format your response exactly like this:

    Technical Relevance: Yes/No – [reason]  
    Innovation: Yes/No – [reason]  
    Feasibility: Yes/No – [reason]  
    Regulatory Fit: Yes/No – [reason]  
    Commercial Potential: Yes/No – [reason]  
    Research Credibility: Yes/No – [reason]  
    IP / Competition: Yes/No – [reason]  
    Overall Relevance: Yes/No – [reason]

    Only use what is stated or implied in the title and abstract.
    
    Return a valid JSON object with exactly the following keys:

    - "justification": a string with the full 8-dimension evaluation in the exact format described above
    - "is_relevant": a string, either "Yes" or "No", based on your answer to the "Overall Relevance" question

    Ensure the JSON object is valid and returned as plain text — no markdown or extra explanation.
    """


    try:
        response: Response = client.responses.create(
            model=GPT_MODEL,
            input=prompt,
        )
        return response.output_text
    except Exception as e:
        return f"ERROR: {e}"
    
print("Generating GPT justifications...")

# justification_dict = load_json(get_justification(df.at[0,"title"], df.at[0,"abstract"], research_objective))
# justification_dict["is_relevant"]

# TODO: optimize the code with .apply()
# loop through the dataframe and update the paper justification
for idx, row in df.iterrows():
    justification_dict = load_json(get_justification(row["title"], row["abstract"], research_objective))
    df.at[idx, "justification"] = justification_dict["justification"]
    df.at[idx, "is_relevant"] = justification_dict["is_relevant"]
    

Embedding query...
Embedding papers...
Generating GPT justifications...
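
The `similarity` column is the cosine of the angle between the query embedding and each paper embedding: the dot product divided by the product of the two vector lengths. A small pure-Python sketch of that formula on toy three-dimensional vectors (not real embeddings), to make the ranking step concrete. `cosine` is a hypothetical helper and is not what the notebook calls.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


query = [1.0, 0.0, 0.0]
papers = {
    "same direction": [2.0, 0.0, 0.0],
    "45 degrees": [1.0, 1.0, 0.0],
    "orthogonal": [0.0, 3.0, 0.0],
}

# Rank the toy papers from most to least similar to the query
ranked = sorted(papers, key=lambda name: cosine(query, papers[name]), reverse=True)
for name in ranked:
    print(name, round(cosine(query, papers[name]), 3))
```

Vector length does not affect the score, only direction does, which is why the first toy paper scores 1.0 despite being twice as long as the query. In the notebook, `cosine_similarity` from scikit-learn accepts two 2-D arrays, so stacking all paper embeddings into one matrix and calling it once would avoid the per-row lambda.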


In [71]:
# Show full column contents and more columns
pd.set_option("display.max_colwidth", None)    # Show full text in cells
# pd.set_option("display.max_columns", None)     # Show all columns
# pd.set_option("display.width", 0)              # Auto-detect width (or set a high number)

df[["title","similarity", "is_relevant", "justification"]].head(15)

Unnamed: 0,title,similarity,is_relevant,justification
0,Biomedical applications of hydrogels: A review of patents and commercial products,0.769526,Yes,"Technical Relevance: Yes – The paper discusses hydrogels, which are used in manufacturing contact lenses. Innovation: No – The abstract does not indicate any specific novel advancements beyond existing uses of hydrogels. Feasibility: Yes – Hydrogels are established in the market and feasible for manufacturing processes. Regulatory Fit: Yes – The mention of biocompatibility suggests potential compliance with medical device regulations. Commercial Potential: Yes – The review identifies ongoing applications and therefore hints at productization opportunities. Research Credibility: Yes – The paper reviews existing patents and products, implying a credible overview of the field. IP / Competition: Yes – The discussion of patents suggests protectable innovations within hydrogels. Overall Relevance: Yes – The content of the paper is pertinent to soft contact lens technology and the research objective."
1,Multistate Outbreak of Fusarium Keratitis Associated With Use of a Contact Lens Solution,0.78325,No,"Technical Relevance: Yes – The paper addresses a specific incident related to contact lens solutions and their impact on soft contact lens users. Innovation: No – The paper does not introduce new technologies or improvements; it reports an outbreak connected to an existing product. Feasibility: No – The research focuses on outbreak investigation rather than manufacturing methods, making practical implementation unclear. Regulatory Fit: Yes – The findings indicate the requirement for safety evaluations in specific solutions, aligning with regulatory frameworks. Commercial Potential: No – The paper does not present a pathway for new product development or monetization, focusing instead on a negative outcome associated with a product. Research Credibility: Yes – The publication is based on extensive epidemiological research and includes data from health authorities like the CDC. IP / Competition: No – The study is more about addressing a health crisis than presenting a novel concept that would be protectable. Overall Relevance: No – While the paper is informative, its focus on an outbreak may not align with the exploration of new contact lens technologies as stated in the research objective."
2,Contact lens practice,0.820748,No,"Technical Relevance: Yes – The paper discusses various aspects of soft contact lenses, including materials and manufacturing processes. Innovation: No – There is no specific novelty or improvement mentioned in the abstract. Feasibility: Yes – The paper suggests established practices for manufacturing and fitting contact lenses. Regulatory Fit: Yes – Mention of clinical considerations implies compliance with medical device regulations is likely. Commercial Potential: No – The abstract does not indicate a clear path to commercialization or licensing opportunities. Research Credibility: Yes – The paper addresses well-known practices in the contact lens field, suggesting credible background knowledge. IP / Competition: No – There is no indication of unique ideas that might be protectable, nor does it discuss competitive positioning. Overall Relevance: No – While the paper covers soft contact lenses, it does not specifically align with the focus on emerging non-injection molded technologies."
3,Therapeutic Contact Lenses with Polymeric Vehicles for Ocular Drug Delivery: A Review,0.8238,Yes,"Technical Relevance: Yes – The technology discussed directly relates to soft contact lenses and ocular drug delivery. Innovation: Yes – The incorporation of novel polymeric vehicles represents an advancement over conventional ocular drug delivery methods. Feasibility: No – Challenges like drug leaching during storage and potential surface roughness issues raise questions about practical implementation. Regulatory Fit: Yes – The technology appears to align with medical device regulations, but concerns about material safety need further examination. Commercial Potential: Yes – The ability to create drug-eluting contact lenses opens up opportunities for productization in the pharmaceutical market. Research Credibility: Yes – The abstract implies that the review is well-researched, indicating credibility in the field. IP / Competition: Yes – Novel polymeric vehicles may provide opportunities for patent protection, positioning the technology ahead of competitors. Overall Relevance: Yes – The review is relevant as it discusses emerging technologies in soft contact lenses, aligning with the company's research objectives."
4,Poly(vinyl alcohol) Hydrogels Reinforced with Nanocellulose for Ophthalmic Applications: General Characteristics and Optical Properties,0.796677,Yes,"Technical Relevance: Yes – The study focuses on hydrogels, a foundational material for soft contact lenses. Innovation: Yes – The combination of poly(vinyl alcohol) with nanocellulose represents a novel reinforcement strategy aimed at enhancing the mechanical properties of hydrogels. Feasibility: Yes – The development of a transparent hydrogel with high water content and favorable optical properties suggests a practical approach for manufacturing. Regulatory Fit: Yes – The emphasis on UV-blocking properties and potential safety aligns with medical device regulations. Commercial Potential: Yes – The technology’s enhancement of lens properties could lead to new products in the ophthalmic market. Research Credibility: Yes – The concepts presented are grounded in established scientific principles and materials research, suggesting reputable methodology. IP / Competition: Yes – The unique composition using nanocellulose provides a basis for potential intellectual property protections. Overall Relevance: Yes – The paper addresses critical advancements in materials relevant to emerging technologies in soft contact lens manufacturing."
5,Impact of Manufacturing Technology and Material Composition on the Clinical Performance of Hydrogel Lenses,0.821391,Yes,"Technical Relevance: Yes – The study examines manufacturing methods and materials directly applicable to soft contact lenses. Innovation: No – While the paper explores different manufacturing methods, it does not propose any significant new technologies or improvements over existing methods. Feasibility: Yes – The research uses established manufacturing processes, suggesting practical implications for industry adoption. Regulatory Fit: Yes – The focus on clinical performance indicates attention to medical device regulations and safety standards. Commercial Potential: No – The abstract does not indicate a clear path to commercialization or a specific market need addressed by the findings. Research Credibility: Yes – The methodology, including a randomized crossover study, suggests that the research is credible. IP / Competition: No – The findings address known manufacturing methods without a specific focus on novel intellectual property or competitive advantage. Overall Relevance: Yes – The paper provides valuable insights that align with current trends in soft contact lens technologies, particularly regarding manufacturing methods."
6,Development of Contact Lenses and Their Worldwide Use,0.83302,Yes,"Technical Relevance: Yes – The technology discussed is directly related to soft contact lenses and their materials. Innovation: No – The abstract does not present specific novel technologies or improvements beyond existing lenses. Feasibility: No – The focus on historical development does not address pragmatic manufacturing or scalability concerns. Regulatory Fit: Yes – The discussion of safer materials and antimicrobial agents implies adherence to medical device regulations. Commercial Potential: No – While there is mention of growing market trends, there is no clear productization pathway presented. Research Credibility: Yes – The historical context and focus on established developments suggest credible sources. IP / Competition: No – There is no indication of protectable innovations or competitive advantage. Overall Relevance: Yes – The paper provides valuable insights into the evolution of contact lenses, which is relevant to the current research objectives."
7,Initial in Vivo Tear Protein Deposition on Individual Hydrogel Contact Lenses,0.799695,Yes,"Technical Relevance: Yes – The study focuses on the composition and characteristics of protein deposits on soft contact lenses, directly linking to their performance. Innovation: No – The research primarily investigates existing lenses rather than introducing new technologies or methods. Feasibility: Yes – The investigation of protein deposition and lens characteristics can be practically conducted within current manufacturing frameworks. Regulatory Fit: Yes – The study addresses the antigenic reactions and potential complications associated with contact lenses, which are crucial for medical device regulations. Commercial Potential: Yes – Understanding protein deposition can lead to better lens designs and replacement schedules, appealing to manufacturers and users. Research Credibility: Yes – The methods described, including various microscopy techniques and electrophoresis, indicate a solid scientific foundation, though institutional reputation isn't provided. IP / Competition: No – The research primarily describes observations rather than introducing novel methods or materials that can be patented. Overall Relevance: Yes – The findings are pertinent to the ongoing development and optimization of soft contact lenses, aligning with industry interests."
8,Overnight Corneal Reshaping versus Soft Disposable Contact Lenses: Vision-Related Quality-of-Life Differences From a Randomized Clinical Trial,0.808337,Yes,"Technical Relevance: Yes – The technology discussed in the paper pertains directly to contact lenses, specifically comparing overnight corneal reshaping lenses and daily disposable soft lenses. Innovation: No – The paper does not present new technologies but rather compares existing lens types in terms of user experience. Feasibility: Yes – The study's methods are practical and reflect common practices in clinical trials, indicating feasible real-world application. Regulatory Fit: Yes – The study involves FDA-regulated devices and suggests a potential for compliance with medical device safety standards. Commercial Potential: Yes – The findings could facilitate marketing strategies around lens preferences, benefiting manufacturers. Research Credibility: Yes – The methodology involves established clinical trial practices and references credible testing measures. IP / Competition: No – The study primarily compares existing technologies and does not suggest innovative changes that could be patentable. Overall Relevance: Yes – The paper is relevant as it explores user preferences between two established types of soft contact lenses, which aligns with the company's focus on emerging technologies in the field."
9,A Wirelessly Powered Smart Contact Lens with Reconfigurable Wide Range and Tunable Sensitivity Sensor Readout Circuitry,0.812875,Yes,"Technical Relevance: Yes – The technology described directly involves smart contact lenses using biocompatible materials and sensors relevant to soft contact lens applications. Innovation: Yes – The paper demonstrates a novel reconfigurable sensor readout system that improves sensitivity and power efficiency compared to traditional methods. Feasibility: Yes – The use of commercially available materials and standard manufacturing processes suggests practical usability and scalability in manufacturing. Regulatory Fit: Yes – The mention of bio-compatible materials indicates potential compliance with medical device regulations for safety and efficacy. Commercial Potential: Yes – The technology's integration into smart lenses presents opportunities for productization, especially concerning health monitoring and drug delivery. Research Credibility: Yes – The study appears rigorous and likely backed by reputable authors or institutions, indicated by its detailed technical execution. IP / Competition: Yes – The innovation of wirelessly powered and tunable sensor circuitry is likely to have competitive advantages and could be patentable. Overall Relevance: Yes – The paper aligns well with the company's interest in new soft lens technologies, especially non-injection molded methods."
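
Each justification in the output above follows the same pattern: a criterion name, a colon, `Yes` or `No`, then a reason. A minimal sketch of how those verdicts could be pulled into a dictionary per paper is shown below. The helper name `parse_verdicts` and the `CRITERIA` list are hypothetical; the criterion names are copied from the output shown here, and the sketch assumes the model keeps to that format.

```python
import re
from typing import Dict

# Criterion names as they appear in the justification strings above.
CRITERIA = [
    "Technical Relevance",
    "Innovation",
    "Feasibility",
    "Regulatory Fit",
    "Commercial Potential",
    "Research Credibility",
    "IP / Competition",
    "Overall Relevance",
]

# Matches "<criterion>: Yes" or "<criterion>: No" anywhere in the text.
_VERDICT_PATTERN = re.compile(
    r"(" + "|".join(re.escape(name) for name in CRITERIA) + r"):\s*(Yes|No)\b"
)


def parse_verdicts(justification: str) -> Dict[str, str]:
    """Map each criterion found in the text to its Yes/No verdict."""
    return {name: verdict for name, verdict in _VERDICT_PATTERN.findall(justification)}
```

Applied with `df["justification"].apply(parse_verdicts)`, this would allow, for example, comparing the `Overall Relevance` verdict against the `is_relevant` column to spot inconsistent responses.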


## Openalex: trends in relevant papers