# The REG202_PROC_STEP_TEXT Table

This overview covers the ``REG202_PROC_STEP_TEXT`` table, which holds the text attached to procedural steps recorded in the ``REG201_PROC_STEP`` table. Each record stores one text for one procedural step, together with a type that defines the role or category of that text. These texts add context to each step of the patent application process and help clarify the status or action involved.

While ``REG201_PROC_STEP`` records structured data on procedural actions, ``REG202_PROC_STEP_TEXT`` adds explanatory text that helps users interpret each step. Each record is categorized by the ``STEP_TEXT_TYPE`` attribute, which states whether the text describes the step itself, references a particular year, or lists extension states. This context is useful for analysts, researchers, and policymakers who need more than codes and classifications to follow the patent process.

In [4]:
from epo.tipdata.patstat import PatstatClient
from epo.tipdata.patstat.database.models import REG201_PROC_STEP, REG101_APPLN, REG202_PROC_STEP_TEXT
from sqlalchemy import select, func, case, and_

patstat = PatstatClient(env='PROD')

db = patstat.orm()

## Key Fields in the REG202_PROC_STEP_TEXT Table

### ID (Primary Key)
The ``ID`` field serves as a technical identifier that uniquely connects patent applications across various tables.

### STEP_ID (Primary Key)
The ``STEP_ID`` is a unique identifier assigned to each procedural step within the patent application process. It is present in multiple tables, including ``REG201_PROC_STEP``, ``REG202_PROC_STEP_TEXT``, ``REG203_PROC_STEP_DATE``, ``REG721_PROC_STEP``, ``REG722_PROC_STEP_TEXT``, and ``REG723_PROC_STEP_DATE``. This attribute serves as a key reference for identifying and linking procedural steps across various datasets. With a domain of up to 30 characters, it ensures precise tracking and organization of procedural events associated with patent applications.

### STEP_TEXT
The ``STEP_TEXT`` attribute represents the text associated with a procedural step in the patent process. It provides a description or explanation related to the specific procedural step identified in the ``REG201_PROC_STEP`` table. This text can offer additional details or context about the step, helping users better understand its significance and actions. The field can contain up to approximately 300 characters, and if no description is available, the value is left as an empty string.

### STEP_TEXT_TYPE (Primary Key)
The ``STEP_TEXT_TYPE`` attribute defines the type of text associated with a procedural step. It categorizes the text by content or purpose, such as a description of the step, a specific year reference, or details about extension states, so users can tell in which context the text is used. The field allows up to 30 characters and does not have a default value. Examples of text types include ``STEP_DESCRIPTION``, ``YEAR``, and ``Extension states``.

In [5]:
q = db.query(
    REG202_PROC_STEP_TEXT.id,
    REG202_PROC_STEP_TEXT.step_id,
    REG202_PROC_STEP_TEXT.step_text,
    REG202_PROC_STEP_TEXT.step_text_type,
    REG201_PROC_STEP.step_phase,
    REG201_PROC_STEP.step_result,
    REG201_PROC_STEP.step_country,
    REG201_PROC_STEP.time_limit
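).join(
    REG201_PROC_STEP,
    # Match on both key columns so each text row pairs only with its own step
    and_(
        REG202_PROC_STEP_TEXT.id == REG201_PROC_STEP.id,
        REG202_PROC_STEP_TEXT.step_id == REG201_PROC_STEP.step_id
    )
).limit(1000)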

res = patstat.df(q)
res

| | id | step_id | step_text | step_text_type | step_phase | step_result | step_country | time_limit |
|---|---|---|---|---|---|---|---|---|
| 0 | 15759285 | STEP_ISAT_3066343 | International searching authority | STEP_DESCRIPTION | REGEN | | | |
| 1 | 10719527 | STEP_ABEX_10361570 | Amendments | STEP_DESCRIPTION | EXAMN | | | |
| 2 | 19169723 | STEP_PROL_4132942 | en | procedure language | EXAMN | | | |
| 3 | 93915644 | STEP_PFEE_3778507 | Penalty fee Rule 85a EPC 1973 | STEP_DESCRIPTION | EXAMN | | | 01 |
| 4 | 4774073 | STEP_1105006 | Payment of the first designation fee | STEP_DESCRIPTION | REGEN | | | |
| ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 995 | 21199972 | STEP_PROL_4892796 | Language of the procedure | STEP_DESCRIPTION | EXAMN | | | |
| 996 | 99956632 | RENEWAL_14433776 | Renewal fee payment | STEP_DESCRIPTION | UNDEF | | | |
| 997 | 12806105 | RENEWAL_6631784 | 05 | YEAR | UNDEF | | | |
| 998 | 21859964 | STEP_PROL_4873544 | Language of the procedure | STEP_DESCRIPTION | EXAMN | | | |
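
Since ``STEP_TEXT_TYPE`` is part of the key, one procedural step can carry several text rows, one per type. The sketch below regroups such rows into one dictionary per step, using plain Python rather than the PATSTAT client. The first three rows are copied from the output above; the last row is hypothetical and is only there to show a second text type on the same step. The name `texts_by_step` is an illustrative choice, not part of any library.

```python
from collections import defaultdict

# Columns: (id, step_id, step_text, step_text_type).
# The first three rows come from the query output above; the last row is
# HYPOTHETICAL and only illustrates a second text type on the same step.
rows = [
    (99956632, "RENEWAL_14433776", "Renewal fee payment", "STEP_DESCRIPTION"),
    (12806105, "RENEWAL_6631784", "05", "YEAR"),
    (19169723, "STEP_PROL_4132942", "en", "procedure language"),
    (12806105, "RENEWAL_6631784", "Renewal fee payment", "STEP_DESCRIPTION"),  # hypothetical
]

# Group by (id, step_id); within a step, the text type selects the text.
texts_by_step = defaultdict(dict)
for appln_id, step_id, step_text, step_text_type in rows:
    texts_by_step[(appln_id, step_id)][step_text_type] = step_text

for key, texts in texts_by_step.items():
    print(key, texts)
```

With the real query result, the same regrouping could be applied to the rows of `res` instead of the hand-written list.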


We can show the list of distinct step text types.

In [6]:
q = db.query(
    REG202_PROC_STEP_TEXT.step_text_type
).distinct()

res = patstat.df(q)
res

| | step_text_type |
|---|---|
| 0 | Extension states |
| 1 | procedure language |
| 2 | approval |
| 3 | YEAR |
| 4 | Validation states |
| 5 | STEP_IDENTIFICATION |
| 6 | COUNTRY |
| 7 | sequence-number |
| 8 | Kind of amendment |
| 9 | APPLICATION_NUMBER |
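
A natural follow-up to the distinct list is to count how many rows each text type has. The sketch below shows that aggregation against an in-memory SQLite table that merely stands in for ``REG202_PROC_STEP_TEXT``; it is not the PATSTAT database. The four rows are copied from the first query's output, the lowercase table name is an assumption made for the example, and the composite key follows the fields marked as primary key in this overview.

```python
import sqlite3

# In-memory stand-in for REG202_PROC_STEP_TEXT (assumption: simplified schema).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE reg202_proc_step_text ("
    " id INTEGER, step_id TEXT, step_text TEXT, step_text_type TEXT,"
    " PRIMARY KEY (id, step_id, step_text_type))"
)

# Sample rows copied from the first query's output.
conn.executemany(
    "INSERT INTO reg202_proc_step_text VALUES (?, ?, ?, ?)",
    [
        (15759285, "STEP_ISAT_3066343", "International searching authority", "STEP_DESCRIPTION"),
        (10719527, "STEP_ABEX_10361570", "Amendments", "STEP_DESCRIPTION"),
        (19169723, "STEP_PROL_4132942", "en", "procedure language"),
        (12806105, "RENEWAL_6631784", "05", "YEAR"),
    ],
)

# Count rows per text type, most frequent first; ties sorted by type name.
counts = conn.execute(
    "SELECT step_text_type, COUNT(*) AS n"
    " FROM reg202_proc_step_text"
    " GROUP BY step_text_type"
    " ORDER BY n DESC, step_text_type"
).fetchall()
print(counts)
```

Against the real table, the same aggregation could presumably be written with SQLAlchemy's `func.count()` and `group_by` on `REG202_PROC_STEP_TEXT.step_text_type`; that variant is not shown here because it needs the PATSTAT environment to run.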
