In [1]:
import os
import pandas as pd
import simple_icd_10_cm as cm

In [25]:
results_df = pd.read_csv('../assets/results_polars.csv')
results_df

Unnamed: 0,ICD,train_r2,test_r2,rmse,numDataPoints
0,I2510,0.821758,0.685638,0.220995,45743
1,A419,0.817408,0.637543,0.212193,66982
2,I5021,0.847237,0.63699,0.252858,30385
3,I5040,0.861982,0.633038,0.245427,28102
4,I5031,0.831802,0.621393,0.260167,30088
5,I5020,0.800174,0.596512,0.264783,28440
6,E860,0.849868,0.585271,0.219916,24355
7,O80,0.706916,0.53306,0.206021,44662
8,I130,0.753724,0.47885,0.235892,20075
9,O700,0.650249,0.431166,0.192847,42850


The International Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) is the coding system used in the United States to classify diagnoses, symptoms, and reasons for healthcare encounters. Inpatient procedures are coded separately, in ICD-10-PCS. ICD-10-CM is used mainly for billing and statistical reporting. It is organized into chapters, each covering a range of codes and subdivided into more specific categories. The chapters used here are:

1. **Infectious and Parasitic Diseases (A00-B99)**: This category covers diseases caused by bacteria, viruses, fungi, and parasites, such as tuberculosis and HIV disease.

2. **Neoplasms (C00-D49)**: Neoplasms are abnormal growths or tumors, including both benign and malignant (cancerous) tumors.

3. **Diseases of the Blood and Blood-Forming Organs and Certain Disorders Involving the Immune Mechanism (D50-D89)**: This category includes disorders of the blood, bone marrow, and the immune system.

4. **Endocrine, Nutritional, and Metabolic Diseases (E00-E89)**: Covers diseases related to the endocrine system (e.g., diabetes), nutritional deficiencies, and metabolic disorders.

5. **Mental, Behavioral, and Neurodevelopmental Disorders (F01-F99)**: Encompasses mental health disorders and conditions related to behavior and neurodevelopment.

6. **Diseases of the Nervous System (G00-G99)**: Includes neurological conditions and diseases affecting the brain, spinal cord, and nerves.

7. **Diseases of the Eye and Adnexa (H00-H59)**: Covers eye-related conditions and disorders of the adnexa (e.g., eyelids).

8. **Diseases of the Ear and Mastoid Process (H60-H95)**: Addresses conditions related to the ear and mastoid process.

9. **Diseases of the Circulatory System (I00-I99)**: Encompasses cardiovascular diseases, including heart conditions, hypertension, and vascular disorders.

10. **Diseases of the Respiratory System (J00-J99)**: Includes disorders of the respiratory tract, such as pneumonia, asthma, and chronic obstructive pulmonary disease (COPD).

11. **Diseases of the Digestive System (K00-K95)**: Covers gastrointestinal disorders, including conditions related to the stomach, intestines, liver, and pancreas.

12. **Diseases of the Skin and Subcutaneous Tissue (L00-L99)**: Addresses skin conditions and disorders of the subcutaneous tissue.

13. **Diseases of the Musculoskeletal System and Connective Tissue (M00-M99)**: Includes disorders of the musculoskeletal system, bones, muscles, and connective tissues.

14. **Diseases of the Genitourinary System (N00-N99)**: Covers conditions related to the urinary system and male/female reproductive organs.

15. **Pregnancy, Childbirth, and the Puerperium (O00-O9A)**: Encompasses conditions related to pregnancy, childbirth, and the postpartum period.

16. **Certain Conditions Originating in the Perinatal Period (P00-P96)**: Addresses conditions that affect newborns and infants.

17. **Congenital Malformations, Deformations, and Chromosomal Abnormalities (Q00-Q99)**: Covers congenital anomalies and chromosomal disorders.

18. **Symptoms, Signs, and Abnormal Clinical and Laboratory Findings, Not Elsewhere Classified (R00-R99)**: This category includes codes for symptoms, signs, and abnormal test results when a definitive diagnosis hasn't been made.

19. **Injury, Poisoning, and Certain Other Consequences of External Causes (S00-T88)**: Encompasses codes for various injuries, poisonings, and other consequences of external causes.

20. **External Causes of Morbidity (V00-Y99)**: Addresses the external causes of injuries, including accidents, intentional self-harm, and other factors contributing to health problems.

21. **Factors Influencing Health Status and Contact with Health Services (Z00-Z99)**: This category includes codes for encounters with the healthcare system, as well as factors influencing health status.

Each of these chapters is further subdivided into more specific codes that give detailed information about diseases and conditions, which supports precise diagnosis coding and billing.
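Several of the ranges above share or split a first letter: D covers both C00-D49 and D50-D89, H covers both H00-H59 and H60-H95, and V00-Y99 spans V, W, X, and Y. A lookup on the ranges is therefore more faithful than a lookup on the first letter alone. The following is a minimal sketch, assuming codes arrive as strings with or without the dot; `ICD10CM_CHAPTERS` and `icd_chapter` are hypothetical helper names, not part of `simple_icd_10_cm`, and some titles are shortened.

```python
# Inclusive (start, end, title) ranges, taken from the chapter list above.
# Hypothetical helper table; titles are shortened for readability.
ICD10CM_CHAPTERS = [
    ("A00", "B99", "Infectious and Parasitic Diseases"),
    ("C00", "D49", "Neoplasms"),
    ("D50", "D89", "Diseases of the Blood and Blood-Forming Organs"),
    ("E00", "E89", "Endocrine, Nutritional, and Metabolic Diseases"),
    ("F01", "F99", "Mental, Behavioral, and Neurodevelopmental Disorders"),
    ("G00", "G99", "Diseases of the Nervous System"),
    ("H00", "H59", "Diseases of the Eye and Adnexa"),
    ("H60", "H95", "Diseases of the Ear and Mastoid Process"),
    ("I00", "I99", "Diseases of the Circulatory System"),
    ("J00", "J99", "Diseases of the Respiratory System"),
    ("K00", "K95", "Diseases of the Digestive System"),
    ("L00", "L99", "Diseases of the Skin and Subcutaneous Tissue"),
    ("M00", "M99", "Diseases of the Musculoskeletal System and Connective Tissue"),
    ("N00", "N99", "Diseases of the Genitourinary System"),
    ("O00", "O9A", "Pregnancy, Childbirth, and the Puerperium"),
    ("P00", "P96", "Certain Conditions Originating in the Perinatal Period"),
    ("Q00", "Q99", "Congenital Malformations, Deformations, and Chromosomal Abnormalities"),
    ("R00", "R99", "Symptoms, Signs, and Abnormal Clinical and Laboratory Findings"),
    ("S00", "T88", "Injury, Poisoning, and Certain Other Consequences of External Causes"),
    ("V00", "Y99", "External Causes of Morbidity"),
    ("Z00", "Z99", "Factors Influencing Health Status and Contact with Health Services"),
]


def icd_chapter(code: str) -> str:
    """Return the chapter title for an ICD-10-CM code, or 'Unknown Category'."""
    # Compare on the three-character category, e.g. 'I2510' -> 'I25'.
    key = code.replace(".", "").strip().upper()[:3]
    if len(key) < 3:
        return "Unknown Category"
    # Plain string comparison works because digits sort before letters,
    # so 'O99' <= 'O9A' holds as the range O00-O9A requires.
    for start, end, title in ICD10CM_CHAPTERS:
        if start <= key <= end:
            return title
    return "Unknown Category"


for example in ["I2510", "D12.6", "D500", "H6690", "W19"]:
    print(example, "->", icd_chapter(example))
```

Codes that fall outside the 21 ranges listed above return `'Unknown Category'`. The function only checks which range a code belongs to; it does not check that the code itself exists in ICD-10-CM.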

In [28]:
# Map the first letter of an ICD-10-CM code to its major category.
# This is an approximation: two letters straddle chapter boundaries.
icd_categories = {
    'A': 'Infectious and Parasitic Diseases',
    'B': 'Infectious and Parasitic Diseases',
    'C': 'Neoplasms',
    # Caveat: D00-D49 are Neoplasms; only D50-D89 belong to this chapter.
    'D': 'Diseases of the Blood and Blood-Forming Organs',
    'E': 'Endocrine, Nutritional, and Metabolic Diseases',
    'F': 'Mental, Behavioral, and Neurodevelopmental Disorders',
    'G': 'Diseases of the Nervous System',
    # Caveat: H60-H95 are Diseases of the Ear and Mastoid Process.
    'H': 'Diseases of the Eye and Adnexa',
    'I': 'Diseases of the Circulatory System',
    'J': 'Diseases of the Respiratory System',
    'K': 'Diseases of the Digestive System',
    'L': 'Diseases of the Skin and Subcutaneous Tissue',
    'M': 'Diseases of the Musculoskeletal System and Connective Tissue',
    'N': 'Diseases of the Genitourinary System',
    'O': 'Pregnancy, Childbirth, and the Puerperium',
    'P': 'Certain Conditions Originating in the Perinatal Period',
    'Q': 'Congenital Malformations, Deformations, and Chromosomal Abnormalities',
    'R': 'Symptoms, Signs, and Abnormal Clinical and Laboratory Findings',
    'S': 'Injury, Poisoning, and Certain Other Consequences of External Causes',
    'T': 'Injury, Poisoning, and Certain Other Consequences of External Causes',
    # V00-Y99 spans four letters: V, W, X, and Y.
    'V': 'External Causes of Morbidity',
    'W': 'External Causes of Morbidity',
    'X': 'External Causes of Morbidity',
    'Y': 'External Causes of Morbidity',
    'Z': 'Factors Influencing Health Status and Contact with Health Services'
}


In [29]:
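# Look up the major category from the first letter of each code.
results_df['code_category'] = (
    results_df['ICD'].str[0].map(icd_categories).fillna('Unknown Category')
)
# Fetch the official description, guarding against codes the library does not know.
results_df['code_label'] = results_df['ICD'].map(
    lambda code: cm.get_description(code) if cm.is_valid_item(code) else 'Unknown Code'
)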
results_df

Unnamed: 0,ICD,train_r2,test_r2,rmse,numDataPoints,code_category,code_label
0,I2510,0.821758,0.685638,0.220995,45743,Diseases of the Circulatory System,Atherosclerotic heart disease of native coronary artery without angina pectoris
1,A419,0.817408,0.637543,0.212193,66982,Infectious and Parasitic Diseases,"Sepsis, unspecified organism"
2,I5021,0.847237,0.63699,0.252858,30385,Diseases of the Circulatory System,Acute systolic (congestive) heart failure
3,I5040,0.861982,0.633038,0.245427,28102,Diseases of the Circulatory System,Unspecified combined systolic (congestive) and diastolic (congestive) heart failure
4,I5031,0.831802,0.621393,0.260167,30088,Diseases of the Circulatory System,Acute diastolic (congestive) heart failure
5,I5020,0.800174,0.596512,0.264783,28440,Diseases of the Circulatory System,Unspecified systolic (congestive) heart failure
6,E860,0.849868,0.585271,0.219916,24355,"Endocrine, Nutritional, and Metabolic Diseases",Dehydration
7,O80,0.706916,0.53306,0.206021,44662,"Pregnancy, Childbirth, and the Puerperium",Encounter for full-term uncomplicated delivery
8,I130,0.753724,0.47885,0.235892,20075,Diseases of the Circulatory System,"Hypertensive heart and chronic kidney disease with heart failure and stage 1 through stage 4 chronic kidney disease, or unspecified chronic kidney disease"
9,O700,0.650249,0.431166,0.192847,42850,"Pregnancy, Childbirth, and the Puerperium",First degree perineal laceration during delivery


ICD Code: A123
Major Category: ['Infectious and Parasitic Diseases']
