If you're opening this notebook on Colab, you will probably need to install 🤗 `Transformers` and 🤗 `Datasets`, along with a few other dependencies:

* `datasets`
* `transformers`
* `rouge-score`
* `nltk`
* `torch` (PyTorch)
* `ipywidgets`

*Note*: Training runs on the GPU to keep it reasonably fast, so `CUDA` needs to be available on the device.
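
A quick way to confirm that a GPU is actually visible before training is to ask PyTorch directly. The sketch below is an illustration rather than part of the original notebook: the helper name `pick_device` is hypothetical, and it simply falls back to `"cpu"` when `torch` is not installed or no CUDA device is found.

```python
def pick_device():
    """Return "cuda" when PyTorch can see a CUDA GPU, otherwise "cpu"."""
    try:
        import torch  # imported lazily so the check also runs without PyTorch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


# `device` is an illustrative name, not a variable used elsewhere in the notebook.
device = pick_device()
print(f"Training would run on: {device}")
```

If this prints `cpu` on Colab, the runtime type has probably not been switched to a GPU runtime yet.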

In [1]:
! pip install datasets transformers rouge-score nltk ipywidgets

Collecting datasets
  Downloading datasets-1.18.3-py3-none-any.whl (311 kB)
Collecting transformers
  Downloading transformers-4.17.0-py3-none-any.whl (3.8 MB)
Collecting rouge-score
  Downloading rouge_score-0.0.4-py2.py3-none-any.whl (22 kB)
Collecting fsspec[http]>=2021.05.0
  Downloading fsspec-2022.2.0-py3-none-any.whl (134 kB)
Collecting huggingface-hub<1.0.0,>=0.1.0
  Downloading huggingface_hub-0.4.0-py3-none-any.whl (67 kB)
Collecting xxhash
  Downloading xxhash-3.0.0-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (212 kB)
Collecting aiohttp
  Downloading aiohttp-3.8.1-cp37-cp37m-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_12_x86_64.manylinux201

When using `nltk`, the `punkt` tokenizer data also needs to be downloaded, since it is not installed along with the package. Without `punkt`, the analysis will fail with an error.

In [2]:
import nltk
nltk.download('punkt')

[nltk_data] Downloading package punkt to /root/nltk_data...
[nltk_data]   Unzipping tokenizers/punkt.zip.


True
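
To avoid downloading `punkt` again on every run, it is possible to check first whether the data is already present. This is a minimal sketch, assuming the resource path `tokenizers/punkt` that the download log above unpacks to; the helper name `punkt_available` is hypothetical.

```python
def punkt_available():
    """Return True when nltk is importable and the punkt data can be located."""
    try:
        import nltk  # imported lazily so the check also runs without nltk
    except ImportError:
        return False
    try:
        # nltk.data.find raises LookupError when the resource is missing.
        nltk.data.find("tokenizers/punkt")
        return True
    except LookupError:
        return False


print("punkt ready:", punkt_available())
```

When this prints `False`, running `nltk.download('punkt')` as in the cell above should fix it.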

If you're opening this notebook locally, make sure your environment has the latest versions of those libraries installed.

To be able to share your model with the community and generate results like the one shown in the picture below via the inference API, there are a few more steps to follow.

First you have to store your authentication token from the Hugging Face website (sign up [here](https://huggingface.co/join) if you haven't already!), so execute the following cell and enter your credentials when prompted:

In [3]:
from huggingface_hub import notebook_login

notebook_login()

Login successful
Your token has been saved to /root/.huggingface/token
Authenticated through git-credential store but this isn't the helper defined on your machine.
You might have to re-authenticate when pushing to the Hugging Face Hub. Run the following command in your terminal in case you want to set this credential helper as the default

git config --global credential.helper store


Then you need to install `Git-LFS`.

If you are not using `Google Colab`, you may need to install `Git-LFS` manually, since the command below may not work on your operating system. You can read about `Git-LFS` and how to install it [here](https://git-lfs.github.com/).

In [4]:
! apt install git-lfs

Reading package lists... Done
Building dependency tree       
Reading state information... Done
The following package was automatically installed and is no longer required:
  libnvidia-common-470
Use 'apt autoremove' to remove it.
The following NEW packages will be installed:
  git-lfs
0 upgraded, 1 newly installed, 0 to remove and 39 not upgraded.
Need to get 2,129 kB of archives.
After this operation, 7,662 kB of additional disk space will be used.
Get:1 http://archive.ubuntu.com/ubuntu bionic/universe amd64 git-lfs amd64 2.3.4-1 [2,129 kB]
Fetched 2,129 kB in 1s (2,458 kB/s)
Selecting previously unselected package git-lfs.
(Reading database ... 155320 files and directories currently installed.)
Preparing to unpack .../git-lfs_2.3.4-1_amd64.deb ...
Unpacking git-lfs (2.3.4-1) ...
Setting up git-lfs (2.3.4-1) ...
Processing triggers for man-db (2.8.3-2ubuntu0.1) ...


Make sure your version of `Transformers` is at least 4.11.0, since the Hub-sharing functionality used in this notebook was introduced in that version:

In [5]:
import transformers

print(transformers.__version__)

4.17.0
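
Reading the printed version by eye works, but comparing version strings directly can mislead: as plain strings, `"4.9.2"` sorts after `"4.11.0"`. The sketch below compares the numeric parts instead. It is a simplified illustration using only the standard library (Python 3.8+ for `importlib.metadata`); the helper names `version_tuple` and `meets_minimum` are hypothetical.

```python
import re
from importlib import metadata


def version_tuple(version):
    """Turn "4.17.0" (or "4.17.0.dev0") into (4, 17, 0) from the leading numeric parts."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    # Pad short versions such as "4.11" so they compare equal to "4.11.0".
    parts += [0] * max(0, 3 - len(parts))
    return tuple(parts)


def meets_minimum(installed, minimum="4.11.0"):
    """Compare versions numerically, so that 4.9.x is correctly older than 4.11.0."""
    return version_tuple(installed) >= version_tuple(minimum)


try:
    installed = metadata.version("transformers")
    print(installed, "OK" if meets_minimum(installed) else "too old, please upgrade")
except metadata.PackageNotFoundError:
    print("transformers is not installed in this environment")
```

For stricter handling of pre-release and development versions, a dedicated version-parsing library would be a better fit than this sketch.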


You can find a script version of this notebook to fine-tune your model in a distributed fashion using multiple GPUs or TPUs [here](https://github.com/huggingface/transformers/tree/master/examples/seq2seq).

# Fine-tuning a model on a summarization task

In this notebook, we will see how to fine-tune one of the [🤗`Transformers`](https://github.com/huggingface/transformers) models for a summarization task. We will use the [PubMed Summarization dataset](https://huggingface.co/datasets/ccdv/pubmed-summarization), which contains PubMed articles accompanied by their abstracts.

![Widget inference on a summarization task](https://github.com/huggingface/notebooks/blob/master/examples/images/summarization.png?raw=1)

We will see how to easily load the dataset for this task using 🤗 `Datasets` and how to fine-tune a model on it using the `Trainer` API.

In [6]:
model_checkpoint = "sshleifer/distilbart-xsum-12-1"

This notebook is built to run with any model checkpoint from the [Model Hub](https://huggingface.co/models) as long as that model has a sequence-to-sequence version in the Transformers library. Here we picked the [`sshleifer/distilbart-xsum-12-1`](https://huggingface.co/sshleifer/distilbart-xsum-12-1) checkpoint.

## Loading the dataset

We will use the [🤗 `Datasets`](https://github.com/huggingface/datasets) library to download the data and get the metric we need to use for evaluation (to compare our model to the benchmark). This can be easily done with the functions `load_dataset` and `load_metric`.  

In [7]:
from datasets import load_dataset, load_metric

raw_datasets = load_dataset("ccdv/pubmed-summarization")
metric = load_metric("rouge")

No config specified, defaulting to: pub_med_summarization_dataset/document

Downloading and preparing dataset pub_med_summarization_dataset/document to /root/.cache/huggingface/datasets/ccdv___pub_med_summarization_dataset/document/1.0.0/5792402f4d618f2f4e81ee177769870f365599daa729652338bac579552fec30...

Dataset pub_med_summarization_dataset downloaded and prepared to /root/.cache/huggingface/datasets/ccdv___pub_med_summarization_dataset/document/1.0.0/5792402f4d618f2f4e81ee177769870f365599daa729652338bac579552fec30. Subsequent calls will reuse this data.
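
To give an intuition for what the ROUGE metric loaded above measures, here is a deliberately simplified sketch of ROUGE-1, which scores the unigram overlap between a generated summary and a reference. It is not the `rouge_score` implementation: it uses lowercase whitespace tokens and no stemming, and the function name `rouge1` is hypothetical.

```python
from collections import Counter


def rouge1(prediction, reference):
    """Simplified ROUGE-1: unigram overlap with lowercase whitespace tokens."""
    pred_counts = Counter(prediction.lower().split())
    ref_counts = Counter(reference.lower().split())
    # Clipped overlap: a token is matched at most as often as it occurs in both texts.
    overlap = sum((pred_counts & ref_counts).values())
    precision = overlap / max(sum(pred_counts.values()), 1)
    recall = overlap / max(sum(ref_counts.values()), 1)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


scores = rouge1("the cat sat on the mat", "the cat lay on the mat")
print(scores)
```

Precision is the share of predicted tokens found in the reference, recall is the share of reference tokens recovered, and the F1 score balances the two.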

The `raw_datasets` object itself is a [`DatasetDict`](https://huggingface.co/docs/datasets/package_reference/main_classes.html#datasetdict), which contains one key each for the training, validation, and test sets:

In [8]:
raw_datasets

DatasetDict({
    train: Dataset({
        features: ['article', 'abstract'],
        num_rows: 119924
    })
    validation: Dataset({
        features: ['article', 'abstract'],
        num_rows: 6633
    })
    test: Dataset({
        features: ['article', 'abstract'],
        num_rows: 6658
    })
})

To access an actual element, you need to select a split first, then give an index:

In [9]:
raw_datasets["train"][0]

{'abstract': "<S> background : the present study was carried out to assess the effects of community nutrition intervention based on advocacy approach on malnutrition status among school - aged children in shiraz , iran.materials and methods : this case - control nutritional intervention has been done between 2008 and 2009 on 2897 primary and secondary school boys and girls ( 7 - 13 years old ) based on advocacy approach in shiraz , iran . </S> <S> the project provided nutritious snacks in public schools over a 2-year period along with advocacy oriented actions in order to implement and promote nutritional intervention . for evaluation of effectiveness of the intervention growth monitoring indices of pre- and post - intervention were statistically compared.results:the frequency of subjects with body mass index lower than 5% decreased significantly after intervention among girls ( p = 0.02 ) . </S> <S> however , there were no significant changes among boys or total population . </S> <S> 

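Since the PubMed dataset is very large (about 120,000 training examples), we keep only a subset of each split: 8,000 training examples, 2,000 validation examples, and 2,000 test examples. Note that the ranges below start at index 1, so the first row of each split is skipped.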

In [10]:
raw_datasets["train"] = raw_datasets["train"].select(range(1, 8001))
raw_datasets["validation"] = raw_datasets["validation"].select(range(1, 2001))
raw_datasets["test"] = raw_datasets["test"].select(range(1, 2001))
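
The sizes above follow from how `range` works: `range(1, 8001)` covers indices 1 through 8000, which is 8,000 rows with row 0 left out. The sketch below checks this with plain Python and no dataset download; the helper name `subset_indices` is hypothetical.

```python
def subset_indices(size, start=1):
    """Indices kept by `.select(range(start, start + size))`."""
    return range(start, start + size)


train_idx = subset_indices(8000)
eval_idx = subset_indices(2000)

# Number of rows kept, plus the first and last index of each selection.
print(len(train_idx), train_idx[0], train_idx[-1])
print(len(eval_idx), eval_idx[0], eval_idx[-1])
# Row 0 is not part of either selection because the ranges start at 1.
print(0 in train_idx)
```

If the first row should be kept as well, `range(8000)` would select rows 0 through 7999 instead.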

To get a sense of what the data looks like, the following function will show some examples picked randomly in the dataset.

In [11]:
import datasets
import random
import pandas as pd
from IPython.display import display, HTML

def show_random_elements(dataset, num_examples=5):
    assert num_examples <= len(dataset), "Can't pick more elements than there are in the dataset."
    picks = []
    for _ in range(num_examples):
        pick = random.randint(0, len(dataset)-1)
        while pick in picks:
            pick = random.randint(0, len(dataset)-1)
        picks.append(pick)
    
    df = pd.DataFrame(dataset[picks])
    for column, typ in dataset.features.items():
        if isinstance(typ, datasets.ClassLabel):
            df[column] = df[column].transform(lambda i: typ.names[i])
    display(HTML(df.to_html()))
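
The retry loop in `show_random_elements` exists only to avoid picking the same row twice. As a hedged alternative, `random.sample` draws distinct indices in a single call. The helper name `pick_unique_indices` and its `seed` parameter are assumptions added for illustration; they are not part of the original function.

```python
import random


def pick_unique_indices(dataset_length, num_examples=5, seed=None):
    """Draw `num_examples` distinct row indices from range(dataset_length)."""
    if num_examples > dataset_length:
        raise ValueError("Can't pick more elements than there are in the dataset.")
    rng = random.Random(seed)  # a local generator leaves the global random state untouched
    return rng.sample(range(dataset_length), num_examples)


picks = pick_unique_indices(8000, num_examples=5, seed=0)
print(picks)
```

The resulting list can be used the same way as `picks` in the function above, for example `dataset[picks]`.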

In [12]:
show_random_elements(raw_datasets["train"])

Unnamed: 0,article,abstract
0,"the most common site is legs and melanomas in men are most common on the back . melanoma of the clivus is an extremely rare case presentation with only a few cases reported in the literature . conventional imaging techniques like computed tomography ( ct ) and magnetic resonance imaging ( mri ) may be suboptimal in evaluating such tumor , and may lead to inaccurate staging . a multimodality whole body imaging technique , 2-deoxy-2-[18f ] fluoro - d - glucose positron emission tomography / ct ( 18f - fdg pet / ct ) is being increasingly used in oncology for staging of multiple malignancies to know the spread of the tumor in the body . this rare case is important because it highlights the extensive disease that can be caused by a clival tumor and the role of noninvasive imaging , that is , 18f - fdg pet / ct in correct staging and hence , guiding further management of the disease . a 55-year - old woman , presented to the hospital with chief complaints of headache , decreased vision in the left eye , and occasional episodes of vomiting since 3 months . mri brain revealed altered signal intensity lesion with solid , hemorrhagic , and few cystic components in basiocciput , basisphenoid , clivus , sella , and right petrous apex ; displacing optic chiasma superiorly . there was associated soft tissue component extending into cavernous sinus with partial encasement of cavernous segment of right internal carotid artery . cemr study revealed a large moderately enhancing mass lesion involving the clivus with sellar - suprasellar extension with encasement of bilateral internal carotid arteries suggestive of plasmacytoma / chordoma or metastasis [ figure 1 ] . she underwent endonasal transsphenoidal excision of clival tumor and the black colored , relatively avascular tumor was confirmed to be melanocytic melanoma of clivus on histopathological examination . 
the patient was thoroughly examined to rule out any lesion on the skin and mucosa with other investigations including chest x - ray . a week after the surgery , this patient was referred to our department for a whole body 18f - fdg pet / ct scan for restaging . whole body pet - ct scan was performed after intravenous ( iv ) administration of 10 mci of 18f - fdg . pet and contrast - enhanced ct images were acquired and reconstructed to obtain transaxial , coronal , and sagittal views . the study revealed residual hypermetabolic well - defined lobulated soft tissue lesion in the basisphenoid and sella turcica region extending into the extraaxial space of right middle cranial fossa causing destruction of the sella turcica , sphenoid sinus , dorsal sella , and clivus ; suggestive of residual disease . also multiple metabolically active skeletal lesions were noted suggestive of skeletal metastasis [ figure 2 ] . ( a and b ) magnetic resonance images - t1 and t2 weighted axial sections of the brain ( preoperative ) showing altered signal intensity lesion with solid , hemorrhagic , and few cystic components in basiocciput , basisphenoid , clivus , sella , and right petrous apex ; displacing optic chiasma superiorly associated with soft tissue component extending into cavernous sinus with partial encasement of cavernous segment of right internal carotid artery ( a ) maximal intensity projection image of the patient from base of skull to mid - thigh showing focal areas of hypermetabolism throughout the body corresponding to multiple metastatic skeletal lesions . 
physiological uptake noted in heart , liver , bowel , kidneys , and urinary bladder , ( b ) sagittal positron emission tomography and fused pet - computed tomography images reveal abnormal fluoro-2-deoxy - d - glucose uptake in spinal column corresponding to lytic lesions on ct , ( c ) metabolically active well - defined lobulated soft tissue lesion in basisphenoid and sella turcica region , extending into the extraaxial space of right middle cranial fossa and indenting the medial temporal lobe causing destruction of the sella turcica , sphenoid sinus , dorsal sella , and clivus , ( d ) hypermetabolic lytic intradiploic lesions noted in left anterior frontal , high frontal , and parietal region eighty five percent of the patients diagnosed in early stages can be cured with surgery . primary intracranial malignant melanoma is a rare entity with incidence estimated to be 0.005 cases per 100,000 population . the age of the patients usually range from 15 - 71 years , with a peak incidence in the 5 decade . symptoms at presentation include headache ; vomiting due to intracranial hypertension ; hydrocephalus ; focal neurological deficits due to compression of the brain , spinal cord , or cauda equina ; subarachnoid hemorrhage ; and seizures . to our knowledge , very few cases of primary melanoma of the clivus have been cited in the literature previously . metastases involving this area have been previously described as a single case report or included in series with other skull base tumors . in 2009 , a literature review was performed by pallini et al . , which reveals that out of 46 patients who underwent surgery for clival bone tumor , seven proved to be metastatic , representing 0.18 and 0.42% , respectively of intracranial and skull base tumors which were treated in their institution in the study period between january 1995 and december 2007 . 
the primary tumors associated were lung adenocarcinoma ( n = 2 ) , prostate carcinoma ( n = 2 ) , skin melanoma ( n = 1 ) , hepatocarcinoma ( n = 1 ) , and lung squamous cell carcinoma ( n = 1 ) . in 2010 , chaudhary et al . , presented a case of an atypical clival meningeal melanoma treated with a multidisciplinary staged transcrural and transsphenoidal endoscopic surgical approach . no other metastases was evident for 2 years after initial symptoms and with no evidence of a cutaneous source , diagnosis of a primary meningeal lesion of the clivus was made . bone metastases occur in a significant proportion of patients with metastatic melanoma . in such patients survival , in 108 patients in 2008 , which revealed median survival following diagnosis of bone metastases in malignant melanoma to be 3.2 months ( range 0.3 - 47.4 months ) . bone metastases most commonly occurred in patients with the primary melanoma originating on the back and lower limbs and spine was the commonest site of bone involvement , followed by ribs , pelvis , long bones , and skull . fdg pet is a sensitive and specific technique for patients with melanoma but has limitations with small ( less than 1 cm ) , pulmonary , and brain metastases . it is felt to be superior to ct alone in detecting abdominal , nodal , subcutaneous , and skin sites . it is useful in assessing extent of disease in patients with surgically resectable disease by conventional methods as it may render them unresectable in a considerable population . in our patient , surgical removal of the tumor was done from an outside institution and was referred to us for further management . pet - ct scan was performed for the patient in view of histological diagnosis of melanocytic melanoma . also , no primary site could be localized after thorough general examination of the skin . the findings on pet - ct scan suggest that the clival mass represents the primary site of malignancy . 
this case adds to the literature on the occurrence of intracranial malignant melanoma in patients with extensive skeletal metastasis , and supports the finding that such tumors need to be followed carefully .","<S> malignant melanoma of the clivus is a rare entity , for which there is little evidence - based literature for guiding clinicians to understand the importance of disease staging via noninvasive imaging strategy . </S> <S> this report highlights the case of a 55-year - old lady with histopathologically confirmed melanocytic melanoma of the clivus postoperative status , with multiple skeletal metastasis , demonstrated on 2-deoxy-2-[18f ] fluoro - d - glucose positron emission tomography / computed tomography ( 18f - fdg pet / ct scan ) . the experience gained with this patient demonstrates the feasibility and usefulness of this noninvasive application in accurate staging and hence , correct decision making regarding further treatment . </S>"
1,"eight adult cats weighing between 2.9 and 4.4 ( mean , 3.9 ) kg were used in this study . each animal was anesthetized with 5% halothane administered through an endotracheal tube by a mechanical ventilator ( mini-7 ; royal medical , seoul , korea ) . pancuronium bromide ( panslan ; reyon pharmaceutical , seoul , korea ) ( 0.6 mg / kg ) was used for skeletal muscle relaxation . to monitor blood pressure and allow pulse - gated mr imaging , a femoral artery was cannulated , and for the administration of drugs and contrast agents , this procedure was applied to a femoral vein . after a left lateral thoracotomy along the fifth intercostal space , pericardiotomy was performed by means of a midline incision , and a pericardial cradle was prepared by attaching the margins of the dissected pericardium to the adjacent thoracic wall . the left anterior descending ( lad ) coronary artery was isolated distal to the first diagonal branch , and a snare loop was made with 4 - 0 silk placed in a slender plastic tube . occlusion or reperfusion of the lad artery was achieved simply by fastening or releasing the snare loop . obstruction of this artery was confirmed by observing changes in the color of the myocardium at risk during a preliminary test occlusion . in each cat the lad artery was occluded for 90 minutes , and this was followed by 90 minutes of reperfusion . gd - dtpa has been widely used as a clinical mr contrast agent . after injection , it is rapidly distributed from the vascular compartment to the extracellular fluid space . in rats , the plasma half - life is approximately 12 minutes ( 8) , and due to a rapidly changing blood concentration resulting from the large volume distributed , the utility of gd - dtpa for the mr diagnosis of acute myocardial ischemia is therefore limited ( 9 , 10 , 11 ) . 
gadomer-17 , provided by schering , is designed for prolonged intravascular retention and reduced transcapillary diffusion and has a molecular weight ( mw ) of 17453 g / mol . noh et al . demonstrated that following the administration of gadomer-17 , the signal intensity of the enhanced area seen on t1-weighted images increased rapidly , and maximum enhancement was detected during a 40 - 60 minute period ( 7 ) . gadophrin-2 , a necrosis - avid contrast material , consists of mesoporphyrins linked to gadolinium , and its mw is 1697.25 g / mol . the methods by which it is synthesized , and its chemical structure , physiochemical properties and imaging behaviors have been previously described in detail ( 12 ) . mr imaging was performed on a 1.5 t magnetom vision system ( siemens , erlangen , germany ) . for signal reception , a circularly polarized head array coil 27 cm in diameter ( siemens , erlangen , germany ) was used . during the imaging procedure electrocardiography - triggered breath - hold turbo spin - echo t2-weighted mr images were obtained along the short axis of the heart prior to the injection of contrast agents , and in order to obtain additional information as to myocardial status , images in the sagittal plane were also acquired . the acquisition parameters for t2-weighted mr images were as follows : repetition time msec/ echo time msec of 400 - 600 ( according to heart rate ) / 82 , echo train length of 33 , acquisition time of 9 - 10 seconds , matrix size of 132256 , field of view of 210280 mm , and section thickness of 5 mm . after acquiring the baseline image and intravenously injecting contrast medium , gadomer-17-enhanced t1-weighted mr images were obtained dynamically for one hour . after visual comfirmation of complete washout of gadomer-17 ( 3 hours after its injection ) , gadophrin-2 was administered , and contrast - enhanced t1-weighted mr images were obtained for the next hour . 
electrocardiography - triggered multisection t1-weighted spin - echo imaging was performed with the following imaging parameters : repetition time msec/ echo time msec of 300 - 500 ( according to heart rate)/ 25 , section thickness of 5 mm , field of view of 210280 mm , and one signal acquisition . all images were obtained along the short axis of the heart , and to provide additional confirmation of the signal enhancement of irreversibly damaged myocardium , images in the sagittal plane were also occasionally acquired . after mr imaging studies were completed , each cat was sacrificed by intravenous injection of kcl solution . the heart was excised and cut into five or six consecutive slices , 5 mm thick , in the same planes in which the mr images were obtained . the specimens were immersed in 1.5% ttc solution at 36 and stained for 15 minutes , and were then stored in 10% formalin solution for 12 hours . . photographs of ttc - stained specimens in the same planes , for which mr images were obtained along the short axis of the heart , were scanned into a computer ( macintosh ; apple computer inc . , u.s.a . ) to measure the size of the infarct area and of the total left ventricle mass using public domain image processing software ( nih image 1.55 ; national institutes of health , bethesda , md . , u.s.a . ) . all mr images were independently analyzed by two experienced radiologists , and discrepancies were resolved by consensus . the size of the infarct area in the myocardium of the left ventricle was measured by outlining the high signal area seen on t2-weighted images and enhanced areas on gadomer-17- and gadophrin-2-enhanced t1-weighted images . the size of the infarct area was expressed as a percentage of the size of the total left ventricle , as revealed by mr imaging and ttc histochemical staining . 
the size of the enhanced areas seen on gadomer-17 and gadophrin-2-enhanced t1-weighted images , and of the high signal area on t2-weighted images , was compared with that of the infarct area disclosed by ttc histochemical staining . to determine statistical significance ( defined as p < 0.05 ) , a paired student t test was used . electron microscopic examinations were performed on tissue taken from three areas . that from the infarct area was sampled from the centers of ttc - unstained areas ; for the lateral border zone , tissue from the ttc - stained peripheral region adjacent to the ttc - unstained area ( 2 mm apart from that area ) was sampled ; and for normal myocardium , tissue was taken from the center of the ttc - stained area of the posterior wall . after collection , tissues were cut into 1-mm cubes and fixed in a 2.5% buffered glutaraldehyde solution for 12 to 16 hours followed by additional fixation in a solution of osmotic acid at 5 for 2 hours . the cubes were then dehydrated in graded alcohol at room temperature , passed through propylene oxide , and placed in a 1:1 mixture of propylene oxide and epon 812 ( polyscience inc . ) for 12 to 16 hours . sections approximately 0.5 m thick were cut on an lkb ultramicrotome ( bromma , sweden ) using a diamond knife , and were mounted on a copper grid and stained with 4% aqueous uranyl acetate and lead citrate for examination with a transmission electron microscope ( jem-1200 ex ii , tokyo , japan ) . the electron microscopic criteria for irreversibly damaged myocardium were that all mitochondria were swollen , had disorganized cristae , and contained electron - dense deposits and contraction bands in addition to disrupted sarcolemmas . on the other hand , the ultrastructures of reversibly damaged myocardium showed mild edematous myocytes , increased sarcoplasmic space , a prominent i - band , and mild peripheral aggregation of nuclear chromatin without the features of irreversibly damaged myocardium ( 13 ) . 
eight adult cats weighing between 2.9 and 4.4 ( mean , 3.9 ) kg were used in this study . each animal was anesthetized with 5% halothane administered through an endotracheal tube by a mechanical ventilator ( mini-7 ; royal medical , seoul , korea ) . pancuronium bromide ( panslan ; reyon pharmaceutical , seoul , korea ) ( 0.6 mg / kg ) was used for skeletal muscle relaxation . to monitor blood pressure and allow pulse - gated mr imaging , a femoral artery was cannulated , and for the administration of drugs and contrast agents , this procedure was applied to a femoral vein . after a left lateral thoracotomy along the fifth intercostal space , pericardiotomy was performed by means of a midline incision , and a pericardial cradle was prepared by attaching the margins of the dissected pericardium to the adjacent thoracic wall . the left anterior descending ( lad ) coronary artery was isolated distal to the first diagonal branch , and a snare loop was made with 4 - 0 silk placed in a slender plastic tube . occlusion or reperfusion of the lad artery was achieved simply by fastening or releasing the snare loop . obstruction of this artery was confirmed by observing changes in the color of the myocardium at risk during a preliminary test occlusion . in each cat the lad artery was occluded for 90 minutes , and this was followed by 90 minutes of reperfusion . after injection , it is rapidly distributed from the vascular compartment to the extracellular fluid space . in rats , the plasma half - life is approximately 12 minutes ( 8) , and due to a rapidly changing blood concentration resulting from the large volume distributed , the utility of gd - dtpa for the mr diagnosis of acute myocardial ischemia is therefore limited ( 9 , 10 , 11 ) . gadomer-17 , provided by schering , is designed for prolonged intravascular retention and reduced transcapillary diffusion and has a molecular weight ( mw ) of 17453 g / mol . noh et al . 
demonstrated that following the administration of gadomer-17 , the signal intensity of the enhanced area seen on t1-weighted images increased rapidly , and maximum enhancement was detected during a 40 - 60 minute period ( 7 ) . gadophrin-2 , a necrosis - avid contrast material , consists of mesoporphyrins linked to gadolinium , and its mw is 1697.25 g / mol . the methods by which it is synthesized , and its chemical structure , physiochemical properties and imaging behaviors have been previously described in detail ( 12 ) . mr imaging was performed on a 1.5 t magnetom vision system ( siemens , erlangen , germany ) . for signal reception , a circularly polarized head array coil 27 cm in diameter ( siemens , erlangen , germany ) was used . during the imaging procedure electrocardiography - triggered breath - hold turbo spin - echo t2-weighted mr images were obtained along the short axis of the heart prior to the injection of contrast agents , and in order to obtain additional information as to myocardial status , images in the sagittal plane were also acquired . the acquisition parameters for t2-weighted mr images were as follows : repetition time msec/ echo time msec of 400 - 600 ( according to heart rate ) / 82 , echo train length of 33 , acquisition time of 9 - 10 seconds , matrix size of 132256 , field of view of 210280 mm , and section thickness of 5 mm . after acquiring the baseline image and intravenously injecting contrast medium , gadomer-17-enhanced t1-weighted mr images were obtained dynamically for one hour . after visual comfirmation of complete washout of gadomer-17 ( 3 hours after its injection ) , gadophrin-2 was administered , and contrast - enhanced t1-weighted mr images were obtained for the next hour . 
electrocardiography - triggered multisection t1-weighted spin - echo imaging was performed with the following imaging parameters : repetition time msec/ echo time msec of 300 - 500 ( according to heart rate)/ 25 , section thickness of 5 mm , field of view of 210280 mm , and one signal acquisition . all images were obtained along the short axis of the heart , and to provide additional confirmation of the signal enhancement of irreversibly damaged myocardium , images in the sagittal plane were also occasionally acquired . after mr imaging studies were completed , each cat was sacrificed by intravenous injection of kcl solution . the heart was excised and cut into five or six consecutive slices , 5 mm thick , in the same planes in which the mr images were obtained . the specimens were immersed in 1.5% ttc solution at 36 and stained for 15 minutes , and were then stored in 10% formalin solution for 12 hours . photographs of ttc - stained specimens in the same planes , for which mr images were obtained along the short axis of the heart , were scanned into a computer ( macintosh ; apple computer inc . , to measure the size of the infarct area and of the total left ventricle mass using public domain image processing software ( nih image 1.55 ; national institutes of health , bethesda , md . all mr images were independently analyzed by two experienced radiologists , and discrepancies were resolved by consensus . the size of the infarct area in the myocardium of the left ventricle was measured by outlining the high signal area seen on t2-weighted images and enhanced areas on gadomer-17- and gadophrin-2-enhanced t1-weighted images . the size of the infarct area was expressed as a percentage of the size of the total left ventricle , as revealed by mr imaging and ttc histochemical staining . 
the size of the enhanced areas seen on gadomer-17 and gadophrin-2-enhanced t1-weighted images , and of the high signal area on t2-weighted images , was compared with that of the infarct area disclosed by ttc histochemical staining . to determine statistical significance ( defined as p < 0.05 ) , a paired student t test was used . electron microscopic examinations were performed on tissue taken from three areas . that from the infarct area was sampled from the centers of ttc - unstained areas ; for the lateral border zone , tissue from the ttc - stained peripheral region adjacent to the ttc - unstained area ( 2 mm apart from that area ) was sampled ; and for normal myocardium , tissue was taken from the center of the ttc - stained area of the posterior wall . after collection , tissues were cut into 1-mm cubes and fixed in a 2.5% buffered glutaraldehyde solution for 12 to 16 hours followed by additional fixation in a solution of osmotic acid at 5 for 2 hours . the cubes were then dehydrated in graded alcohol at room temperature , passed through propylene oxide , and placed in a 1:1 mixture of propylene oxide and epon 812 ( polyscience inc . ) for 12 to 16 hours . sections approximately 0.5 m thick were cut on an lkb ultramicrotome ( bromma , sweden ) using a diamond knife , and were mounted on a copper grid and stained with 4% aqueous uranyl acetate and lead citrate for examination with a transmission electron microscope ( jem-1200 ex ii , tokyo , japan ) . the electron microscopic criteria for irreversibly damaged myocardium were that all mitochondria were swollen , had disorganized cristae , and contained electron - dense deposits and contraction bands in addition to disrupted sarcolemmas . on the other hand , the ultrastructures of reversibly damaged myocardium showed mild edematous myocytes , increased sarcoplasmic space , a prominent i - band , and mild peripheral aggregation of nuclear chromatin without the features of irreversibly damaged myocardium ( 13 ) . 
the high signal area seen on t2-weighted images and the enhanced area on gadomer-17-enhanced t1-weighted images were larger than the enhanced area on gadophrin-2-enhanced t1-weighted images and the infarct area disclosed by ttc histochemical staining ( t2= 39.2 % ; gadomer-17 = 37.25 % vs gadophrin-2 = 29.6 % ; ttc staining = 28.2 % ; p < 0.05 ) . the size of the high signal area seen on t2-weighted images correlated closely with that of the enhanced area on gadomer-17-enhanced t1-weighted images , and the size of the enhanced area on gadophrin-2-enhanced t1-weighted images showed close correlation with that of the infarct area revealed by ttc histochemical staining ( figs . 1 , 2 ) . electron microscopic examination of tissue taken from the three areas showed virtually the same results in each cat . ultrastructural changes in the infarct area indicated irreversibly damaged myocardium ; in the lateral border zone , the features of reversibly damaged myocardium were observed but findings of irreversibly damaged myocardium were absent ( fig . in this study , we found that the high signal area seen on t2-weighted images and the enhanced area on gadomer-17-enhanced t1-weighted images were larger than the enhanced area on gadophrin-2-enhanced t1-weighted images and the infarct area revealed by ttc histochemical staining , with statistical significance . electron microscopic examination showed that tissue taken from the ttc - stained peripheral region adjacent to the ttc - unstained area exhibited the features of reversibly damaged myocardium . in a cat model of reperfused myocardial infarction , identification of the lateral border zone is therefore possible , and by means of mr imaging we were able to determine the size and distribution of this zone . it has been well documented that the coronary artery is the major source of ischemic heart disease : thrombotic occlusion of an epicardial coronary artery is usually the cause of acute myocardial infarction ( 14 ) . 
the myocardium initiates anaerobic glycolysis within 10 seconds of occlusion of the coronary artery , with the accumulation of lactate and other metabolites ( 15 ) . acute myocardial infarction first begins in the subendocardium within 20 - 40 minutes of occlusion and spreads toward the subepicardium . this concept of infarct progression has been previously described and termed the wavefront of myocardial necrosis by reimer et al . a part of the transmural progression of injury is related to the transmural gradient of collateral blood flow . previous clinical and experimental studies have indicated that collateral blood flow following coronary occlusion is extremely poor in the subendocardial region and blood is shunted preferentially to the subepicardial zone . furthermore , two other factors , greater systolic wall stress and oxygen consumption , can contribute to subendocardial ischemia . previous studies involving dogs and humans have shown that the evolution of infarction is usually completed within six hours of coronary occlusion ; if reperfusion therapy is instituted while viable myocardium is present , the infarct can therefore be confined to a smaller area . another study has suggested , however , that further myocardial cell death due to reperfusion may occur after prolonged ischemia ( reperfusion injury ) ( 18 ) . the reduction of infarct size by reperfusion results primarily from the salvage of myocardium in the subepicardial region of the ischemia . in cases associated with coronary artery stenosis , coronary collateral flow plays an important role in maintaining the viability of the myocardium and in keeping the infarct area small and confined ( 19 , 20 ) . the lateral border zone ( peri - infarct area ) is defined as the lateral area of reversibly injured myocardium adjacent to the core of the infarct and within the area supplied by the occluded artery . 
studies of acute myocardial infarction ( 21 , 22 ) have determined whether salvageable myocardium existed along the lateral border zone as well as in the transmural direction . they found that intermediate levels of collateral blood flow and biochemical derangement , as well as intermediate functional impairment of myocardium , occur in the lateral margins of an ischemic region . such data have been interpreted to imply the existence of a lateral border zone where salvage of the myocardium may be possible . however , some studies have argued that the lateral border zone of acute myocardial infarction is limited to a narrow zone or does not exist as a quantitatively significant region ( 23 , 24 ) . this controversy over the existence of such a zone of intermediate injury may have arisen because of a difference in research methods . in their analysis of biochemical and flow gradient , yellon et al . reported that a quantitatively significant and spatially identifiable "" border zone "" region did not exist , a conclusion probably due to the tissue sampling technique they employed and differences in the ligation time of the coronary artery ( 23 ) . on the other hand , others have reported that if necrosis - avid contrast agents were used , a lateral border zone was demonstrated by cardiac mr imaging . various mr sequences and techniques have been developed for the assessment of ischemic heart disease . previous reports have indicated that non - enhanced t1-weighted mri fails to distinguish between myocardial infarction and normal myocardium . ( 25 ) reported that breath - hold turbo spin - echo t2-weighted mri can successfully detect acute myocardial infarction , providing excellent tissue contrast and high spatial resolution in a reasonably short scan time . in that study , segmental analysis of acute myocardial infarction showed a diagnostic concordance rate between t2-weighted mri and rest thallium - spect of 95% . 
however , even optimal t2-weighted images revealed both reversibly and irreversibly injured myocardium ( 26 , 27 ) . many attempts have been made to use contrast - enhanced t1-weighted mri for the evaluation of acute myocardial infarction . 28 ) reported that if infarct size was estimated on the basis of gd - dtpa - enhanced spin - echo images , the effect of reperfusion therapy on infarct size could be accurately assessed . other researchers , however , have reported that the use of a contrast agent such as gd - dtpa overestimated the extent of myocardial infarction by approximately 10 - 20 % ( 29 - 31 ) . using gd - dtpa polylysine - enhanced mri in a cat model of reperfused myocardial infarction , choi et al . ( 32 ) investigated changes in the size and degree of signal enhancement during the evolution of myocardial infarction over a six - day period . after observing that during those six days the enhanced area became smaller , they concluded that the highly enhancing area seen during the acute stage of reperfused myocardial infarction included both an irreversibly damaged necrotic area and a reversibly damaged peri - infarct zone . a necrosis - avid mr contrast agent , bis - gadolinium mesoporphyrins ( gadophrin-2 ; schering , berlin , germany ) , has recently become available and this shows a marked affinity for non - viable tissue components . the mechanism of signal enhancement by the contrast material in irreversibly damaged myocardium is still not well understood , but it can be assumed to result from some kind of binding of the compound to the sites of denatured tissue components by means of reperfused coronary flow and progressive extravascular diffusion . further studies to elucidate the mechanism of accumulation are needed . 
in an animal model with occlusive and reperfused myocardial infarction , gadophrin-2-enhanced mr imaging has proved capable of distinguishing between irreversibly and reversibly injured myocardium , with strong , persistent signal enhancement of the infarct area ( 33 , 34 ) . in addition , another intravascular contrast agent , gadomer-17 ( schering , berlin , germany ) , has recently been introduced . according to studies performed at this institution , the enhanced area seen on gadomer-17-enhanced t1-weighted images was similar to the high signal area on t2-weighted images , and statistical analysis showed no significant difference between them . we therefore believe that the enhanced area seen on gadomer-17-enhanced t1-weighted images probably included both infarct and peri - infarct areas . consequently , direct comparison of the enhanced area revealed by contrast - enhanced mri when gadophrin-2 and gadomer-17 are used may provide the means of distinguishing between irreversibly and reversibly injured myocardium . for revascularization therapy to be successful , it is imperative to distinguish between viable and non - viable myocardium : only the former is likely to benefit from this therapy . in this current study , the statistical difference in size between the abnormal signal areas on mr images and the infarct areas revealed by ttc histochemical staining may represent the lateral border zone and thus suggest the extent of reversibly damaged myocardium . first of all , we did not consider the in - vivo interaction between gadophrin-2 and gadomer-17 . a previous study ( 34 ) reported that maximal enhancement took place 40 - 60 minutes after the administration of gadomer-17 but 1 - 3 hours after the administration of gadophrin-2 , and that with both agents gradual washout then occurred ; we therefore expected that since the maximal enhancement time of these two agents differed , the effect would be minor or non - existent . 
second , we did not quantitatively analyze the lateral border zone seen on mr images which suggested reversibly damaged myocardium . thus , in order to better understand the clinical usefulness of gadomer-17 and gadophrin-2 , and the interaction between them , and to quantitatively analyze the lateral border zone seen on mr images , further study may be required . in conclusion , by means of mr imaging and pathologic correlation we were able to identify the lateral border zone in reperfused myocardial infarction in a cat model , and it may therefore be assumed that both the lateral and transmural border zone contain potentially salvageable myocardium . contrast - enhanced mr imaging using gadophrin-2 and gadomer-17 is potentially useful for determining the size and distribution of the lateral border zone .","<S> objectiveto identify and evaluate the lateral border zone by comparing the size and distribution of the abnormal signal area demonstrated by mr imaging with the infarct area revealed by pathological examination in a reperfused myocardial infarction cat model.materials and methodsin eight cats , the left anterior descending coronary artery was occluded for 90 minutes , and this was followed by 90 minutes of reperfusion . </S> <S> ecg - triggered breath - hold turbo spin - echo t2-weighted mr images were initially obtained along the short axis of the heart before the administration of contrast media . </S> <S> after the injection of gadomer-17 and gadophrin-2 , contrast - enhanced t1-weighted mr images were obtained for three hours . </S> <S> the size of the abnormal signal area seen on each image was compared with that of the infarct area after ttc staining . 
</S> <S> to assess ultrastructural changes in the myocardium at the infarct area , lateral border zone and normal myocardium , electron microscopic examination was performed.resultsthe high signal area seen on t2-weighted images and the enhanced area seen on gadomer-17-enhanced t1wi were larger than the enhanced area on gadophrin-2-enhanced t1wi and the infarct area revealed by ttc staining ; the difference was expressed as a percentage of the size of the total left ventricle mass ( t2= 39.2% ; gadomer-17 = 37.25% vs gadophrin-2 = 29.6% ; ttc staining = 28.2% ; p < 0.05 ) . </S> <S> the ultrastructural changes seen at the lateral border zone were compatible with reversible myocardial damage.conclusionin a reperfused myocardial infarction cat model , the presence and size of the lateral border zone can be determined by means of gadomer-17- and gadophrin-2-enhanced mr imaging . </S>"
2,"later , research showed that the fibers that transfer the pain stimuli are organized in the fetus . the nerve pathway myelinization begins in the second and third trimesters and finishes between 30 and 37 weeks of gestational age . mkoban et al ( 2003 ) also showed that insufficient pain control results in hypoxia and stimuli reaction . kazak et al compared the effect of pharmacologic and non - pharmacologic treatment on pain and concluded that non - pharmacological methods are more effective than pharmacological methods for pain relief . this has helped the mother - infant bond and increased the mother 's self - esteem and skills . studies have shown that kangaroo mother care ( kmc ) is effective in reducing infant pain [ 7 - 10 ] . kangaroo care originated in bogotá , colombia because of a lack of incubators for preterm infants . during kmc the adult holds the diapered infant against his / her skin . a breastfeeding mother may allow the infant self - regulatory access to her breast . the adult is without clothing from the waist up ; a blanket covers both the infant and adult . kmc has three provisions : 1 ) skin to skin contact , 2 ) exclusive breastfeeding , 3 ) support to the mother and infant . care - giving and analgesia are in the domain of nursing activities and must be a priority in nursing standards . this was a semi - experimental , double - blind , two - group clinical trial approved by the research ethics committee of mashhad university of medical sciences . sixty infants born from march to july 2006 who met the following inclusion criteria were studied : born at term with a weight between 2500 and 4000 g , type of delivery nvd , apgar score 1=7 - 10 , at earliest 24 hr after birth , not fed in the preceding 30 minutes and lack of skin lesions in mother or infant . the mother was requested to put the baby under her gown between her breasts with maximum skin to skin contact . the time of skin to skin contact was 2 minutes before vaccination and 3 minutes afterwards . 
behavioral changes were scored according to the neonatal / infant pain scale ( nips ) recommended for children less than 1 year old . in this scoring , a score greater than 3 indicates pain . facial expression : relaxed muscles 0 , grimace 1 ; crying : no cry 0 , whimper 1 , vigorous crying 2 ; breathing patterns : relaxed 0 , change in breathing 1 ; arms : relaxed / restrained 0 , extended 1 ; legs : relaxed / restrained 0 , flexed / extended 1 ; state of arousal : sleeping / awake 0 , fussy 1 . babies in the control group were wrapped in a blanket and put near the bed of their mothers . we recorded the physiologic and behavioral reactions of these infants to pain using the same method as for the case group . data were analyzed using the chi - square test , fisher exact test , paired t - test , independent t - test and mann - whitney test . the case group was 53% male and 47% female , whereas the control group was 60% female and 40% male ; 80% of the case group and 73.3% of the control group had a gestational age of 40 weeks ; 83.3% of the infants in the case group and 96.7% in the control group had a first minute apgar score of 9 . mean birth weight was 3242 ± 306.6 grams in the case group and 3151 ± 331.5 grams in the control group . according to the results obtained from the nips score , during intervention 30% of the infants in the case group had a pain score of 6 and 70% had a score of 7 , while 96.6% of the neonates in the control group had a score of 7 and 3.3% had a score of 6 . 
[ table caption : comparison of pain severity before and during intervention ] three minutes after intervention , 93.3% of the infants in the case group had a pain score of 0 and only 6.6% had pain scores of 6 to 7 , while in the control group 70% had a pain score of 0 and more than 26% had a pain score of 6 or 7 . this was also significant ( p=0.021 , table 2 ) . mean pain intensity 3 minutes after intervention was significantly lower in the case group than in the control group ( p=0.008 , table 3 ) . according to the mann - whitney test , there was a statistically significant difference in the cry interval times between the 2 groups ( p<0.001 ) during intervention . [ table captions : comparison of pain severity after intervention ; comparison of pain severity during and after intervention ] there was a statistically significant difference between the 2 groups in the time interval of crying after intervention ( p=0.008 ) . the preinterventional o2 saturation was 95.80 ± 2.78 in the case group and 94.07 ± 3.18 in the control group ; during intervention it was 96.17 ± 2.61 in the case group and 94.53 ± 2.64 in the control group ; and after intervention it was 95.60 ± 2.19 in the case group and 95.10 ± 1.72 in the control group . mean pain intensity during intervention was significantly lower in the case group than in the control group . mean pain intensity 3 minutes after intervention was significantly lower in the case group than in the control group . this was compatible with the results of gray et al and johnston et al [ 8 , 13 ] . gray et al showed that pain reaction was 65% less in their case group compared to the control group . the results of johnston et al showed that the efficacy of kmc was significant in relieving pain in infants of 32 weeks gestational age . the results of anderson et al showed that skin to skin contact is a factor that relieves infant pain and reduces behavioral and physiological reactions to painful stimulations . moreover , luedington investigated the effect of skin to skin contact on painful nursing procedures . 
he concluded that infants who had skin to skin contact showed less pain related reactions such as change in grimace . our study was compatible with johnson 's with regard to pulse rate in the two groups which was insignificant . gray reported that hugging neonates during vaccination decreased the crying interval compared to the control group in which the neonates were placed on a bed during vaccination . the present investigation showed that the skin to skin contact group had a crying interval time shorter than that of controls . according to the results of this study kmc decreased pain severity in neonates of the case group during the intervention and 3 minutes afterwards .","<S> objectiveit has been demonstrated that newborns feel pain completely . </S> <S> thus , they should be treated with this in mind . </S> <S> recent research showed that non - pharmacological interventions such as kangaroo care may be useful for decreasing pain in newborns . </S> <S> we tried to determine the effect of kangaroo care on the pain intensity of vaccination in healthy newborns.methodsthis study was a randomized case - control clinical trial . </S> <S> subjects were 60 healthy full - term newborns delivered in a general hospital , in iran , from march to july 2006 . </S> <S> they were randomly assigned to case and control groups . </S> <S> the case group received 30 minutes skin to skin contact , whereas infants in the control group were put , wrapped in a blanket , aside the mothers . </S> <S> behavioral changes of newborns were evaluated and observed 2 minutes before , during , and 3 minutes after the intervention . </S> <S> all procedures were filmed . </S> <S> an assistant who was blinded to the study , scored behavior changes using neonatal / infant pain scale . 
</S> <S> heart rate and oxygen saturation levels as displayed on the pulse monitor and duration of crying were recorded using a stopwatch.findingsmean pain intensity during the intervention v was significantly lower in the case group ( p<0.006 ) . </S> <S> mean pain intensity 3 minutes after intervention was also significantly lower in the case group ( p<0.021 ) . </S> <S> mean duration of crying was significantly lower in case group as well ( p<0.001).conclusionkangaroo care may be used to decrease pain intensity in newborns undergoing painful procedures . </S>"
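The NIPS scoring rule quoted in the article above (six items, with a total greater than 3 indicating pain) can be sketched in a few lines of Python. This is only an illustrative sketch: the function names `nips_score` and `indicates_pain` and the dictionary keys are hypothetical choices made here, not part of the dataset or of any library.

```python
# Maximum points per NIPS item, as listed in the article text above.
# The key names are hypothetical labels chosen for this sketch.
NIPS_MAX = {
    "facial_expression": 1,  # relaxed muscles 0, grimace 1
    "crying": 2,             # no cry 0, whimper 1, vigorous crying 2
    "breathing": 1,          # relaxed 0, change in breathing 1
    "arms": 1,               # relaxed/restrained 0, extended 1
    "legs": 1,               # relaxed/restrained 0, flexed/extended 1
    "arousal": 1,            # sleeping/awake 0, fussy 1
}
PAIN_THRESHOLD = 3  # a total greater than 3 indicates pain


def nips_score(observation):
    """Sum the six item scores after checking each is within its range."""
    total = 0
    for item, max_points in NIPS_MAX.items():
        value = observation[item]
        if not isinstance(value, int) or not 0 <= value <= max_points:
            raise ValueError(f"{item} must be an integer in 0..{max_points}")
        total += value
    return total


def indicates_pain(observation):
    """Apply the 'greater than 3' rule stated in the article."""
    return nips_score(observation) > PAIN_THRESHOLD


calm = dict(facial_expression=0, crying=0, breathing=0, arms=0, legs=0, arousal=0)
distressed = dict(facial_expression=1, crying=2, breathing=1, arms=1, legs=1, arousal=1)
print(nips_score(calm), indicates_pain(calm))
print(nips_score(distressed), indicates_pain(distressed))
```

The maximum possible total under this table is 7, which matches the highest pain scores reported in the article's results.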
3,"their occurrence was first described in 1670 by thilesus . however , at that time fistulas were a common complication of chronic and untreated cholecystitis . according to a 2005 study , 226 cases have been reported in total , with fewer than 25 in the last 50 years . the reduced incidence in current times can be attributed to more rapid diagnosis and treatment with antibiotics or surgery . although occurring in acalculous cholecystitis and carcinoma of the gallbladder , fistulas are still most commonly associated with gallstones [ 3 , 4 ] . obstruction of the cystic duct leads to an increase in gallbladder pressure and reduced perfusion with necrosis , which consequently causes gallbladder perforation . the contents of the gallbladder may then empty into the peritoneal cavity and an abscess may form or a fistula may develop through adherence to the duodenum , colon or abdominal wall , often via the fundus of the gallbladder . the right upper quadrant is the most common location for the exit tract of the fistula , but locations such as the gluteal region , umbilicus and right groin have also been documented . cholecystocutaneous fistulas are most often seen in elderly women over the age of 60 , likely due to coexistent disease and non - specific symptoms interfering with diagnosis . a white 85-year - old female with hypertension and a previous history of breast biopsy underwent endoscopic retrograde cholangiopancreatography with sphincterotomy after initially presenting on may 3 , 2011 with common duct stones . the patient was initially seen in the emergency department complaining of a 3-day history of sharp intermittent epigastric and right upper quadrant pain radiating to the central back . mild scleral icterus was noted , but there were no signs of jaundice or lymphadenopathy . her abdomen was soft , non - distended and mildly tender to palpation with a positive murphy 's sign . 
routine blood work demonstrated an elevated white blood cell count of 16.1 , no abnormalities on sma7 , elevated lipase > 3,000 , and elevated liver function tests , including an alkaline phosphatase of 215 , a bilirubin of 41 , an ast of 100 , a ggt of 305 and an alt of 194 . clinical evidence of mild jaundice accompanied by blood work abnormalities and positive radiological signs led to the diagnosis of acute calculous cholecystitis , common bile duct stones up to 7 mm in size and biliary gallstone pancreatitis . she was treated conservatively with intravenous antibiotics and underwent endoscopic retrograde cholangiopancreatography with sphincterotomy for removal of several stones of various sizes . percutaneous cholecystostomy was then carried out for drainage of the gallbladder after development and medical control of atrial fibrillation . on june 1 , 2011 she was re - admitted to the hospital with a left lower lobe pulmonary embolism . on june 27 , 2011 the percutaneous drain was removed at her request . in early august 2011 , she re - developed right upper quadrant discomfort ; furthermore , she noted some purulent drainage from the percutaneous drain site and extrusion of approximately 30 gallstones . she had several follow - up ultrasounds which identified a fistulous tract measuring 0.78 cm in diameter communicating with the external opening in the right upper quadrant ( fig . an irregular hypoechoic area just inside the subcutaneous tissue measuring 4.1 × 2.7 cm was presumed to represent a contracted gallbladder . plans were made for laparoscopic cholecystectomy and management of her cholecystocutaneous fistula on february 22 , 2012 , once she finished her coumadin regimen . on the morning of the operation , february 22 , 2012 , the patient 's inr was still elevated at 1.8 and the surgery was re - scheduled for a month later . on april 18 , 2012 the patient underwent laparoscopic cholecystectomy and fistula division ( fig . 2 ) . 
three additional gallstones were found in the gallbladder at the time of the operation . we present the case of an 85-year - old white female who was diagnosed with a cholecystocutaneous fistula that developed as a complication following removal of a percutaneous drain that was used to treat her acute cholecystitis . re - occurrence of her cholecystitis after drain removal and the presence of gallstones promoted the production of a fistula along the pre - existing tract of the drain . her concurrent treatment with anticoagulants for a pulmonary embolism delayed the definitive management of her cholecystitis and fistula . fortunately , the patient remained in reasonably good health throughout the waiting period from time of fistula diagnosis to surgery . more conservative approaches such as percutaneous cholecystotomy have been used in high - risk patients , leading to spontaneous closure of the fistula . however , in this case the fistula developed through the old drain tract , so surgical intervention was employed . as with uncomplicated cholecystitis , laparoscopic techniques are favorable compared to open surgery and thus a laparoscopic cholecystectomy was undertaken in this case . the gallstones removed during cholecystectomy were of orange - brown color consistent with cholesterol stones . although fistula formation is now a rare complication of cholecystitis , it remains a possibility and should be considered in the differential diagnosis of any fistulous tract in the right abdominal wall . we have demonstrated that previous percutaneous drainage of an acute gallbladder infection can promote the formation of such a fistula if the infection is not properly dealt with or re - occurs . physicians should be prepared to recognize this complication in patients after drain removal and prior to definitive surgery .","<S> cases of cholecystocutaneous fistulas are now a rare occurrence as a result of rapid diagnosis and treatment . 
</S> <S> we present a case of cholecystocutaneous fistula developing after the removal of a percutaneous drain for the treatment of acute cholecystitis . </S> <S> re - occurring infection and presence of gallstones led to fistulization of the gallbladder fundus and the development of a tract along the path created by the drain . the patient presented with re - occurring right upper quadrant abdominal pain , purulent discharge from the fistulous opening and expulsion of multiple gallstones . </S> <S> she underwent laparoscopic cholecystectomy and fistula excision . </S>"
4,"data sources were the truven health marketscan commercial claims and encounters and the marketscan medicare supplemental databases from january 1 , 2009 , to september 30 , 2012 . records of patients with a copd diagnosis at any diagnosis position within the intake period from july 1 , 2009 , through september 30 , 2011 , were included . claims for laboratory , pathology , or radiology services were not used to identify individuals with a specific condition , because their use could incorrectly identify individuals as having that condition based on the reason for testing ( eg , screening ) rather than the test results ; therefore , those claims were ignored during patient selection , and diagnoses were termed non - rule - out copd . the first occurrence of non - rule - out copd diagnosis ( international classification of diseases , ninth revision , clinical modification codes 490.xx , 491.xx , 492.xx , 494.xx , or 496.xx ) ( e - table 1 ) was defined as the index event , and the date of the index event was defined as the index date . the study proposal was presented to and accepted by the novartis outcomes research review forum . eligible patients were aged 40 to 90 years ( inclusive ) , had copd , and used at least one long - acting muscarinic antagonist ( lama ) , long - acting β2-adrenergic agonist ( laba ) , inhaled corticosteroid ( ics ) / laba , or lama + ics + laba from 180 days before the index date through 180 days after the index date . patients were enrolled continuously in the medical , pharmacy benefit , and fee - for - service plan from 180 days before the index date to 360 days after the index date . comorbidities were defined as any occurrence of a specific diagnosis code from 180 days before the index date through 180 days after the index date . this time period , which does not fall completely within the baseline period or the 360-day follow - up period , was chosen to establish and solidify the baseline patient comorbidities being evaluated . 
comorbidities of interest were identified prospectively and consisted of chronic kidney disease ( ckd ) ; cardiovascular disease ( cvd ) , including heart failure , stroke , acute myocardial infarction ( mi ) , and peripheral vascular disease ; asthma ; depression ; diabetes ; osteoporosis ; and anemia . resource consumption was measured from 180 days prior to the index date to the index date ( for baseline assessments ) and from the index date through 360 days after the index date ( for outcomes assessments ) . resource use assessments included all - cause and disease - specific ed visits , hospitalizations , office visits ( defined as any office visit to any doctor ) , outpatient visits , and total length of hospital stay ; in this instance , disease - specific means copd- or asthma - related . the health - care costs assessed included all - cause and disease - specific costs for ed visits , hospitalizations , office visits , and other outpatient visits , as well as medical , prescription drug , and total health - care costs . other covariate variables included age , sex , region , employment status , and index medication ( the first drug class used during the period ) . patient characteristics , comorbidities of interest , health - care use , and costs were summarized descriptively . a generalized linear model ( glm ) was used to evaluate which comorbidities drive total costs after accounting for patient characteristics in the total population . after examining the data , we selected glms with a log - link and distribution to evaluate the incremental all - cause costs , adjusting for baseline demographics , resource use , and comorbidities . to better understand total costs that are potentially attributable to comorbidities , the average treatment effect ( change in the response by a change in a covariate ) of each comorbidity was calculated by using the recycled prediction method . 
the predicted costs for patients with cvd were calculated based on the estimated glm ( with costs as the dependent variable ) by assuming all patients had cvd ( regardless of whether they had cvd ) while keeping other covariates as they were . the predicted costs , for which every observation is treated as if it represents patients without cvd , were obtained in the same manner . the average treatment effect was the mean difference in the predicted costs for the two groups . thus , we compared two hypothetical populations ( one with cvd and one without cvd ) that had the exact same values for the other independent variables in the model . cis were generated with the percentile method ( the 95% lower bound ci is the 2.5th percentile of the bootstrap distribution , and the 95% upper bound ci is the 97.5th percentile of the bootstrap distribution ) . patient characteristics on the index date are summarized in table 1 , and health - care use and costs from the 180 days preceding the index date through the index date are summarized in table 2 . 
the most common comorbidity was cvd ( 34.8% ) , followed by diabetes ( 22.8% ) , asthma ( 14.7% ) , anemia ( 14.2% ) , ckd ( 9.9% ) , depression ( 9.9% ) , and osteoporosis ( 6.9% ) . most patients ( 52.8% ) had one or two comorbidities of interest . patients with ckd and anemia experienced the highest incidence of all - cause ed visits leading to hospitalizations ( 23.2% and 20.4% , respectively ) and all - cause hospitalizations ( 38.0% and 33.8% , respectively ) . the percentages of all - cause office visits and of all - cause other outpatient visits were generally similar across the various comorbidity groups ( all - cause office visits , 93.8%-95.4% ; all - cause other outpatient visits , 95.1%-97.7% ) and were higher than those for patients with no baseline comorbidities of interest ( 83.9% and 84.6% , respectively ) . ics = inhaled corticosteroid ; laba = long - acting 2-adrenergic agonist ; lama = long - acting muscarinic antagonist . ckd = chronic kidney disease ; cvd = cardiovascular disease ; ics = inhaled corticosteroid ; laba = long - acting 2-adrenergic agonist ; lama = long - acting muscarinic agonist ; mi = myocardial infarction . spouse / child / dependent relation . within the period from 180 d before the index date through 180 d after the index date ( inclusive ) . includes heart failure , stroke , acute mi , and peripheral vascular disease . baseline resource use and resource use from the index date ( exclusive ) through 180 d before the index date ( inclusive ) . the prevalence of copd- or asthma - related hospitalizations was highest among patients with asthma at baseline ( 8.3% ) ; for the other baseline comorbidities , the frequencies of copd- or asthma - related hospitalizations were similar , ranging from 3.4% ( osteoporosis ) to 3.9% ( depression ) . 
mean all - cause total health - care costs from the 180 days before the index date through the index date were highest among patients with ckd ( $ 19,405 ) and anemia ( $ 18,011 ) and lowest among those with asthma ( $ 10,583 ) and osteoporosis ( $ 11,438 ) . mean copd- or asthma - related total health - care costs were highest among patients with asthma ( $ 1,845 ) and osteoporosis ( $ 1,566 ) . in addition to those shown in table 2 , analyses were conducted based on the number of comorbidities present . patients with four or more comorbidities experienced the highest incidence of ed visits leading to hospitalizations ( 33.1% ) , compared with 22.2% in those with three comorbidities , 13.8% in those with two comorbidities , 7.4% in those with one comorbidity , and 2.5% in those with no comorbidities . similarly , the rate of all - cause hospitalizations was highest in patients with four or more comorbidities ( 50.4% ) and lowest in those with no comorbidities ( 5.0% ) . all - cause total health - care costs increased as the number of comorbidities increased ( zero comorbidities of interest , $ 4,790 ; four or more comorbidities of interest , $ 27,895 ) , as did copd- or asthma - related total health - care costs ( zero comorbidity of interest , $ 871 ; four or more comorbidities of interest , $ 2,216 ) . during the time period from the index date through 360 days after the index date , 38.6% of patients with copd and ckd and 34.3% of patients with copd and anemia had all - cause ed visits leading to hospitalizations ( fig 2a ) . the percentage of patients experiencing all - cause hospitalizations was highest among those with ckd ( 57.0% ) and anemia ( 52.6% ) ( fig 2a ) . the percentage of patients with copd- or asthma - related hospitalizations was highest among those with asthma ( 17.4% ) and cvd ( 12.0% ) ( fig 2a ) . 
all - cause total health - care costs during the time period from the index date through 360 days following the index date were highest among patients with copd and ckd ( $ 41,288 ) and patients with copd and anemia ( $ 38,870 ) ( table 3 ) . copd- or asthma - related total health - care costs were highest among patients with copd and asthma ( $ 5,389 ) and those with copd and ckd ( $ 5,117 ) . a , b , resource use from index date through 360 d after the index date for each outcome . ckd = chronic kidney disease ; cvd = cardiovascular disease ; er = emergency room . total medical costs by comorbidity and number of comorbidities data are presented as mean ( sd ) . costs are from index date through 360 d after the index date and are in 2012 us dollars . see table 1 legend for expansion of abbreviations . patients with four or more comorbidities of interest experienced the highest incidence of ed visits leading to hospitalizations ( 50.1% vs 9.5% with no comorbidities ) and all - cause hospitalizations ( 68.7% vs 16.7% with no comorbidities ) ( fig 2b ) . all - cause total health - care costs and copd- or asthma - related total health - care costs both increased as the number of comorbidities increased ( table 3 ) . the reference group selected for the glm was female patients with copd aged 40 to 64 years , living in the south , employed , having an index medication of ics / laba fixed or loose - dose combination with no ed visits or hospitalizations regardless of relationship to asthma or copd , and who did not have ckd , cvd , asthma , depression , diabetes , osteoporosis , or anemia . in this group , a ratio , based on the impact on total health - care costs , was estimated for each variable in the model ( table 4 ) and represents multiplicative effects . ratios for variables included in the generalized linear model arithmetic mean cost for the reference group was $ 12,408 . 
characteristics with the greatest impact on costs included depression ( ratio , 1.35 ) , ckd ( ratio , 1.43 ) , anemia ( ratio , 1.54 ) , and cvd ( ratio , 1.55 ) comorbidities ( table 4 ) . the average treatment effect for each comorbidity after adjusting for age , sex , geographic location , baseline health - care use , employment status , and index copd medication is shown in figure 3 . for the time period from the index date through 360 days following the index date , a patient with copd and anemia had , on average , $ 10,762 more in total health - care costs than a patient with copd but without anemia . cvd and ckd increased total health - care costs by $ 9,882 and $ 8,912 , respectively . difference in average total health - care cost by comorbidity from index date through 360 d after the index date in 2012 us dollars . patient characteristics on the index date are summarized in table 1 , and health - care use and costs from the 180 days preceding the index date through the index date are summarized in table 2 . the most common comorbidity was cvd ( 34.8% ) , followed by diabetes ( 22.8% ) , asthma ( 14.7% ) , anemia ( 14.2% ) , ckd ( 9.9% ) , depression ( 9.9% ) , and osteoporosis ( 6.9% ) . most patients ( 52.8% ) had one or two comorbidities of interest . patients with ckd and anemia experienced the highest incidence of all - cause ed visits leading to hospitalizations ( 23.2% and 20.4% , respectively ) and all - cause hospitalizations ( 38.0% and 33.8% , respectively ) . the percentages of all - cause office visits and of all - cause other outpatient visits were generally similar across the various comorbidity groups ( all - cause office visits , 93.8%-95.4% ; all - cause other outpatient visits , 95.1%-97.7% ) and were higher than those for patients with no baseline comorbidities of interest ( 83.9% and 84.6% , respectively ) . ics = inhaled corticosteroid ; laba = long - acting 2-adrenergic agonist ; lama = long - acting muscarinic antagonist . 
ckd = chronic kidney disease ; cvd = cardiovascular disease ; ics = inhaled corticosteroid ; laba = long - acting 2-adrenergic agonist ; lama = long - acting muscarinic agonist ; mi = myocardial infarction . spouse / child / dependent relation . within the period from 180 d before the index date through 180 d after the index date ( inclusive ) . includes heart failure , stroke , acute mi , and peripheral vascular disease . baseline resource use and resource use from the index date ( exclusive ) through 180 d before the index date ( inclusive ) . the prevalence of copd- or asthma - related hospitalizations was highest among patients with asthma at baseline ( 8.3% ) ; for the other baseline comorbidities , the frequencies of copd- or asthma - related hospitalizations were similar , ranging from 3.4% ( osteoporosis ) to 3.9% ( depression ) . mean all - cause total health - care costs from the 180 days before the index date through the index date were highest among patients with ckd ( $ 19,405 ) and anemia ( $ 18,011 ) and lowest among those with asthma ( $ 10,583 ) and osteoporosis ( $ 11,438 ) . mean copd- or asthma - related total health - care costs were highest among patients with asthma ( $ 1,845 ) and osteoporosis ( $ 1,566 ) . in addition to those shown in table 2 , analyses were conducted based on the number of comorbidities present . patients with four or more comorbidities experienced the highest incidence of ed visits leading to hospitalizations ( 33.1% ) , compared with 22.2% in those with three comorbidities , 13.8% in those with two comorbidities , 7.4% in those with one comorbidity , and 2.5% in those with no comorbidities . similarly , the rate of all - cause hospitalizations was highest in patients with four or more comorbidities ( 50.4% ) and lowest in those with no comorbidities ( 5.0% ) . 
all - cause total health - care costs increased as the number of comorbidities increased ( zero comorbidities of interest , $ 4,790 ; four or more comorbidities of interest , $ 27,895 ) , as did copd- or asthma - related total health - care costs ( zero comorbidity of interest , $ 871 ; four or more comorbidities of interest , $ 2,216 ) . during the time period from the index date through 360 days after the index date , 38.6% of patients with copd and ckd and 34.3% of patients with copd and anemia had all - cause ed visits leading to hospitalizations ( fig 2a ) . the percentage of patients experiencing all - cause hospitalizations was highest among those with ckd ( 57.0% ) and anemia ( 52.6% ) ( fig 2a ) . the percentage of patients with copd- or asthma - related hospitalizations was highest among those with asthma ( 17.4% ) and cvd ( 12.0% ) ( fig 2a ) . all - cause total health - care costs during the time period from the index date through 360 days following the index date were highest among patients with copd and ckd ( $ 41,288 ) and patients with copd and anemia ( $ 38,870 ) ( table 3 ) . copd- or asthma - related total health - care costs were highest among patients with copd and asthma ( $ 5,389 ) and those with copd and ckd ( $ 5,117 ) . a , b , resource use from index date through 360 d after the index date for each outcome . ckd = chronic kidney disease ; cvd = cardiovascular disease ; er = emergency room . total medical costs by comorbidity and number of comorbidities data are presented as mean ( sd ) . costs are from index date through 360 d after the index date and are in 2012 us dollars . see table 1 legend for expansion of abbreviations . patients with four or more comorbidities of interest experienced the highest incidence of ed visits leading to hospitalizations ( 50.1% vs 9.5% with no comorbidities ) and all - cause hospitalizations ( 68.7% vs 16.7% with no comorbidities ) ( fig 2b ) . 
all - cause total health - care costs and copd- or asthma - related total health - care costs both increased as the number of comorbidities increased ( table 3 ) . the reference group selected for the glm was female patients with copd aged 40 to 64 years , living in the south , employed , having an index medication of ics / laba fixed or loose - dose combination with no ed visits or hospitalizations regardless of relationship to asthma or copd , and who did not have ckd , cvd , asthma , depression , diabetes , osteoporosis , or anemia . in this group , a ratio , based on the impact on total health - care costs , was estimated for each variable in the model ( table 4 ) and represents multiplicative effects . ratios for variables included in the generalized linear model arithmetic mean cost for the reference group was $ 12,408 . characteristics with the greatest impact on costs included depression ( ratio , 1.35 ) , ckd ( ratio , 1.43 ) , anemia ( ratio , 1.54 ) , and cvd ( ratio , 1.55 ) comorbidities ( table 4 ) . the average treatment effect for each comorbidity after adjusting for age , sex , geographic location , baseline health - care use , employment status , and index copd medication is shown in figure 3 . for the time period from the index date through 360 days following the index date , a patient with copd and anemia had , on average , $ 10,762 more in total health - care costs than a patient with copd but without anemia . cvd and ckd increased total health - care costs by $ 9,882 and $ 8,912 , respectively . difference in average total health - care cost by comorbidity from index date through 360 d after the index date in 2012 us dollars . our results were consistent with those of previously published work , in that a significant burden of comorbidity was associated with copd , and comorbid conditions were associated with incremental increases in resource use and health - care costs . 
total health - care costs during the period from the index date through 360 days following the index date were greatest among patients with copd and ckd or anemia ; copd- or asthma - related total health - care costs were greatest among patients with copd and asthma and ckd . these results were driven , in part , by high incidences of all - cause ed visits leading to hospitalizations among patients with copd and ckd or anemia , and a high incidence of copd- or asthma - related hospitalization among patients with copd and asthma . multivariable analyses adjusted for age , sex , geographic location , baseline health - care use , employment status , and index copd medication showed that the effect of comorbidities on total health - care costs was greatest for anemia . this finding is consistent with the high incidences of all - cause ed visits leading to hospitalizations and all - cause hospitalizations among patients with copd and anemia . substantial treatment effects of cvd and ckd ( about $ 9,000 ) are likely attributable , in part , to the high incidence ( about 50% ) of all - cause hospitalization in both groups . cavailles et al reviewed the pathophysiologic and epidemiologic links between copd and the comorbidities studied here , and concluded that shared risk factors and the influence of chronic systemic inflammation are likely contributors to these relationships . smoking is a major risk factor for both copd and cvd , and cvd was the most common comorbidity in the copd population ; thus , the frequent coexistence of these two conditions is unsurprising . however , the literature also suggests that the systemic inflammation associated with copd produces a procoagulant state and endothelial dysfunction that may contribute to thromboembolic events . in fact , at least one study has shown that the link between cardiovascular events ( including death ) and copd is independent of smoking status and other confounding coronary risk factors . 
there is no evidence of a direct role for copd - related inflammation in anemia , which was identified as the most costly of the comorbidities in this study . however , older age , malnutrition , and cvd frequently accompany copd and are believed to play a role in the development of anemia in patients with copd . consistent with our findings , ornek et al determined that anemia significantly increased the cost of copd treatment in patients hospitalized for acute exacerbation of copd . furthermore , anemia was independently prognostic for premature mortality , hospital admissions , and cumulative duration of hospitalization in patients with severe copd receiving long - term oxygen therapy . our results support ckd as a key driver of total health - care costs as well as copd- and asthma - related health - care costs in patients with copd , although it was not as prevalent as other comorbidities studied . epidemiologic studies have confirmed copd as a risk factor for ckd , and the literature suggests that renal function is sensitive to hypoxemia and hypercarbia . additionally , arterial stiffness associated with copd may damage glomeruli , and some copd medications may have nephrotoxic effects . the presence of chronic renal failure significantly increased the cost of care in patients hospitalized for acute exacerbation of copd . finally , copd- or asthma - related total health - care costs were higher among patients with copd and asthma than among those with any other comorbidity . overlap syndrome refers to patients who have components of both conditions , and it is often used to describe elderly individuals in whom the distinction between asthma and copd is difficult to make . 
in a study of nearly 25,000 insured adults with copd , those with asthma had 1.6 times greater odds of having respiratory - related ed visits , hospitalizations , or both than did those with copd alone and demonstrated an approximately 50% increase in respiratory - related health - care costs . hospital cost use for copd or bronchiectasis was also evaluated in a recent analysis of the nationwide inpatient sample and the nationwide emergency department sample database of the healthcare cost and utilization project . although significant trends were not found in age - adjusted rates of hospital discharges from 2001 to 2012 , ed visits from 2006 to 2011 , or 30-day readmissions from 2009 to 2012 , the mean charges and costs of all discharges increased considerably from 2001 to 2012 , with aggregate charges for inpatient stays increasing from $ 8,023,983,422 in 2001 to $ 18,112,392,566 in 2012 . this study was subject to several limitations . because it was restricted to patients who were on long - acting therapies , patients with the mildest form of copd and patients with more advanced disease who were not appropriately prescribed long - acting therapies inherent to claims data research , the clinical accuracy of the coding could not be assessed . in addition , patients with health maintenance organizations or full or partial capitated point - of - service insurance coverage were excluded from this study because the financial information for this population was incomplete . further , this analysis did not include patients on medicaid and it contained < 8% of patients with managed medicare . no information was available regarding the severity of disease and the level of treatment adherence . 
an analysis in the younger , working - age copd population ( 45 - 64 years of age ) showed that these costs ( which include the costs of impaired productivity at work , lost productivity because of early retirement , disability pensions paid , and tax revenue lost ) are considerably higher than the direct medical cost of copd . in conclusion , these results show that a high prevalence of patients with copd and multiple comorbidities have associated high resource use and costs , especially within the all - cause use category . further research on comorbid conditions affecting the treatment adherence of patients with copd , copd pathogenic pathways , and worsening overall prognosis is necessary to elucidate the role of comorbidities in copd .","<S> background : the morbidity and mortality associated with copd exacts a considerable economic burden . comorbidities in copd are associated with poor health outcomes and increased costs . </S> <S> our objective was to assess the impact of comorbidities on copd - associated costs in a large administrative claims dataset.methods:this was a retrospective observational study of data from the truven health marketscan commercial claims and encounters and the marketscan medicare supplemental databases from january 1 , 2009 , to september 30 , 2012 . </S> <S> resource consumption was measured from the index date ( date of first occurrence of non - rule - out copd diagnosis ) to 360 days after the index date . </S> <S> resource use ( all - cause and disease - specific [ ie , copd- or asthma - related ] ed visits , hospitalizations , office visits , other outpatient visits , and total length of hospital stay ) and health - care costs ( all - cause and disease - specific costs for ed visits , hospitalizations , office visits , and other outpatient visits and medical , prescription , and total health - care costs ) were assessed . 
</S> <S> generalized linear models were used to evaluate the impact of comorbidities on total health - care costs , adjusting for age , sex , geographic location , baseline health - care use , employment status , and index copd medication.results:among 183,681 patients with copd , the most common comorbidities were cardiovascular disease ( 34.8% ) , diabetes ( 22.8% ) , asthma ( 14.7% ) , and anemia ( 14.2% ) . </S> <S> most patients ( 52.8% ) had one or two comorbidities of interest . </S> <S> the average all - cause total health - care costs from the index date to 360 days after the index date were highest for patients with chronic kidney disease ( $ 41,288 ) and anemia ( $ 38,870 ) . </S> <S> the impact on total health - care costs was greatest for anemia ( $ 10,762 more , on average , than a patient with copd without anemia).conclusions : our analysis demonstrated that high resource use and costs were associated with copd and multiple comorbidities . </S>"


The metric is an instance of [`datasets.Metric`](https://huggingface.co/docs/datasets/package_reference/main_classes.html#datasets.Metric):

In [13]:
metric

Metric(name: "rouge", features: {'predictions': Value(dtype='string', id='sequence'), 'references': Value(dtype='string', id='sequence')}, usage: """
Calculates average rouge scores for a list of hypotheses and references
Args:
    predictions: list of predictions to score. Each predictions
        should be a string with tokens separated by spaces.
    references: list of reference for each prediction. Each
        reference should be a string with tokens separated by spaces.
    rouge_types: A list of rouge types to calculate.
        Valid names:
        `"rouge{n}"` (e.g. `"rouge1"`, `"rouge2"`) where: {n} is the n-gram based scoring,
        `"rougeL"`: Longest common subsequence based scoring.
        `"rougeLSum"`: rougeLsum splits text using `"\n"`.
        See details in https://github.com/huggingface/datasets/issues/617
    use_stemmer: Bool indicating whether Porter stemmer should be used to strip word suffixes.
    use_agregator: Return aggregates if this is set to True
Retu

You can call its `compute` method with your predictions and labels, both of which need to be lists of decoded strings:

In [14]:
fake_preds = ["hello there", "general kenobi"]
fake_labels = ["hello there", "general kenobi"]
metric.compute(predictions=fake_preds, references=fake_labels)

{'rouge1': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0)),
 'rouge2': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0)),
 'rougeL': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0)),
 'rougeLsum': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0))}
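To make the numbers above less opaque, here is a minimal sketch of what a ROUGE-1 score measures: the unigram overlap between a prediction and its reference. The function name `rouge1_sketch` is hypothetical, and whitespace tokenization is an assumption made for brevity. The real `rouge` metric tokenizes differently, can apply stemming, and aggregates scores over the whole list into `low`/`mid`/`high` values, so its numbers will not match this sketch exactly on real text.

```python
from collections import Counter


def rouge1_sketch(prediction, reference):
    """Simplified ROUGE-1: unigram-overlap precision, recall and F-measure."""
    # Assumption: lowercase + whitespace split stands in for the real tokenizer.
    pred_counts = Counter(prediction.lower().split())
    ref_counts = Counter(reference.lower().split())
    # Clipped overlap: a token counts at most as often as it occurs in both texts.
    overlap = sum((pred_counts & ref_counts).values())
    precision = overlap / max(sum(pred_counts.values()), 1)
    recall = overlap / max(sum(ref_counts.values()), 1)
    if overlap == 0:
        fmeasure = 0.0
    else:
        fmeasure = 2 * precision * recall / (precision + recall)
    return precision, recall, fmeasure


# Identical strings overlap completely, unrelated strings not at all.
print(rouge1_sketch("hello there", "hello there"))
print(rouge1_sketch("hello there", "general kenobi"))
```

Because the fake predictions above are identical to the fake labels, every score returned by `metric.compute` is 1.0, which is what this sketch also yields for identical strings.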

## Preprocessing the data

Before we can feed those texts to our model, we need to preprocess them. This is done by a 🤗 `Transformers` `Tokenizer`, which will (as the name indicates) tokenize the inputs (including converting the tokens to their corresponding IDs in the pretrained vocabulary) and put them in the format the model expects, as well as generate the other inputs that the model requires.

To do all of this, we instantiate our tokenizer with the `AutoTokenizer.from_pretrained` method, which will ensure:

- we get a tokenizer that corresponds to the model architecture we want to use,
- we download the vocabulary used when pretraining this specific checkpoint.

That vocabulary will be cached, so it's not downloaded again the next time we run the cell.


In [16]:
from transformers import AutoTokenizer
    
tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)

Downloading:   0%|          | 0.00/26.0 [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/1.55k [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/878k [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/446k [00:00<?, ?B/s]

By default, the call above will use one of the fast tokenizers (backed by Rust) from the 🤗 `Tokenizers` library.

You can directly call this tokenizer on one sentence or a pair of sentences:

In [17]:
tokenizer("Hello, this one sentence!")

{'input_ids': [0, 31414, 6, 42, 65, 3645, 328, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1]}

Depending on the model you selected, you will see different keys in the dictionary returned by the cell above. They don't matter much for what we're doing here (just know they are required by the model we will instantiate later). You can learn more about them in [this tutorial](https://huggingface.co/transformers/preprocessing.html) if you're interested.

Instead of one sentence, we can pass along a list of sentences:

In [18]:
tokenizer(["Hello, this one sentence!", "This is another sentence."])

{'input_ids': [[0, 31414, 6, 42, 65, 3645, 328, 2], [0, 713, 16, 277, 3645, 4, 2]], 'attention_mask': [[1, 1, 1, 1, 1, 1, 1, 1], [1, 1, 1, 1, 1, 1, 1]]}

To prepare the targets for our model, we need to tokenize them inside the `as_target_tokenizer` context manager. This will make sure the tokenizer uses the special tokens corresponding to the targets:

In [19]:
with tokenizer.as_target_tokenizer():
    print(tokenizer(["Hello, this one sentence!", "This is another sentence."]))

{'input_ids': [[0, 31414, 6, 42, 65, 3645, 328, 2], [0, 713, 16, 277, 3645, 4, 2]], 'attention_mask': [[1, 1, 1, 1, 1, 1, 1, 1], [1, 1, 1, 1, 1, 1, 1]]}


If you are using one of the five T5 checkpoints, the inputs have to be prefixed with "summarize:" (the model can also translate, and it needs the prefix to know which task it has to perform).

In [20]:
if model_checkpoint in ["t5-small", "t5-base", "t5-large", "t5-3b", "t5-11b"]:
    prefix = "summarize: "
else:
    prefix = ""

We can then write the function that will preprocess our samples. We just feed them to the `tokenizer` with the argument `truncation=True`. This ensures that an input longer than what the selected model can handle is truncated to the maximum length accepted by the model. The padding will be dealt with later on (in a data collator), so that we pad examples to the longest length in the batch and not to the longest length in the whole dataset.

The max input length of `sshleifer/distilbart-xsum-12-1` is 1024, so `max_input_length = 1024`.
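The difference between truncation (done now) and padding (done later, per batch) can be illustrated with a small pure-Python sketch. The helpers `truncate` and `pad_batch` and the `pad_id` value are hypothetical and only serve the illustration. In particular, the real tokenizer keeps the closing special token when it truncates, which this naive slice does not.

```python
def truncate(token_ids, max_length):
    """Naive truncation: keep only the first `max_length` token IDs."""
    return token_ids[:max_length]


def pad_batch(batch, pad_id):
    """Pad every sequence to the longest one in this batch (dynamic padding)."""
    longest = max(len(ids) for ids in batch)
    input_ids = [ids + [pad_id] * (longest - len(ids)) for ids in batch]
    # 1 marks a real token, 0 marks padding the model should ignore.
    attention_mask = [[1] * len(ids) + [0] * (longest - len(ids)) for ids in batch]
    return {"input_ids": input_ids, "attention_mask": attention_mask}


# Token IDs taken from the two example sentences tokenized earlier.
batch = [
    [0, 31414, 6, 42, 65, 3645, 328, 2],
    [0, 713, 16, 277, 3645, 4, 2],
]
padded = pad_batch([truncate(ids, 1024) for ids in batch], pad_id=1)  # pad_id is an assumption
print(padded["input_ids"])
print(padded["attention_mask"])
```

Padding per batch means a batch of short articles only costs as much memory as its longest member, instead of always paying for 1024 tokens.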

In [24]:
max_input_length = 1024
max_target_length = 256

def preprocess_function(examples):
    inputs = [prefix + doc for doc in examples["article"]]
    model_inputs = tokenizer(inputs, max_length=max_input_length, truncation=True)

    # Setup the tokenizer for targets
    with tokenizer.as_target_tokenizer():
        labels = tokenizer(examples["abstract"], max_length=max_target_length, truncation=True)

    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

This function works with one or several examples. In the case of several examples, the tokenizer will return a list of lists for each key:

In [25]:
preprocess_function(raw_datasets['train'][:2])

{'input_ids': [[0, 405, 11493, 11, 55, 87, 654, 207, 9, 1484, 8, 189, 1338, 1814, 207, 11, 1402, 3505, 9, 16640, 2156, 941, 11, 1484, 11793, 17930, 8, 73, 368, 13785, 5804, 4, 134, 41, 23249, 16, 6533, 25, 41, 15650, 17215, 672, 9, 23385, 43202, 36, 1368, 428, 4839, 36, 1368, 428, 28696, 316, 821, 1589, 385, 462, 4839, 8, 189, 16072, 25, 10, 898, 9, 5, 7482, 2199, 2156, 13162, 2156, 2129, 10894, 2156, 17930, 2156, 50, 13785, 5804, 479, 6104, 3218, 3608, 14, 7967, 8, 18327, 139, 111, 2174, 797, 71, 13785, 5804, 2156, 941, 11, 471, 8, 5397, 16640, 2156, 189, 28, 13969, 30, 41, 23249, 4, 1978, 41, 23249, 747, 41089, 1290, 5298, 215, 25, 16069, 2156, 8269, 2156, 8, 25599, 642, 22423, 2156, 8, 4634, 189, 33, 10, 2430, 1683, 15, 1318, 9, 301, 36, 2231, 1168, 4839, 8, 819, 2194, 11, 1484, 19, 1668, 479, 4634, 2156, 7, 1477, 2166, 13838, 2156, 2231, 1168, 2156, 8, 17618, 32444, 11, 1484, 19, 1668, 2156, 24, 74, 28, 5701, 7, 185, 10, 16300, 1548, 11, 9397, 9883, 54, 240, 1416, 13, 1668, 111, 30

To apply this function to every article/abstract pair in our dataset, we just use the `map` method of the `raw_datasets` object we created earlier. This applies the function to all the elements of all the splits in `raw_datasets`, so our training, validation and testing data are preprocessed with a single command.

In [26]:
tokenized_datasets = raw_datasets.map(preprocess_function, batched=True)

  0%|          | 0/8 [00:00<?, ?ba/s]

  0%|          | 0/2 [00:00<?, ?ba/s]

  0%|          | 0/2 [00:00<?, ?ba/s]

Even better, the results are automatically cached by the 🤗 `Datasets` library to avoid spending time on this step the next time you run your notebook. The 🤗 `Datasets` library is normally smart enough to detect when the function you pass to `map` has changed (and thus that the cached data should not be used). For instance, it will properly detect if you change the task in the first cell and rerun the notebook. 🤗 `Datasets` warns you when it uses cached files; you can pass `load_from_cache_file=False` in the call to `map` to ignore the cached files and force the preprocessing to be applied again.

Note that we passed `batched=True` to encode the texts together in batches. This lets us take full advantage of the fast tokenizer we loaded earlier, which uses multi-threading to process the texts in a batch concurrently.
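As a rough mental model of what `batched=True` changes, the sketch below mimics a batched map in plain Python: the function receives a dictionary of columns holding many rows at once instead of a single example. `toy_batched_map` and `count_words` are hypothetical helpers, not part of 🤗 `Datasets`, and unlike the real `map` this toy returns only the new columns.

```python
from typing import Callable, Dict, List

Batch = Dict[str, List[str]]


def toy_batched_map(
    columns: Batch,
    fn: Callable[[Batch], Dict[str, list]],
    batch_size: int = 1000,
) -> Dict[str, list]:
    """Call fn once per slice of rows and concatenate the returned columns."""
    n_rows = len(next(iter(columns.values())))
    out: Dict[str, list] = {}
    for start in range(0, n_rows, batch_size):
        # Each call sees a dict of lists, like `examples` in preprocess_function.
        batch = {name: values[start:start + batch_size] for name, values in columns.items()}
        for name, values in fn(batch).items():
            out.setdefault(name, []).extend(values)
    return out


calls = []  # records how many rows each call received


def count_words(batch: Batch) -> Dict[str, list]:
    calls.append(len(batch["article"]))
    return {"n_words": [len(text.split()) for text in batch["article"]]}


toy = {"article": ["a b c", "d e", "f", "g h i j", "k"]}
result = toy_batched_map(toy, count_words, batch_size=2)

print(calls)
print(result)
```

With the default batch size of 1000, the 8,000 training examples would be processed in 8 calls, which is consistent with the `0/8` progress bar shown above.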

## Fine-tuning the model

Now that our data is ready, we can download the pretrained model and fine-tune it. Since our task is of the sequence-to-sequence kind, we use the `AutoModelForSeq2SeqLM` class. Like with the tokenizer, the `from_pretrained` method will download and cache the model for us.

In [27]:
from transformers import AutoModelForSeq2SeqLM, DataCollatorForSeq2Seq, Seq2SeqTrainingArguments, Seq2SeqTrainer

model = AutoModelForSeq2SeqLM.from_pretrained(model_checkpoint)

Downloading:   0%|          | 0.00/423M [00:00<?, ?B/s]

Note that we don't get a warning like in our classification example. This means we used all the weights of the pretrained model and there is no randomly initialized head in this case.

To instantiate a `Seq2SeqTrainer`, we will need to define three more things. The most important is the [`Seq2SeqTrainingArguments`](https://huggingface.co/transformers/main_classes/trainer.html#transformers.Seq2SeqTrainingArguments), which is a class that contains all the attributes to customize the training. It requires one folder name, which will be used to save the checkpoints of the model, and all other arguments are optional:

In [28]:
batch_size = 2
model_name = model_checkpoint.split("/")[-1]
args = Seq2SeqTrainingArguments(
    f"{model_name}-finetuned-pubmed",
    evaluation_strategy="epoch",
    learning_rate=2e-5,
    per_device_train_batch_size=batch_size,
    per_device_eval_batch_size=batch_size,
    weight_decay=0.01,
    save_total_limit=3,
    num_train_epochs=5,
    predict_with_generate=True,
    fp16=True,
    push_to_hub=True,
    seed=42,
)

Here we set the evaluation to be done at the end of each epoch, tweak the learning rate, use the `batch_size` defined at the top of the cell and customize the weight decay. Since the `Seq2SeqTrainer` will save the model regularly and our dataset is quite large, we tell it to keep at most three checkpoints. Lastly, we use the `predict_with_generate` option (to properly generate summaries) and activate mixed precision training (to go a bit faster).

The `push_to_hub=True` argument sets everything up so that we can push the model to the [Hub](https://huggingface.co/models) regularly during training. Remove it if you didn't follow the installation steps at the top of the notebook. If you want to save your model locally under a name that differs from the repository it will be pushed to, or if you want to push your model under an organization rather than your own namespace, use the `hub_model_id` argument to set the repo name (it needs to be the full name, including your namespace: for instance `"sgugger/t5-finetuned-xsum"` or `"huggingface/t5-finetuned-xsum"`).

Then, we need a special kind of data collator, which will not only pad the inputs to the maximum length in the batch, but also the labels:

In [29]:
data_collator = DataCollatorForSeq2Seq(tokenizer, model=model)
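To illustrate what padding the labels means, here is a minimal sketch on plain lists. `pad_labels` is a hypothetical helper, not the collator's actual code, and the token ids are made up. It uses `-100` as the fill value, which is the default label padding id of `DataCollatorForSeq2Seq` and the index that the PyTorch cross-entropy loss ignores by default.

```python
from typing import List


def pad_labels(labels: List[List[int]], label_pad_id: int = -100) -> List[List[int]]:
    """Right-pad each label sequence to the longest one in the batch."""
    longest = max(len(seq) for seq in labels)
    return [seq + [label_pad_id] * (longest - len(seq)) for seq in labels]


# Two made-up target sequences of different lengths.
batch_labels = [[0, 11, 12, 2], [0, 13, 2]]
padded_labels = pad_labels(batch_labels)

print(padded_labels)
```

Because the padded positions hold `-100` rather than a real token id, they contribute nothing to the training loss.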

The last thing to define for our `Seq2SeqTrainer` is how to compute the metrics from the predictions. We need to define a function for this, which will just use the `metric` we loaded earlier, and we have to do a bit of pre-processing to decode the predictions into texts:

In [30]:
import nltk
import numpy as np

def compute_metrics(eval_pred):
    predictions, labels = eval_pred
    decoded_preds = tokenizer.batch_decode(predictions, skip_special_tokens=True)
    # Replace -100 in the labels as we can't decode them.
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    decoded_labels = tokenizer.batch_decode(labels, skip_special_tokens=True)
    
    # Rouge expects a newline after each sentence
    decoded_preds = ["\n".join(nltk.sent_tokenize(pred.strip())) for pred in decoded_preds]
    decoded_labels = ["\n".join(nltk.sent_tokenize(label.strip())) for label in decoded_labels]
    
    result = metric.compute(predictions=decoded_preds, references=decoded_labels, use_stemmer=True)
    # Extract a few results
    result = {key: value.mid.fmeasure * 100 for key, value in result.items()}
    
    # Add mean generated length
    prediction_lens = [np.count_nonzero(pred != tokenizer.pad_token_id) for pred in predictions]
    result["gen_len"] = np.mean(prediction_lens)
    
    return {k: round(v, 4) for k, v in result.items()}
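The two NumPy steps in `compute_metrics` can be checked in isolation on small arrays. The sketch below uses made-up token ids and an assumed pad id of `1` (the real value comes from `tokenizer.pad_token_id`); it shows how `-100` entries in the labels are swapped for the pad id before decoding, and how the mean generated length is counted.

```python
import numpy as np

toy_pad_token_id = 1  # assumption for illustration; use tokenizer.pad_token_id in practice

# Labels as they come out of the collator: -100 marks padded positions.
labels = np.array([[0, 11, 12, 2], [0, 13, 2, -100]])

# -100 is not a valid token id, so replace it with the pad id before decoding.
decodable = np.where(labels != -100, labels, toy_pad_token_id)

# Generated sequences are padded with the pad id; count only the real tokens.
predictions = np.array([[0, 11, 12, 2, 1, 1], [0, 13, 14, 15, 2, 1]])
prediction_lens = [int(np.count_nonzero(pred != toy_pad_token_id)) for pred in predictions]
gen_len = float(np.mean(prediction_lens))

print(decodable.tolist())
print(prediction_lens, gen_len)
```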

Then we just need to pass all of this along with our datasets to the `Seq2SeqTrainer`:

In [31]:
trainer = Seq2SeqTrainer(
    model,
    args,
    train_dataset=tokenized_datasets["train"],
    eval_dataset=tokenized_datasets["validation"],
    data_collator=data_collator,
    tokenizer=tokenizer,
    compute_metrics=compute_metrics
)

Cloning https://huggingface.co/Kevincp560/distilbart-xsum-12-1-finetuned-pubmed into local empty directory.
Using amp half precision backend


We can now finetune our model by just calling the `train` method:

In [32]:
trainer.train()

The following columns in the training set  don't have a corresponding argument in `BartForConditionalGeneration.forward` and have been ignored: abstract, article. If abstract, article are not expected by `BartForConditionalGeneration.forward`,  you can safely ignore this message.
***** Running training *****
  Num examples = 8000
  Num Epochs = 5
  Instantaneous batch size per device = 2
  Total train batch size (w. parallel, distributed & accumulation) = 2
  Gradient Accumulation steps = 1
  Total optimization steps = 20000


Epoch,Training Loss,Validation Loss,Rouge1,Rouge2,Rougel,Rougelsum,Gen Len
1,3.3604,3.157548,25.0078,11.5381,18.4246,23.1605,54.8935
2,3.0697,2.947829,26.4947,12.5411,19.4328,24.6123,57.948
3,2.8638,2.867243,26.8856,12.7568,19.8949,24.8745,59.6245
4,2.7243,2.834695,26.7347,12.5152,19.6516,24.7756,60.439
5,2.6072,2.823643,27.0012,12.728,19.8685,25.0485,59.969


Saving model checkpoint to distilbart-xsum-12-1-finetuned-pubmed/checkpoint-500
Configuration saved in distilbart-xsum-12-1-finetuned-pubmed/checkpoint-500/config.json
Model weights saved in distilbart-xsum-12-1-finetuned-pubmed/checkpoint-500/pytorch_model.bin
tokenizer config file saved in distilbart-xsum-12-1-finetuned-pubmed/checkpoint-500/tokenizer_config.json
Special tokens file saved in distilbart-xsum-12-1-finetuned-pubmed/checkpoint-500/special_tokens_map.json
tokenizer config file saved in distilbart-xsum-12-1-finetuned-pubmed/tokenizer_config.json
Special tokens file saved in distilbart-xsum-12-1-finetuned-pubmed/special_tokens_map.json
Saving model checkpoint to distilbart-xsum-12-1-finetuned-pubmed/checkpoint-1000
Configuration saved in distilbart-xsum-12-1-finetuned-pubmed/checkpoint-1000/config.json
Model weights saved in distilbart-xsum-12-1-finetuned-pubmed/checkpoint-1000/pytorch_model.bin
tokenizer config file saved in distilbart-xsum-12-1-finetuned-pubmed/checkpoint

TrainOutput(global_step=20000, training_loss=3.016437646484375, metrics={'train_runtime': 14930.2111, 'train_samples_per_second': 2.679, 'train_steps_per_second': 1.34, 'total_flos': 4.11965173407744e+16, 'train_loss': 3.016437646484375, 'epoch': 5.0})
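The numbers in the log above can be reproduced with a little arithmetic. The sketch below assumes a single device and no gradient accumulation, as reported in the log, and takes the example count, batch size, epoch count and runtime directly from the output.

```python
import math

num_examples = 8000          # "Num examples" in the log
batch_size = 2               # per-device batch size, single device assumed
num_epochs = 5
train_runtime = 14930.2111   # seconds, from TrainOutput

steps_per_epoch = math.ceil(num_examples / batch_size)
total_steps = steps_per_epoch * num_epochs

steps_per_second = total_steps / train_runtime
samples_per_second = num_examples * num_epochs / train_runtime

print(total_steps)                   # matches "Total optimization steps"
print(round(steps_per_second, 2))    # matches train_steps_per_second
print(round(samples_per_second, 3))  # matches train_samples_per_second
```

At this batch size the run takes a little over four hours, so a larger batch size (memory permitting) is the first thing to try if training time matters.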

You can now upload the result of the training to the Hub by executing this instruction:

In [33]:
trainer.push_to_hub()

Saving model checkpoint to distilbart-xsum-12-1-finetuned-pubmed
Configuration saved in distilbart-xsum-12-1-finetuned-pubmed/config.json
Model weights saved in distilbart-xsum-12-1-finetuned-pubmed/pytorch_model.bin
tokenizer config file saved in distilbart-xsum-12-1-finetuned-pubmed/tokenizer_config.json
Special tokens file saved in distilbart-xsum-12-1-finetuned-pubmed/special_tokens_map.json
Several commits (2) will be pushed upstream.
The progress bars may be unreliable.


Upload file pytorch_model.bin:   0%|          | 3.36k/845M [00:00<?, ?B/s]

Upload file runs/Mar04_18-48-11_9242787ecb4f/events.out.tfevents.1646419720.9242787ecb4f.77.0:  25%|##5       …

To https://huggingface.co/Kevincp560/distilbart-xsum-12-1-finetuned-pubmed
   8d5ba37..d4f3569  main -> main

To https://huggingface.co/Kevincp560/distilbart-xsum-12-1-finetuned-pubmed
   d4f3569..ed7252e  main -> main



'https://huggingface.co/Kevincp560/distilbart-xsum-12-1-finetuned-pubmed/commit/d4f35697c066ef3eb61012ed45e91a2d4dbc9a9d'

You can now share this model with all your friends, family, favorite pets: they can all load it with the identifier `"your-username/the-name-you-picked"`, so for instance:

```python
from transformers import AutoModelForSeq2SeqLM

model = AutoModelForSeq2SeqLM.from_pretrained("sgugger/my-awesome-model")
```