If you're opening this notebook on Colab, you will probably need to install 🤗 `Transformers` and 🤗 `Datasets`, along with the other dependencies listed below.

* `datasets`
* `transformers`
* `rouge-score`
* `nltk`
* `pytorch`
* `ipywidgets`

*Note*: Since we are using the GPU to optimize the performance of the deep learning algorithms, `CUDA` needs to be installed on the device.
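Before going further, it can help to confirm which of the dependencies above are already present. The following is a minimal sketch, assuming Python 3.8+ (for `importlib.metadata`) and assuming PyTorch is installed under its pip name `torch`; the helper `installed_versions` is a hypothetical name introduced for illustration.

```python
from importlib import metadata

# Distribution names as passed to pip; "torch" is the pip name for PyTorch (assumption).
REQUIRED = ["datasets", "transformers", "rouge-score", "nltk", "torch", "ipywidgets"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if it is missing."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


for name, version in installed_versions(REQUIRED).items():
    print(f"{name}: {version or 'NOT INSTALLED'}")

# Optional GPU check; skipped gracefully when torch is absent.
try:
    import torch

    print("CUDA available:", torch.cuda.is_available())
except ImportError:
    print("torch is not installed, so the CUDA check was skipped")
```

Any package reported as missing can be added to the `pip install` command in the next cell.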

In [1]:
! pip install datasets transformers rouge-score nltk ipywidgets

Collecting datasets
  Downloading datasets-1.18.3-py3-none-any.whl (311 kB)
Collecting transformers
  Downloading transformers-4.17.0-py3-none-any.whl (3.8 MB)
Collecting rouge-score
  Downloading rouge_score-0.0.4-py2.py3-none-any.whl (22 kB)
Collecting xxhash
  Downloading xxhash-3.0.0-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (212 kB)
Collecting huggingface-hub<1.0.0,>=0.1.0
  Downloading huggingface_hub-0.4.0-py3-none-any.whl (67 kB)
Collecting fsspec[http]>=2021.05.0
  Downloading fsspec-2022.2.0-py3-none-any.whl (134 kB)
Collecting aiohttp
  Downloading aiohttp-3.8.1-cp37-cp37m-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_12_x86_64.manyli

When using `nltk`, the `punkt` tokenizer data also needs to be downloaded, because it is not installed automatically with the package. Without `punkt`, the analysis will fail with an error.

In [2]:
import nltk
nltk.download('punkt')

[nltk_data] Downloading package punkt to /root/nltk_data...
[nltk_data]   Unzipping tokenizers/punkt.zip.


True

If you're opening this notebook locally, make sure your environment has the latest version of those libraries installed.

To be able to share your model with the community and generate results like the one shown in the picture below via the inference API, there are a few more steps to follow.

First, store your authentication token from the Hugging Face website (sign up [here](https://huggingface.co/join) if you haven't already!), then execute the following cell and input your username and password:

In [3]:
from huggingface_hub import notebook_login

notebook_login()

Login successful
Your token has been saved to /root/.huggingface/token
Authenticated through git-credential store but this isn't the helper defined on your machine.
You might have to re-authenticate when pushing to the Hugging Face Hub. Run the following command in your terminal in case you want to set this credential helper as the default

git config --global credential.helper store


Then you need to install `Git-LFS`.

If you are not using `Google Colab`, you may need to install `Git-LFS` manually, since the command below depends on your operating system and may not work. You can read about `Git-LFS` and how to install it [here](https://git-lfs.github.com/).

In [4]:
! apt install git-lfs

Reading package lists... Done
Building dependency tree       
Reading state information... Done
The following package was automatically installed and is no longer required:
  libnvidia-common-470
Use 'apt autoremove' to remove it.
The following NEW packages will be installed:
  git-lfs
0 upgraded, 1 newly installed, 0 to remove and 39 not upgraded.
Need to get 2,129 kB of archives.
After this operation, 7,662 kB of additional disk space will be used.
Get:1 http://archive.ubuntu.com/ubuntu bionic/universe amd64 git-lfs amd64 2.3.4-1 [2,129 kB]
Fetched 2,129 kB in 2s (1,109 kB/s)
Selecting previously unselected package git-lfs.
(Reading database ... 155320 files and directories currently installed.)
Preparing to unpack .../git-lfs_2.3.4-1_amd64.deb ...
Unpacking git-lfs (2.3.4-1) ...
Setting up git-lfs (2.3.4-1) ...
Processing triggers for man-db (2.8.3-2ubuntu0.1) ...


Make sure your version of `Transformers` is at least 4.11.0, since the functionality for sharing models on the Hub was introduced in that version:

In [5]:
import transformers

print(transformers.__version__)

4.17.0
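The check above is visual. A minimal sketch of doing the comparison in code follows; `version_tuple` is a hypothetical helper, and the literal `"4.17.0"` stands in for `transformers.__version__` so the snippet runs on its own. Comparing integer tuples matters because comparing the raw strings would rank `"4.9.0"` above `"4.11.0"`.

```python
import re


def version_tuple(version):
    """Turn '4.17.0' or '4.12.0.dev0' into a comparable tuple of leading integers."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at suffixes such as 'dev0'
        parts.append(int(match.group()))
    return tuple(parts)


MIN_VERSION = "4.11.0"
installed = "4.17.0"  # in the notebook, use transformers.__version__ instead

assert version_tuple(installed) >= version_tuple(MIN_VERSION), (
    f"transformers {installed} is older than the required {MIN_VERSION}"
)
```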


You can find a script version of this notebook to fine-tune your model in a distributed fashion using multiple GPUs or TPUs [here](https://github.com/huggingface/transformers/tree/master/examples/seq2seq).

# Fine-tuning a model on a summarization task

In this notebook, we will see how to fine-tune one of the [🤗`Transformers`](https://github.com/huggingface/transformers) models for a summarization task. We will use the [PubMed Summarization dataset](https://huggingface.co/datasets/ccdv/pubmed-summarization), which contains PubMed articles paired with their abstracts.

![Widget inference on a summarization task](https://github.com/huggingface/notebooks/blob/master/examples/images/summarization.png?raw=1)

We will see how to easily load the dataset for this task using 🤗 `Datasets` and how to fine-tune a model on it using the `Trainer` API.

In [6]:
model_checkpoint = "sshleifer/distilbart-cnn-6-6"

This notebook is built to run with any model checkpoint from the [Model Hub](https://huggingface.co/models) as long as that model has a sequence-to-sequence version in the Transformers library. Here we picked the [`sshleifer/distilbart-cnn-6-6`](https://huggingface.co/sshleifer/distilbart-cnn-6-6?text=The+tower+is+324+metres+%281%2C063+ft%29+tall%2C+about+the+same+height+as+an+81-storey+building%2C+and+the+tallest+structure+in+Paris.+Its+base+is+square%2C+measuring+125+metres+%28410+ft%29+on+each+side.+During+its+construction%2C+the+Eiffel+Tower+surpassed+the+Washington+Monument+to+become+the+tallest+man-made+structure+in+the+world%2C+a+title+it+held+for+41+years+until+the+Chrysler+Building+in+New+York+City+was+finished+in+1930.+It+was+the+first+structure+to+reach+a+height+of+300+metres.+Due+to+the+addition+of+a+broadcasting+aerial+at+the+top+of+the+tower+in+1957%2C+it+is+now+taller+than+the+Chrysler+Building+by+5.2+metres+%2817+ft%29.+Excluding+transmitters%2C+the+Eiffel+Tower+is+the+second+tallest+free-standing+structure+in+France+after+the+Millau+Viaduct.) checkpoint.

## Loading the dataset

We will use the [🤗 `Datasets`](https://github.com/huggingface/datasets) library to download the data and get the metric we need to use for evaluation (to compare our model to the benchmark). This can be easily done with the functions `load_dataset` and `load_metric`.  

In [7]:
from datasets import load_dataset, load_metric

raw_datasets = load_dataset("ccdv/pubmed-summarization")
metric = load_metric("rouge")

Downloading:   0%|          | 0.00/4.88k [00:00<?, ?B/s]

No config specified, defaulting to: pub_med_summarization_dataset/document


Downloading and preparing dataset pub_med_summarization_dataset/document to /root/.cache/huggingface/datasets/ccdv___pub_med_summarization_dataset/document/1.0.0/5792402f4d618f2f4e81ee177769870f365599daa729652338bac579552fec30...


Downloading:   0%|          | 0.00/779M [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/43.7M [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/43.8M [00:00<?, ?B/s]

0 examples [00:00, ? examples/s]

0 examples [00:00, ? examples/s]

0 examples [00:00, ? examples/s]

Dataset pub_med_summarization_dataset downloaded and prepared to /root/.cache/huggingface/datasets/ccdv___pub_med_summarization_dataset/document/1.0.0/5792402f4d618f2f4e81ee177769870f365599daa729652338bac579552fec30. Subsequent calls will reuse this data.


  0%|          | 0/3 [00:00<?, ?it/s]

Downloading:   0%|          | 0.00/2.16k [00:00<?, ?B/s]
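To give a feel for what the ROUGE metric measures, here is a deliberately simplified sketch of ROUGE-1 F1 (unigram overlap) in plain Python. It is not the implementation behind `load_metric("rouge")`, which applies its own tokenization and also reports other variants such as ROUGE-2 and ROUGE-L; `rouge1_f1` is a hypothetical name, and the two sentences are made-up examples.

```python
from collections import Counter


def rouge1_f1(prediction, reference):
    """Simplified ROUGE-1 F1: harmonic mean of unigram precision and recall."""
    pred_counts = Counter(prediction.lower().split())
    ref_counts = Counter(reference.lower().split())
    # Counter intersection keeps the minimum count of each shared token.
    overlap = sum((pred_counts & ref_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


score = rouge1_f1("the cat sat on the mat", "the cat lay on the mat")
print(f"ROUGE-1 F1: {score:.4f}")
```

A generated summary that shares more words with the reference abstract scores higher, which is the intuition to keep in mind when reading the evaluation numbers later on.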

The `raw_datasets` object itself is a [`DatasetDict`](https://huggingface.co/docs/datasets/package_reference/main_classes.html#datasetdict), which contains one key each for the training, validation and test sets:

In [8]:
raw_datasets

DatasetDict({
    train: Dataset({
        features: ['article', 'abstract'],
        num_rows: 119924
    })
    validation: Dataset({
        features: ['article', 'abstract'],
        num_rows: 6633
    })
    test: Dataset({
        features: ['article', 'abstract'],
        num_rows: 6658
    })
})

To access an actual element, you need to select a split first, then give an index:

In [9]:
raw_datasets["train"][0]

{'abstract': "<S> background : the present study was carried out to assess the effects of community nutrition intervention based on advocacy approach on malnutrition status among school - aged children in shiraz , iran.materials and methods : this case - control nutritional intervention has been done between 2008 and 2009 on 2897 primary and secondary school boys and girls ( 7 - 13 years old ) based on advocacy approach in shiraz , iran . </S> <S> the project provided nutritious snacks in public schools over a 2-year period along with advocacy oriented actions in order to implement and promote nutritional intervention . for evaluation of effectiveness of the intervention growth monitoring indices of pre- and post - intervention were statistically compared.results:the frequency of subjects with body mass index lower than 5% decreased significantly after intervention among girls ( p = 0.02 ) . </S> <S> however , there were no significant changes among boys or total population . </S> <S> 
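As the example shows, each sentence of an abstract is wrapped in `<S>` and `</S>` markers. Whether to remove them before training is a preprocessing choice that this notebook does not make at this point; the sketch below only illustrates one way to do it, with `strip_sentence_tags` as a hypothetical helper and the sample taken from the abstract above.

```python
import re


def strip_sentence_tags(text):
    """Remove <S> / </S> markers and collapse the whitespace they leave behind."""
    without_tags = re.sub(r"</?S>", " ", text)
    return re.sub(r"\s+", " ", without_tags).strip()


sample = "<S> however , there were no significant changes among boys or total population . </S>"
print(strip_sentence_tags(sample))
```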

Since the `pubmed` data is very large, we keep only a subset of the rows: 8,000 training examples, 2,000 validation examples, and 2,000 test examples.

In [10]:
raw_datasets["train"] = raw_datasets["train"].select(range(1, 8001))
raw_datasets["validation"] = raw_datasets["validation"].select(range(1, 2001))
raw_datasets["test"] = raw_datasets["test"].select(range(1, 2001))
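One detail worth noting about the index ranges used above: `range(1, 8001)` yields exactly 8,000 indices but starts at 1, so row 0 of each split is left out. The short sketch below checks this with plain Python and shows `range(8000)` as an alternative that would keep row 0; which one to use is a matter of preference, not something the notebook prescribes.

```python
# Indices used in the cell above for the training split.
train_indices = range(1, 8001)
print(len(train_indices), train_indices[0], train_indices[-1])

# Alternative with the same size that starts from the first row.
alt_indices = range(8000)
print(len(alt_indices), alt_indices[0], alt_indices[-1])
```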

To get a sense of what the data looks like, the following function shows a few examples picked at random from the dataset.

In [11]:
import datasets
import random
import pandas as pd
from IPython.display import display, HTML

def show_random_elements(dataset, num_examples=5):
    assert num_examples <= len(dataset), "Can't pick more elements than there are in the dataset."
    picks = []
    for _ in range(num_examples):
        pick = random.randint(0, len(dataset)-1)
        while pick in picks:
            pick = random.randint(0, len(dataset)-1)
        picks.append(pick)
    
    df = pd.DataFrame(dataset[picks])
    for column, typ in dataset.features.items():
        if isinstance(typ, datasets.ClassLabel):
            df[column] = df[column].transform(lambda i: typ.names[i])
    display(HTML(df.to_html()))
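The retry loop in `show_random_elements` draws indices one at a time until it has enough distinct ones. A possible alternative, sketched here under the assumption that only the index selection matters, is `random.sample`, which returns distinct indices in a single call. The helper name `pick_unique_indices` and its `seed` parameter are hypothetical additions for illustration.

```python
import random


def pick_unique_indices(dataset_len, num_examples=5, seed=None):
    """Return num_examples distinct indices in [0, dataset_len)."""
    if num_examples > dataset_len:
        raise ValueError("Can't pick more elements than there are in the dataset.")
    rng = random.Random(seed)  # a seed makes the selection reproducible
    return rng.sample(range(dataset_len), num_examples)


picks = pick_unique_indices(8000, num_examples=5, seed=0)
print(picks)
```

The resulting list could be passed to `dataset[picks]` in the same way as the `picks` list built by the loop.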

In [12]:
show_random_elements(raw_datasets["train"])

Unnamed: 0,article,abstract
0,"several species of bacteria have been linked to chronic infections of colon and have been shown to increase the risk of colon cancer . streptococcus bovis has been evaluated as one of the possible etiologic agents for colorectal cancer . \n s. bovis is a gram - positive bacterium , which is considered as a normal inhabitant of the human gastrointestinal tract but is less frequently present than other streptococcus species . whatever , s. bovis has been shown to be an increased cause of endocarditis and bacteraemia and found to be associated with gastrointestinal diseases [ 36 ] or with the colorectal cancer [ 710 ] . although a number of bacteria have been associated with cancer , their possible role in carcinogenesis is unclear . in some cases helicobacter pylori may cause stomach cancer [ 11 , 12 ] since animal models have demonstrated koch 's third and fourth postulates for the role of h. pylori in the causation of stomach cancer . moreover , it has been found also that salmonella typhi is associated with gallbladder cancer and escherichia coli with crohn 's disease as well as colon cancer [ 15 , 16 ] . mccoy and mason first reported a case of the association between streptococcal endocarditis and colon cancer in 1951 . many studies have examined the presence of s. bovis in stool samples obtained from patients with colorectal cancer to find a relationship between this bacterium and the risk of colorectal cancer . in 1977 , klein et al . found a significantly strong association of s. bovis in stool samples of patients with colon cancer compared with healthy controls and patients with nonmalignant gastrointestinal disease . in contrast , other studies did not find any significant association between faecal s. bovis and human colorectal cancer [ 2124 ] . the aims of the current study were to isolate s. 
bovis from stool specimens collected from patients with malignant and non - malignant gastrointestinal diseases and from a healthy group to compare the association of the bacterium in malignant and non - malignant gastrointestinal diseases and to determine the susceptibility of the isolated strains to different antimicrobial agents . stool specimens were collected in sterile containers before the initiation of therapy from inpatients with malignant and non - malignant gastrointestinal diseases in ibn - sina specialized hospital , khartoum , sudan . the stool samples were from 28 inpatients with malignant gastrointestinal diseases ( table 1 ) , 27 inpatients with non - malignant gastrointestinal diseases ( table 2 ) , and 50 controls . 20 out of the 50 controls were outpatients who came to the gastric intestinal tract clinic suffering from the gastrointestinal tract problems and the medical examination and laboratory tests showed that they did not have any type of cancer . the inpatients and the 20 controls were working in agriculture and animal husbandry and aged between 25 and 65 years . 5% of each inpatients and controls aged from 25 to 50 years and 95% aged from 51 to 65 years . 30 out of the 50 controls were apparently - healthy farmers aged 5086 years and the mean age was 64 10 years . the stool samples were collected during the medical survey for infectious and tropical diseases organized by university of medical sciences and technology in sudan from gezira agriculture scheme with a population of 40 thousand people . the stool samples were cultured on macconkey agar plates ( himedia limited , bombay ) and incubated aerobically at 37c overnight . antimicrobial susceptibility to penicillin and trimethoprim - sulfamethoxazole was carried out on the isolated strains by disk diffusion method on mueller - hinton agar plates ( himedia limited , bombay ) . 
sensitivity of bacteria to each antibiotic was carried out by measuring the diameter of inhibition of bacterial growth around the disc . a diameter of 26 mm was considered as the cut - off point for sensitivity for the penicillin and a diameter of 24 mm for the trimethoprim - sulfamethoxazole according to national committee for clinical laboratory standard ( nccls ) method . \n \n\t\t\t\t\t\n\t\t\t\t\t test was used for significant prevalence of s. bovis in stool specimens from patients with malignant or with non - malignant gastrointestinal diseases . the statistical analysis was also performed to compare between presence of s. bovis in stool specimens from patients with noncolonic cancer and with non - malignant gastrointestinal diseases . stool specimens were collected from fifty - five patients with malignant and non - malignant gastrointestinal diseases and from fifty persons as control group . the specimens were examined to the prevalence of streptococcus bovis strains and their susceptibility to the used antibiotics . all the isolated s. bovis strains were found to be sensitive to penicillin and trimethoprim - sulfamethoxazole with diameter of inhibition from 26 to 32 mm and from 24 to 32 mm , respectively . s. bovis was not detected in the control group but it was detected in the inpatients aged 5165 years . bacteriological analysis of stool specimens from the patients with malignant gastrointestinal diseases showed that out of 28 specimens s. bovis was isolated from 10 patients . six positive were from patients with carcinoma of the stomach , and one positive was from each patient with carcinoma of rectum , pancreas , colon , and liver . the prevalence of s. bovis in stool from the patients with carcinoma in stomach , rectum , pancreas , colon , polyps , and hepatocytes was 55% , 25% , 14% , 50% , 0% , and 50% , respectively ( table 3 ) . 
analysis of stool specimens from the patients with non - malignant gastrointestinal diseases showed that out of 27 specimens s. bovis was isolated from 5 patients . two positive from patients with liver cirrhosis , two positive from patients with obstructive jaundice , and one from patient with portal hypertension were found . the prevalence of s. bovis in stool from the patients with crohn 's disease , liver cirrhosis , obstructive jaundice , portal hypertension , and viral hepatitis was 0% , 28% , 33% , 14% , and 0% , respectively ( table 4 ) . \n\t\t\t\t\t\n\t\t\t\t\t from analysing the prevalence of s. bovis in stool from the patients with malignant and non - malignant gastrointestinal diseases and healthy controls was 36% , 18% , and 0% , respectively ( figure 1 ) . the statistical analysis showed that the prevalence of s. bovis in stool specimens from patients with malignant or with non - malignant gastrointestinal diseases was statistically significant ( p value of test was 0.02 ) . colorectal cancer is the fourth most common cancer among men and third most common among women worldwide . a large number of previous studies point to association of s. bovis with gastrointestinal diseases [ 5 , 6 ] and cancer of the human colon . the current study finds a correlative relationship between existence of s. bovis and malignant gastrointestinal diseases since prevalence of the bacterium in stool from the patients with carcinoma in stomach , rectum , colon , and hepatocytes was 55% , 25% , 50% , and 50% , respectively . in comparison , the prevalence of s. bovis in stool from the patients with crohn 's disease , liver cirrhosis , obstructive jaundice , portal hypertension , and viral hepatitis was 0% , 28% , 33% , 14% , and 0% , respectively . \n\t\t\t\t from analysis the prevalence of s. bovis in stool from the patients with malignant and non - malignant gastrointestinal diseases and healthy controls was 36% , 18% , and 0% , respectively . 
the test statistical analysis showed that the prevalence of s. bovis in stool specimens from patients with malignant or with non - malignant gastrointestinal diseases was statistically significant ( p = 0.02 ) . the current findings confirm the previous data that correlated the association of s. bovis with colorectal cancer specially klein study 1977 and the later studies [ 1820 ] . in contrast to the literature regarding an association with colorectal cancer , less is written about associations of s. bovis with other gastrointestinal diseases or with other cancers except klein study 1977 but klein 1987 reported also the lack of association of s. bovis with noncolonic gastrointestinal carcinoma . however , alazmi et al . , 2006 found that s. bovis bacteremia in adults was frequently associated with hepatic dysfunction , and zarkin et al . , postulated a triad of s. bovis bacteremia , liver disease , and colonic pathology whereby the liver disease might account for the increased fecal carriage , entry to the portal venous system , or passage from the portal to systemic circulation of s. bovis . in this context , our findings add more information about the association of s. bovis with non - malignant gastrointestinal diseases and with non - colonic cancer : since the current study isolated s. bovis from the stool of patients with liver cirrhosis , obstructive jaundice , and portal hypertension ( table 4 ) and the bacterium was isolated from stool of patients with carcinoma in stomach , rectum , and hepatocytes ( table 3 ) , the association of s. bovis with non - malignant gastrointestinal diseases or with noncolonic cancer was not significantly different by test ( p > 0.05 ) . however , klein 1977 isolated s. bovis from stool of patients with inflammatory bowel disease , other gastrointestinal disorders and noncolonic neoplasms and he found that presence of s. bovis in stool of patients with non - colonic cancer was not significantly different from that in controls . 
tjalsma and colleagues detected immune reactions against s. bovis antigens in sera of 11 out of 12 colon cancer patients and in 3 out of 4 patients with colon polyps . they found that one of the diagnostic antigens represents a surface - exposed heparin - binding protein that , according to their speculation , might be involved in the attachment of s. bovis to tumor cells . they claim that profiling of the humoral immune response against s. bovis infections may represent a promising diagnostic tool in early detection of human colon cancer . however , research has not yet determined if s. bovis is a causative agent of colon cancer or if preexisting cancer makes the lumen of the large intestine more hospitable to s. bovis outgrowth . previous findings ( reviewed in ) suggest an active role of s. bovis in the promotion of intestinal carcinogenesis when adult rats were treated with azoxymethane for 2 weeks and subsequently received injections with either s. bovis bacteria or wall - extracted antigens twice weekly . the authors observed progression of preneoplastic lesions , enhanced expression of proliferation markers , and increased production of interleukin-8 in the colonic mucosa in these rats . the same group used a partially purified s. bovis s300 fraction representing 12 different proteins and triggered the synthesis of proinflammatory proteins ( human interleukin-8 and prostaglandin e2 ) , correlated with the in vitro overexpression of cyclooxygenase-2 in human colon carcinoma cells and in rat colonic mucosa . these data could point to a role of oxygen radicals in colon carcinogenesis induced by a chronic infection with s. bovis . the mechanism could be similar to the one suspected for the development of gastric carcinomas after persisting h. pylori infections since a cecropin - like h. pylori peptide , hp ( 220 ) , was found to be a monocyte chemoattractant and activated the monocyte nadph oxidase to produce oxygen radicals . 
presently it would still be important to know whether the increased presence of s. bovis in colonic cancers and polyps results from the preferential bacterial colonization of these cancers and their precursors or whether s. bovis represents a carcinogen that is causally involved in gastrointestinal cancer . however , our study presents a significant association of streptococcus bovis with malignant gastrointestinal diseases . the significant association of s. bovis with malignant gastrointestinal diseases compared to its association with non - malignant gastrointestinal diseases presented in this study confirms the previous studies about the association between this bacterium and colorectal cancer and may support the idea that there is correlation between this bacterium and the malignant gastrointestinal diseases .","<S> \n streptococcus bovis is a gram - positive bacterium causing serious human infections , including endocarditis and bacteremia , and is usually associated with underlying disease . </S> <S> the aims of the current study were to compare prevalence of the bacterium associated with malignant and nonmalignant gastrointestinal diseases and to determine the susceptibility of the isolated strains to different antimicrobial agents . </S> <S> the result showed that the prevalence of s. bovis in stool specimens from patients with malignant or with nonmalignant gastrointestinal diseases was statistically significant . </S> <S> this result may support the idea that there is correlation between s. bovis and the malignant gastrointestinal diseases . </S>"
1,"about 30 % of people with congenital hearing loss are syndromic and the remaining 70 % are non - syndromic . in addition , most elderly people develop age - related ( late - onset ) hearing loss [ 13 ] . in general , these hearing losses have been classified as different diseases due to distinct pathogeneses [ 1 , 2 ] . sensorineural hearing losses are caused by impairments of inner ears and are difficult to cure due to the location and complex morphology of inner ears [ 1 , 2 ] . sensorineural hearing loss is a clinically heterogeneous disease leading to negative impacts on quality of life ( qol ) in all generations . inner ears have been analyzed in order to clarify the pathogeneses of sensorineural hearing losses . the organ of corti contains two kinds of sensory cells [ inner hair cells ( ihcs ) and outer hair cells ( ohcs ) ] and plays an important role in mechanotransduction , by which sound stimuli are converted into electric stimuli . auditory information from the sensory cells is transferred to spiral ganglion neurons ( sgns ) as the primary carriers and is eventually transferred to the auditory cortex in the cerebrum [ 1 , 2 ] . the sv consists of marginal cells , melanocytes ( also known as intermediate cells ) and basal cells , and has been shown to maintain high levels of potassium ion for endocochlear potential ( ep ) [ 4 , 5 ] . melanocytes in the inner ear are located specifically in the sv , and defects in melanocytes lead to impaired ep levels resulting in hearing loss . thus , disturbance of these constituent cells in inner ears has been shown to cause hearing losses . vestibular hair cells covered with otoconia play an important role in mechanotransduction , by which gravity impulses are converted into neural impulses . thus , the vestibule containing hair cells and an otolith is one of the organs responsible for balance . 
impairments of hearing and balance both major problems in the field of occupational and environmental health are caused by the intricate interplay of genetic , aging and environmental factors [ 13 ] . this review focuses on hearing impairments caused by neurodegeneration of sgns due to impairments of hearing - related genes ( c - ret and ednrb ) and by environmental stresses [ low frequency noise ( lfn ) and heavy metals ] . glial cell line - derived neurotrophic factor ( gdnf)one of the ligands for c - ret exerts its effect on target cells by binding to a glycosyl phosphatidylinositol ( gpi)-anchored cell surface protein ( gfr1 ) . this binding facilitates the formation of a complex with the receptor tyrosine kinase c - ret . formation of this complex activates c - ret autophosphorylation as a trigger for c - ret - mediated signaling pathways to give positive signals for cell survival [ 912 ] . previous studies have also indicated that gdnf stimulates a ret - independent signaling pathway [ 10 , 13 , 14 ] . tyrosine 1062 ( y1062 ) in c - ret plays an important role in kinase activation as one of the autophosphorylation sites , and is also a multi - docking site for several signaling molecules including shc , a transmitter for c - ret - mediated signaling pathways [ 13 , 15 , 16 ] . in both mice and humans , c - ret has been shown to be essential for the development and maintenance of the enteric nervous system ( ens ) [ 13 , 15 ] and to be the most frequent causal gene of hirschsprung disease ( hscr ; megacolon disease ) ( in 2025 % of cases ) in humans [ 17 , 18 ] . in fact , severe hscr ( e.g. , total intestinal agangliosis and impaired development of the kidney ) has been shown to develop in homozygous knock - in mice in which y1062 in c - ret was replaced with phenylalanine ( c - ret - ki - mice ) , while heterozygous c - ret y1062f knock - in mice ( c - ret - ki - mice ) are reported to have no hscr - linked phenotypes . 
thus , the results of previous studies indicate that hscr in mice develops recessively , while hscr in humans has been shown to develop dominantly due to ret mutations . as described above , c - ret and c - ret are crucial genes for hscr ; however , there had been no direct evidence to link c - ret and c - ret to hearing impairments in mice or humans . our recent studies have shown that complete unphosphorylated y1062 in c - ret , with no change in expression level , caused congenital hearing loss in c - ret - ki - mice , while partially unphosphorylated c - ret led to normal hearing development until 1 month of age but then accelerated age - related hearing loss in c - ret - ki - mice . thus , impairments of c - ret phosphorylation monogenetically result in early - onset syndromic hearing loss as well as late - onset non - syndromic hearing loss . our results correspond in part to the results of previous studies demonstrating that c - ret , gfr1 and gdnf are expressed in auditory neurons [ 22 , 23 ] and that gdnf has a protective effect on antibiotic - mediated ototoxicities [ 2427 ] . waardenburg - shah syndrome ( ws type iv , ws - iv ) , which is caused by mutations in the transcription factor sox10 , cytokine endothelin ( et)-3 and its receptor endothelin receptor b ( ednrb ) , is characterized by hypopigmentation , megacolon disease and hearing loss . the incidence of ws is 1 per 10,000 to 20,000 people . endothelin receptor b ( ednrb / ednrb ) belongs to the g - protein - coupled receptor family that mediates the multifaceted actions of endothelins [ 32 , 33 ] . mutations of ednrb / ednrb have been shown to cause embryonic defects in melanocytes and enteric ganglion neurons derived from the neural crest , resulting in hypopigmentation , megacolon disease and congenital hearing loss . 
in previous studies with animal models , both piebald - lethal rats in which ednrb is spontaneously mutated and ednrb homozygous knock - out [ ednrb(/ ) ] mice have been shown to have typical ws - iv phenotypes . thus , previous studies indicate that ednrb is a key regulatory molecule for embryonic development of melanocytes and peripheral neurons , including neurons in the ens . previous studies also demonstrated that impairments of ednrb / ednrb cause syndromic hearing loss due to congenital defects of melanocytes in the stria vascularis of the inner ear [ 30 , 3235 ] . in our previous study , ednrb protein was expressed in sgns from wild - type ( wt)-mice on postnatal day 19 ( p19 ) , while it was undetectable in sgns from wt - mice on p3 . correspondingly , ednrb homozygously deleted mice [ ednrb(/)-mice ] developed congenital hearing loss ( fig . 1 ) . thus , expression of ednrb expressed in sgns in the inner ears is required for postnatal development of hearing in mice . a therapeutic strategy for congenital hearing loss in ws - iv patients has not been established . ednrb expressed in sgns could be a novel potential therapeutic strategy for congenital hearing loss in ws - iv patients.fig . 1schematic summary of congenital deafness caused by neurodegeneration of spiral ganglion neurons ( sgns ) in c - ret - knock - in - mice and ednrb - knock - out - mice . triangles rosenthal s canals in wild - type ( wt ) ( light gray background ) , or homozygous c - ret - knock - in ( ret - ki ) and homozygous ednrb - knock - out - mice ( ednrb - ko ) ( white background ) ; gray circles / no outline immature sgns ; gray circles / thin outline sgns ; gray circles / bold outline sgns with dark gray circles / dotted outline sgns with decreased phosphorylation of y1062 in c - ret or decreased expression of ednrb . a c - ret - ki- and ednrb - ko - mice suffer from congenital deafness with neurodegeneration of sgns . 
bc - ret - ki - mice showed no y1062-phosphorylated sgns even on p8 , although y1062-phosphorylated sgns began to appear in wt mice from p8 . ednrb - ko - mice also showed undetectably low expression level of ednrb in sgns on p8 , although ednrb - positive sgns began to appear in wt mice from p8 schematic summary of congenital deafness caused by neurodegeneration of spiral ganglion neurons ( sgns ) in c - ret - knock - in - mice and ednrb - knock - out - mice . triangles rosenthal s canals in wild - type ( wt ) ( light gray background ) , or homozygous c - ret - knock - in ( ret - ki ) and homozygous ednrb - knock - out - mice ( ednrb - ko ) ( white background ) ; gray circles / no outline immature sgns ; gray circles / thin outline sgns ; gray circles / bold outline sgns with dark gray circles / dotted outline sgns with decreased phosphorylation of y1062 in c - ret or decreased expression of ednrb . a c - ret - ki- and ednrb - ko - mice suffer from congenital deafness with neurodegeneration of sgns . bc - ret - ki - mice showed no y1062-phosphorylated sgns even on p8 , although y1062-phosphorylated sgns began to appear in wt mice from p8 . ednrb - ko - mice also showed undetectably low expression level of ednrb in sgns on p8 , although ednrb - positive sgns began to appear in wt mice from p8 phosphorylation of y1062 in c - ret has been shown to mediate several biological responses , including development and survival of neuronal cells [ 13 , 37 ] . in our recent studies , c - ret - ki - mice developed severe congenital deafness with neurodegeneration of sgns on postnatal day ( p ) 8 - 18 , while c - ret - ki - mice showed morphology of sgns comparable to that in wt mice on p2 - 3 . phoshorylation of y1062 in c - ret of sgns from wt mice on p2 - 3 was below the limit of detection , while that on p8 - 18 was clearly detectable . 
thus , it is thought that sgns from c - ret - ki - mice developed normally at least until p3 after birth , when y1062 in c - ret of sgns from wt mice is unphosphorylated . however , in c - ret - ki - mice , phosphorylation of y1062 is no longer maintained by p8 - 18 , when y1062 in c - ret of sgns from wt mice exhibits significant phosphorylation . furthermore , partially unphosphorylated y1062 in c - ret of sgns accelerated age - related hearing loss with accelerated reduction of sgns from 4 months of age , while normal hearing and normal density of sgns were observed at least until 1 month of age , when hearing has matured . on the other hand , ednrb protein was expressed in sgns from wt - mice on postnatal day 19 ( p19 ) , while it was undetectable in sgns from wt - mice on p3 . correspondingly , ednrb(-/-)-mice with congenital hearing loss showed a decreased number of sgns ( fig . 1 ) . thus , our results show that ednrb expression in sgns in inner ears is required for postnatal survival of sgns in mice . the neurodegeneration of sgns from c - ret - ki - mice and ednrb(-/-)-mice did not show typical apoptotic signals and did not involve disturbance of hair bundles of ihcs and ohcs [ 20 , 36 ] . the congenital hearing loss involving neurodegeneration of sgns as well as megacolon disease in ednrb(-/-)-mice were improved markedly by introducing an ednrb transgene under the control of the dopamine beta - hydroxylase promoter ( ednrb(-/-) ; dbh - ednrb - mice ) . neurodegeneration of sgns was also rescued by introducing constitutively activated ret in the case of c - ret - mediated hearing loss . thus , our results indicate that c - ret and ednrb expressed in sgns could be molecular targets in the prevention of hearing impairments . \n exposure to noise is recognized as one of the major environmental factors causing hearing loss . 
noise consists of sound with broad frequencies , but there is limited information about the frequency - dependent influence of noise on health . low frequency noise ( lfn ) is constantly generated from natural and artificial sources . the frequency range of lfn is usually defined as being below 100 hz , while that of infrasound is usually below 20 hz . in our recent study , we found that chronic exposure to lfn at moderate levels of 70 db sound pressure level ( spl ) causes impaired balance involving morphological abnormalities of the vestibule with increased levels of oxidative stress ( fig . 2 ) . previous studies have shown that behavioral impairments induced by antibiotics involved degeneration of vestibular cells and oxidative stress [ 40 , 41 ] . in addition , a previous study has shown that antioxidant compounds prevent noise - induced hearing loss . ototoxicity caused by oxidative stress in inner ears has been shown to accompany impairment of antioxidant enzymes . thus , existing studies indicate the necessity for further investigation of a causal molecule related to oxidative stress in vestibular hair cells affected by lfn , and of the preventive effect of antioxidants on impaired balance caused by lfn exposure . on the other hand , exposure to heavy metals including mercury , cadmium and arsenic has been suggested to cause impairments in balance and hearing [ 44 - 46 ] in humans and experimental animals . childhood exposure to heavy metals has been shown to sensitively affect hearing development in humans [ 48 - 50 ] . aging has also been shown to affect sensitivities to ototoxic factors in mice . therefore , further studies are needed to determine the age - specific susceptibilities to environmental stresses , including heavy metals , in terms of ototoxicity in mice and humans . fig . 2 : schematic summary of impaired balance in mice caused by exposure to low frequency noise ( lfn ) . 
chronic exposure to low frequency noise ( lfn , 0.1 khz ) at moderate levels of 70 db sound pressure level ( spl ) causes impaired balance involving morphological impairments of the vestibule with enhanced levels of oxidative stress . our studies provide direct evidence that c - ret and ednrb expressed in sgns are novel targets for hearing loss . these studies underline the importance of considering the activity as well as the expression of the target molecule in order to elucidate the etiologies of hereditary deafness . in addition , environmental stresses , including exposure to noise and heavy metals , can cause impairments of hearing and balance that are affected intricately by aging and genetic factors . information obtained in previous studies prompts further investigation of the influence of environmental stresses on the impairment of hearing and balance with consideration of aging and genetic factors to develop new diagnostic , preventive and therapeutic strategies against impairment of hearing and balance .","<S> impairments of hearing and balance are major problems in the field of occupational and environmental health . </S> <S> such impairments have previously been reported to be caused by genetic and environmental factors . however , their mechanisms have not been fully clarified . on the other hand , </S> <S> the inner ear contains spiral ganglion neurons ( sgns ) in the organ of corti , which serve as the primary carriers of auditory information from sensory cells to the auditory cortex in the cerebrum . </S> <S> inner ears also contain a vestibule in the vicinity of the organ of corti one of the organs responsible for balance . 
</S> <S> thus , inner ears could be a good target to clarify the pathogeneses of sensorineural hearing losses and impaired balance . in our previous studies with c - ret knock - in mice and endothelin receptor b ( ednrb ) knock - out mice </S> <S> , it was found that syndromic hearing losses involved postnatal neurodegeneration of sgns caused by impairments of c - ret and ednrb , which play important roles in neuronal development and maintenance of the enteric nervous system . </S> <S> the organ of corti and the vestibule in inner ears also suffer from degeneration caused by environmental stresses including noise and heavy metals , resulting in impairments of hearing and balance . in this review , </S> <S> we introduce impairments of hearing and balance caused by genetic and environmental factors and focus on impairments of sgns and the vestibule in inner ears as the pathogeneses caused by these factors . </S>"
2,"tissue homeostasis is characterized by the balance between proliferation and cell growth on one side and cell death on the other side . in response to stressful stimuli , however , in cancer cells activation of pathways that favor cell survival instead of cell death under stressful conditions may contribute to tumorigenesis . in addition , this adaptive stress response promotes the development of acquired resistance , since current treatment approaches such as chemotherapy and irradiation trigger cellular stress pathways , and thus , initiate the activation of survival cascades and anti - apoptotic mechanisms . apoptosis or programmed cell death is the cell 's intrinsic death program that regulates various physiological as well as pathological processes and that is evolutionary highly conserved . hence , further insights into the molecular mechanisms of how cellular stress signals trigger anti - apoptotic mechanisms and how this contributes to tumor resistance to apoptotic cell death are expected to provide the basis for a rational approach for the development of new molecular targeted therapies . there are two major apoptosis signaling pathways , that is , the death receptor ( extrinsic ) pathway and the mitochondrial ( intrinsic ) pathway . under most circumstances , activation of either pathway eventually leads to proteolytic cleavage and thus activation of caspases , a family of cysteine proteases that act as common death effector molecules . accordingly , caspases are responsible for many of the biochemical and morphological hallmarks of apoptotic cell death by cleaving a range of substrates in the cytoplasm or nucleus . 
ligation of death receptors of the tumor necrosis factor ( tnf ) receptor superfamily such as cd95 ( apo-1/fas ) or trail receptors by their corresponding natural ligands , that is , cd95 ligand or trail , results in the recruitment of caspase-8 into a multimeric complex at the plasma membrane , the death - inducing signaling complex ( disc ) [ 6 , 7 ] . this in turn leads to caspase-8 activation , which can then directly cleave downstream effector caspases such as caspase-3 . alternatively , caspase-8 can promote outer mitochondrial membrane permeabilization by cleaving bid , a bh3-only protein that translocates to mitochondria upon cleavage and causes cytochrome c release . the mitochondrial pathway is initiated by the release of apoptogenic factors such as cytochrome c , apoptosis - inducing factor ( aif ) second mitochondria - derived activator of caspase ( smac)/direct iap binding protein with low pi ( diablo ) or omi / high temperature requirement protein a ( htra2 ) from the mitochondrial intermembrane space into the cytosol . the release of cytochrome c into the cytosol triggers activation of caspase-3 via the formation of a large cytosolic complex , which is called the apoptosome and consists of cytochrome c , apaf-1 , and caspase-9 . smac / diablo or omi / htra2 promotes caspase activation by binding to inhibitor of apoptosis ( iap ) proteins and thereby disrupts the interaction of iaps with caspase-3 or -9 [ 9 , 10 ] . therefore , cancer cells react to cellular stress signals by mounting an anti - apoptotic response , which enables cancer cells to evade apoptotic cell death and ensures cell survival . a wide range of stress signals has been identified , which may evoke a cell survival program in case of sublethal damage , while cell death is usually initiated if the damage is too severe , that is , starvation , hypoxia , dna damaging drugs , irradiation , er stress , and reactive oxygen species just to name a few . 
the molecular mechanisms that initiate cell death upon cellular stress stimuli have often not been identified exactly and likely depend on the individual stimulus . for example , following exposure to genotoxic substances , damage to dna or to other critical molecules is considered to be a common initial event which is then transmitted by the cellular stress response to the activation of cellular effector systems such as the apoptotic machinery . various stress - inducible molecules , for example , jnk , mapk / erk , nf-κb , or ceramide have been implicated in propagating the apoptotic signal [ 13 - 15 ] . besides caspase - dependent and caspase - independent apoptosis , additional non - apoptotic modes of cell death also exist and have gained increasing attention over the last years , including necrosis , autophagy , mitotic catastrophe , and lysosomal cell death [ 16 , 17 ] . while resistance to these cell death modalities can also contribute to evasion of cell death under stress conditions , the discussion of these alternative modes of cell death is beyond the scope of this review . a characteristic feature of human cancers is the evasion of apoptosis in response to stress stimuli , which contributes to both tumorigenesis and treatment resistance . in principle , cell death pathways can be blocked at different levels of the signaling cascade by upregulation of anti - apoptotic proteins and/or by downregulation or dysfunction of proapoptotic molecules . examples of altered apoptosis signaling pathways that contribute to stress resistance in human cancers will be discussed in the following paragraphs ( figure 1 ) . death receptors are part of the tumor necrosis factor ( tnf ) receptor gene superfamily , which comprises more than 20 proteins , for example , cd95 ( apo 1/fas ) , trail receptors , and tnf receptor 1 ( tnfri ) [ 7 , 19 ] . 
death receptors exert many different biological functions , including the regulation of cell death and survival , differentiation , and immune regulation [ 7 , 19 ] . members of the tnf receptor family share a characteristic cytoplasmic domain called the death domain , which is pivotal for transducing the death signal from the cell 's surface to intracellular signaling pathways [ 7 , 19 ] . signaling via death receptor can be impaired in human cancers via downregulation of receptor surface expression as part of an adaptive stress response . for example , in chemotherapy - resistant leukemia or neuroblastoma cells , downregulation of cd95 expression was identified as a mechanism of acquired drug resistance [ 20 , 21 ] . for the apoptosis - inducing trail receptors trail - r1 and trail - r2 , abnormal transport from intracellular stores such as the endoplasmatic reticulum to the cell surface rendered colon carcinoma cells resistant to trail - induced cell death . further , membrane expression of death receptors can be reduced by epigenetic changes such as cpg - island hypermethylation of gene promoters in response to stress signals [ 23 , 24 ] . abnormal expression of decoy receptors presents an alternative mechanism of resistance to trail- or cd95-induced apoptosis . to this end , the decoy receptor 3 ( dcr3 ) , which counteracts cd95-mediated apoptosis by competitively binding cd95 ligand , was shown to be overexpressed in lung carcinoma or colon carcinoma and in glioblastoma [ 25 , 26 ] and trail - r3 ; a decoy receptor for trail was reported to be expressed at high levels in gastric carcinoma . in addition , anti - apoptotic proteins with a death effector domain ( ded ) such as cellular flice - inhibitory protein ( cflip ) and phosphoprotein enriched in diabetes/ phosphoprotein enriched in astrocytes-15 kda ( ped / pea-15 ) can be aberrantly expressed upon cellular stress [ 28 , 29 ] . 
for example , high oxygen tension ( hyperoxia ) has been reported to lead to upregulation of cflip , which inhibited apoptosis during hyperoxia by suppressing both extrinsic and intrinsic apoptotic pathways , the latter via inhibition of bax . because of their sequence homology to caspase-8 , both cflip and ped can be recruited into the death - inducing signaling complex ( disc ) upon receptor ligation instead of procaspase-8 , thereby preventing caspase-8 activation [ 28 , 29 ] . moreover , the expression of caspase-8 or its function is impaired by genetic or epigenetic mechanisms in various cancers . for example , caspase-8 mutations were identified in some tumors , that is , in colorectal and head and neck carcinomas , although the overall frequency is low [ 31 , 32 ] . alternative splicing of intron 8 of the caspase-8 gene resulting in the generation of caspase-8l , a catalytically inactive splice variant , presents another mechanism of caspase-8 inactivation [ 34 , 35 ] . epigenetic silencing secondary to hypermethylation of regulatory sequences of the caspase-8 gene occurs in various tumors , for example , neuroblastoma , malignant brain tumors , ewing tumor , retinoblastoma , rhabdomyosarcoma , or small cell lung carcinoma [ 33 , 36 - 39 ] . furthermore , phosphorylation of caspase-8 on tyrosine 308 by , for example , src has been shown to interfere with its proapoptotic activity . the bcl-2 family of proteins consists of both anti - apoptotic proteins , for example , bcl-2 , bcl - xl , and mcl-1 , as well as proapoptotic molecules such as bax , bak , and bh3 domain only molecules . there are currently two models to explain the activation of bax and bak by bh3-only proteins . 
the direct activation model holds that bh3-only proteins , which act as direct activators such as bim and the cleaved form of bid ( tbid ) , bind directly to bax and bak to trigger their activation , while bh3-only proteins that act as sensitizers , for example , bad , bind to the prosurvival bcl-2 proteins . according to the indirect activation model , bh3-only proteins activate bax and bak in an indirect fashion by engaging the multiple anti - apoptotic bcl-2 proteins that inhibit bax and bak , thereby releasing their inhibition on bax and bak [ 42 , 43 ] . regardless of the exact mode of bax and bak activation , the ratio of anti - apoptotic versus proapoptotic bcl-2 proteins rather than the expression levels of one particular molecule of the bcl-2 family regulates apoptosis sensitivity . an increase in the ratio of anti- to proapoptotic bcl-2 proteins has been detected in various cancers and has been correlated to tumor cell survival and apoptosis resistance . more recently , bcl-2 has also been implicated in the regulation of the intracellular redox status . bcl-2 localizes to mitochondrial membranes as well as the endoplasmatic reticulum and the nuclear envelope , which are all sites of ros production . while bcl-2 has initially been described as an anti - oxidant because of its inhibitory effect on h2o2-induced lipid peroxidation , there is also evidence that bcl-2 may promote a prooxidant intracellular milieu . accordingly , ectopic expression of bcl-2 resulted in an elevated constitutive level of superoxide anion and intracellular ph in leukemia cells . conversely , reduction of intracellular superoxide sensitized bcl-2-overexpressing tumor cells to apoptotic stimuli independent of the mitochondria . these findings provide a link between oncogene - mediated alterations in the intracellular redox status and cell survival . besides bcl-2 , also cytochrome c has been implicated in the redox regulation of apoptosis . 
once cytochrome c is released from mitochondria into the cytosol , it triggers formation of the cytochrome c / apaf-1/caspase-9-containing apoptosome , which in turn leads to activation of caspase-9 and downstream effector caspases . there is recent evidence that the redox state of cytochrome c is also involved in the regulation of apoptosis . to this end , the oxidized form of cytochrome c ( fe(3 + ) ) has been reported to induce caspase activation via the apoptosome , while the reduced form of cytochrome c ( fe(2 + ) ) is unable to do so [ 49 - 51 ] . several mechanisms have been discussed to be responsible for this redox - mediated regulation of cytochrome c activity , including different affinities of the oxidized versus the reduced form of cytochrome c for binding to apaf-1 , different abilities of these cytochrome c forms to activate apaf-1 , or , alternatively , different affinities for other factors not belonging to the apoptosome . regardless of the exact mechanisms , this regulation of the redox state of cytochrome c opens the possibility of controlling the effector phase of apoptosis at a postmitochondrial level . besides these genetic alterations in bcl-2 family proteins , impairment of mitochondrial apoptosis may also occur at the postmitochondrial level . for example , expression level or activity of apaf-1 may be reduced due to promoter hypermethylation or loss of heterozygosity at chromosome 12q22 - 23 , which in turn leads to impaired assembly of a functional apoptosome [ 52 - 56 ] . moreover , tumor resistance to apoptosis may be caused by aberrant expression or function of inhibitor of apoptosis ( iap ) proteins . iap proteins are a family of endogenous caspase inhibitors with eight human members , that is , xiap , ciap1 , ciap2 , survivin , livin ( ml - iap ) , naip , bruce ( apollon ) , and ilp-2 [ 10 , 57 ] . all iap proteins have at least one baculovirus iap repeat ( bir ) domain that is required for classification as iap family protein . 
this domain is also the region of the protein that mediates the interaction with caspases . among the iap family proteins , xiap exhibits the strongest anti - apoptotic properties and inhibits apoptosis signaling by binding to active caspase-3 and -7 and by preventing caspase-9 activation . the expression and function of iap proteins are tightly regulated by several mechanisms , among them translational regulation . to this end , it is particularly interesting to note that xiap and ciap1 belong to the proteins that are translated via an internal ribosome entry site ( ires ) . this unique property enables protein translation of these iap proteins even under cellular stress conditions when protein synthesis is usually shut down , for example , because of caspase - dependent breakdown of eukaryotic translation initiation factors coupled with activation of the double - stranded rna - activated protein kinase pkr . however , the mrnas encoding xiap or ciap1 protein contain very long 5' untranslated regions ( utrs ) , which are not amenable to a ribosome - scanning translation initiation mechanism and thus , require a cap - independent translation initiation mechanism , that is , ires - mediated translation . ires - mediated translation allows for the continued translation of xiap and ciap1 even under conditions where cap - dependent translation is inhibited such as cellular stress . in addition , ires - mediated translational regulation of xiap and ciap1 expression enables a rapid response to transient cellular stress conditions in order to delay cell death and ensure survival . of note , cellular stress signals , including low - dose irradiation , anoxia , serum starvation and chemotherapeutic drugs , have been reported to stimulate the ires activity of xiap or ciap1 [ 63 - 66 ] . this is in line with the concept that such stress signals promote cell survival under stress conditions , at least in part , via ires - mediated upregulation of anti - apoptotic proteins . 
evasion of apoptosis is one of the hallmarks of human cancers that promote tumor formation and progression as well as treatment resistance . cellular stress signals can contribute to evasion of apoptosis by activating anti - apoptotic and cell survival programs that ultimately block cell death . this interference with proper apoptosis signaling under stress conditions can occur at different points of the apoptosis signaling network , for example , within the death receptor or the mitochondrial pathway or at the postmitochondrial level . whether or not cellular stress eventually engages cell survival or cell death programs also depends on the type and strength of the stress stimulus as well as the cell type . a better understanding of the molecular mechanisms of this interplay between the cellular stress response and anti - apoptotic programs is expected to yield novel molecular targets for therapeutic interventions . the aim is to prevent protective responses in order to maximize the antitumor activity of anticancer treatment approaches .","<S> one of the hallmarks of human cancers is the intrinsic or acquired resistance to apoptosis . </S> <S> evasion of apoptosis can be part of a cellular stress response to ensure the cell 's survival upon exposure to stressful stimuli . </S> <S> apoptosis resistance may contribute to carcinogenesis , tumor progression , and also treatment resistance , since most current anticancer therapies including chemotherapy as well as radio- and immunotherapies primarily act by activating cell death pathways including apoptosis in cancer cells . </S> <S> hence , a better understanding of the molecular mechanisms regarding how cellular stress stimuli trigger antiapoptotic mechanisms and how this contributes to tumor resistance to apoptotic cell death is expected to provide the basis for a rational approach to overcome apoptosis resistance mechanisms in cancers . </S>"
3,"immunotherapies have shown promising results in cancer types previously hard to cure , such as melanoma,1 , 2 , 3 non - small cell lung cancer , and renal cell carcinoma . in addition to checkpoint - inhibiting antibodies , patient - derived t cells are a potent approach because they can be re - targeted against tumors ex vivo , or they can be infused to the patient without modifications , when extracted from the tumor biopsy . one advantage of using biopsy - derived polyclonal tumor - infiltrating lymphocytes ( tils ) is the presence of neoantigen - specific clones , til infusion requires high - dose preconditioning to eradicate suppressive immune cell subsets from the tumor microenvironment and postconditioning with high - dose systemic interleukin ( il)-2 , both often causing severe toxicities.6 , 8 instead of systemic administration of cytokines like il-2 , it could be more attractive to deliver them locally with gene therapy vectors , such as viruses.9 , 10 in particular , tumor - targeted replication - competent viruses ( i.e. , oncolytic viruses ) enable a thousand - fold amplification of transgene expression , restricted to tumor tissue . with regard to immunotherapy , oncolytic adenovirus constitutes a personalized cancer vaccine generated for each patient in situ , due to release of tumor - associated antigens . 
of note , virus - mediated danger signaling helps the immune system to recognize tumor cells , and immunostimulatory cytokines further boost this effect.3 , 13 , 14 we have shown that the most promising t cell - stimulating factors , in the context of adoptive cell therapy , are il-2 and tumor necrosis factor alpha ( tnf-α).15 , 16 , 17 regarding prior knowledge about the recombinant proteins , il-2 has been widely used in treating malignant melanoma and renal cell carcinoma and it stimulates t cell proliferation and differentiation.9 , 19 , 20 like il-2 , tnf-α can activate immune cells , but it also induces antitumor inflammation and the production of other cytokines and chemokines.22 , 23 moreover , it directly causes cancer cell necrosis and apoptosis . in this study , we constructed and characterized new oncolytic adenoviruses built on a backbone of ad5/3-e2f - d24 ( oad ) carrying human il-2 ( hil2 ) , tnf-α , or both . two modifications render virus replication tumor specific : an e2f promoter and a 24-base pair ( bp ) deletion in the constant region 2 of e1a make the viruses selective for cells defective in the retinoblastoma / p16 pathway , including most tumor cells.24 , 25 in addition , the ad5/3 chimeric capsid featuring the ad3 knob but ad5 shaft and tail has demonstrated improved cancer cell transduction as well as antitumor efficacy . importantly , safety of this configuration in humans has also been established.13 , 27 , 28 based on the results in an immunocompetent syrian hamster model of pancreatic cancer , these viruses emerged as strong candidates for stimulating the immune system in tumors locally , with the specific application of enabling effective and safe til therapy . ad5/3-e2f - d24-htnfa - ires - hil2 ( tilt-123 ) emerged as the leading candidate for human translation . oncolytic adenoviruses were constructed to feature a backbone carrying serotype 5 ( ad5 ) nucleic acid with an ad3 fiber knob . 
in addition , a 24-bp deletion ( d24 ) in the rb - binding region of adenoviral e1a together with the e2f promoter was established to direct the replication to rb - deficient cancer cells . the transgenes were inserted into the e3 region , to replace some superfluous adenoviral open reading frames , which links expression to virus replication ( figure 1a ) . all cytokine - armed adenovirus constructs were able to kill a panel of human cancer cell lines with similar efficacy as the virus without transgenes ( figures 1b and s1 ) . importantly , when virus was combined with hapt1-targeting tils , the cell - killing effect was significantly increased ( figure 1c ) . both human and hamster cells were able to produce biologically active human il-2 as well as tnf-α in vitro when infected with armed viruses ( figures 2a , 2b , and s2 ) . importantly , local production of cytokines was observed with all three armed viruses in vivo while systemic levels remained undetectable ( figure 2c ) , highlighting the feasibility of the technology from a safety perspective . to establish an optimal virus dose , immunocompromised mice bearing orthotopic human ovarian tumors ( skov3-luc ) received three different doses of ad5/3-e2f - d24-htnfa - ires - hil2 . the best efficacy was achieved with the highest dose of 1 10 viral particles ( vps ) , which was significantly different compared with the untreated control group ( p = 0.0085 ) as well as with the lowest dose of 1 10 vps ( p = 0.0287 ) on day 18 ( figure 3a ) . when skov3-luc tumors were treated with ad5/3-e2f - d24-htnfa - ires - hil2 and control viruses , all viruses had similar antitumor efficacy ( figures 3b and 3c ) , suggesting that adenovirus replication rates in vivo were comparable despite the inclusion of transgenes . encouraged by the ex vivo results ( figure 1c ) , the ability of cytokine - armed viruses to enhance til therapy was investigated in immunocompetent syrian hamsters ( figure 4 ) . 
the unarmed virus and tils had only moderate antitumor effects when administered alone , but a significant improvement in efficacy was observed when they were combined ( p = 0.002 ) ( figure 4a ) . the armed viruses had tremendous efficacy even as single agents ( figures 4b - 4d ) , but the percentage of cured animals was higher in groups receiving tils and virus compared with the virus - only groups ( p = 0.034 ; figure 4e ) . in fact , 100% of hamsters treated with the combination of ad5/3-e2f - d24-htnfa - ires - hil2 and til therapy were cured ( figure 4e ) . the experiment was repeated with a reduced virus dose with similar results in efficacy ( figure 4f ) . the cured animals from the second experiment stayed tumor free for the follow - up period of 3 months . to investigate immunological effects of the armed viruses , cells from tumors , spleens , and tumor - draining lymph nodes natural killer ( nk ) cell marker gm1 and t cell markers cd8 and cd4 were more frequent in tumors treated with il-2 , whereas the cytokine combination was the only treatment capable of increasing the level of major histocompatibility complex class ii ( mhc ii ) and decreasing the mac-2 expression in tumors ( figures 5a - 5e ) . differences in the cell composition in spleens and lymph nodes were minor ( figure s3 ) . interestingly , splenocytes exhibited greater cell proliferation ex vivo if the animals had been treated with armed viruses ( figure 5f ) . to estimate whether the viral treatment established tumor - specific immunity , cured hamsters were re - challenged with the same cancer cells as previously ( hapt1 ) . as a control , different types of cancer cells ( ddt1-mf2 ) the animals that had previously been cured with cytokine - coding viruses rejected hapt1 , whereas the animal treated with unarmed virus had a stable condition . the number of animals in these groups differs , because the curative potential of the unarmed virus was more limited . 
ddt1-mf2 tumor growth in cured hamsters was comparable to growth in naive animals , indicating the induction of tumor - specific antitumor immunity . treatment - related changes in tissue structures of the heart , lung , liver , and kidney were undetectable ( supplemental materials and methods ) . meanwhile , spleens collected from all treatment groups showed mild and minimal lymphocyte hyperplasia , slightly expanded white pulp , and a mildly increased number of heterophils in the marginal zone or red pulp . there were no differences in the severity of the changes between any of the treatment groups including mock and other controls , suggesting a lack of systemic effects linked to the transgenes as predicted by low serum concentrations ( figure 2c ) . immuno - oncology has made some clinical breakthroughs over the years , but , currently , a minority of patients respond and single - agent treatment modalities seldom lead to lasting remissions.1 , 2 , 3 , 4 thus , the utility of immunotherapy has been established on a proof - of - concept level , but much work remains to help the majority of patients with currently incurable cancer . checkpoint - inhibiting antibodies have received much attention due to their ability to downregulate immunosuppression , but they can not generate new immunity . by contrast , new immune reactions can be achieved with adoptive cell therapies and oncolytic immunotherapy , thus being complementary to the former . 
however , adoptive cell therapy of non - melanoma solid tumors has proved clinically unimpressive results , because the tumor microenvironment is able to anergize the cell graft.16 , 30 this effect can be countered with the biological phenomena resulting from adenoviral oncolysis , and it can be optimized with tnf- and il-2.16 , 31 we previously studied the effects of these cytokines with recombinant cytokines and with replication - defective vectors coding for murine cytokines.15 , 16 , 17 here , we constructed clinically applicable oncolytic adenoviruses coding for human il-2 and tnf- and used them to boost adoptive cell transfer . the viruses were capable of infecting and lysing a variety of human and syrian hamster cancer cell lines . after intratumoral virus administration , high cytokine concentrations were achieved in target tissue , while blood levels remained undetectable . in addition , signs of toxicity were absent in histopathological evaluation of all major organs . taking into consideration the potential toxicity of high systemic cytokine levels,8 , 32 the results suggest a safer approach for cytokine delivery . moreover , local delivery of il-2 could replace the need for high - dose il-2 administration often included in clinical t cell therapy protocols , although detailed studies are needed to validate the concept . to investigate the efficacy of the viruses on human cancer cells in vivo , we established an orthotopic ovarian carcinoma model in immunocompromised scid mice . despite the replication competence of the virus , the problem of virus infiltration into the tumor is well known ; thus , intratumoral administration represents a more efficient way of delivering the virus . in addition , this model develops resistance to oncolytic adenovirus , seen as a reduction in antitumor efficacy at later time points . the mechanism of resistance was previously shown to be related to the induction of interferon pathways . 
significant differences in the efficacy between the viruses were absent , which was expected because the cytokines have few effects in an immunocompromised model lacking a complete immune system . cytokine - armed oncolytic viruses are potential enhancers of t cells and t cell - based therapies.14 , 35 , 36 to investigate the synergy between our viruses and tils , we chose syrian hamsters as an immunocompetent animal model . syrian hamsters provide an interesting model for studying oncolytic adenoviruses , since human adenovirus is capable of replicating in hamsters , unlike in mice . conveniently , human il-2 and human tnf- are also bioactive in hamsters , which is evident from the hamster experiment where the unarmed virus had only moderate antitumor effects as compared with the human cytokine - coding viruses that share the same backbone construct . furthermore , we have developed a method to extract and expand tils from established hamster tumors , despite the limited availability of hamster - specific reagents . unfortunately , specific expansion of , for example , cd8 t cells is unfeasible and the til pool depends on the population present in the tumor at the time of collection . even in vitro , the synergistic effect of combining the viruses and tils was evident . in vivo , the combination of unarmed virus and tils led to improvement in efficacy . with regard to the cytokine - armed viruses , good efficacy was seen with single - agent treatment in two individual experiments . however , the combination of armed virus and til therapy resulted in the highest frequency of complete responses , as confirmed by histopathological analysis of the tumors . of note , all tumors in hamsters treated with ad5/3-e2f - d24-htnfa - ires - hil2 and tils were cured , suggesting that inclusion of adenovirally delivered immunostimulatory cytokines contributes to the curative efficacy of til therapy . 
immunocompetent mouse models have revealed that combining adenovirus - delivered tnf- and il-2 with adoptive t cell transfer decreases immunosuppressive characteristics of the tumor microenvironment and increases the number of active cytotoxic t cells in melanoma tumors.15 , 17 moreover , mouse studies have revealed that the danger signals and immunostimulation caused by adenovirus and cytokines can result in repertoire expansion by polyclonal amplification of many classes of antitumor t cells while t cell exhaustion seems to be thwarted.15 , 16 , 17 mouse tumor models are useful for immunological studies because many reagents are available . however , human adenovirus does not replicate productively in mouse tissues ; therefore , the hamster model has some advantages in this regard . there are only a few hamster - specific or cross - reactive antibodies , and only limited characterization of the immune cell subsets in the tumor is possible . nevertheless , we saw decreased presence of macrophage marker mac-2 in the hamster tumors , whereas the frequency of gm1 ( mostly nk ) cells as well as cd4 and cd8 t cells was increased following intratumoral treatment with the cytokine - coding viruses . observed from immunocompetent mice that both cd8 and cd4 cells are essential for antitumor efficacy resulting from the combination of tnf- and il-2 . il-2 induces both regulatory and helper t cells ; however , in this study , the nature of cd4 cells could not be specified . interestingly , the observed upregulation of mhc ii ( a marker for antigen - presenting cells ) might indicate that the combination of both cytokines enables efficient tumor recognition by cytotoxic cd4 t cells,39 , 41 thus contributing to the overall efficacy . 
in addition , the presence of tnf- stimulates the expression of il-6 , a cytokine needed for helper t cell differentiation.40 , 42 taken together , although systemic il-2 may cause stimulation of regulatory t cells , which may not necessarily be true for local il-2 , the possibility of the cd4 population also containing cytotoxic and helper t cells can not be excluded . in addition to immunological changes seen in the tumor microenvironment , further benefit from the cytokines was seen on the systemic level . because the spleen serves as an indicator for the common status of the immune system , we investigated the proliferative capability of the splenocytes derived from the treated animals ex vivo . interestingly , the splenocytes from the cytokine - treated animals showed increased proliferative capability compared with the controls , indicating increased adaptive cellular response . the same effect has been seen in a study with oncolytic vaccinia virus coding for granulocyte - macrophage colony - stimulating factor . moreover , adenovirally encoded cytokines evidently induced the formation of immunological memory , typically mediated by t cells . currently , because there are no anti - hamster antibodies available , the presence of memory - type t cells could not be verified . the animals treated with armed viruses resisted tumor recurrence unlike the animal treated with the virus without the cytokines , as also seen with other oncolytic viruses.10 , 43 , 44 in conclusion , we provide evidence that oncolytic adenoviruses coding for human il-2 and tnf- appear safe in immunocompetent hamsters . in addition to direct oncolytic effects and attractive immunological effects , these viruses seem useful for enabling successful til therapy of human solid tumors . whereas the virus itself shows potent antitumor efficacy , the cytokines are useful for induction of t cells in the tumor and for immunological memory responses . 
the preclinical data reported here allowed tilt biotherapeutics to initiate a human trial studying the utility of ad5/3-e2f - d24-tnfa - ires - hil2 in patients with advanced cancer receiving til therapy . all cell lines were purchased from american type culture collection ( atcc ) unless otherwise stated . human lung adenocarcinoma a549 , human pancreatic cancer panc1 , human melanoma sk - mel-28 , human ovarian carcinoma ovcar-3 , mouse fibroblast l929 , and mouse t lymphocyte cell line ctll-2 were utilized in in vitro assays . negrin , stanford medical school ) , hamster leiomyosarcoma ddt1-mf2 ( a kind gift from dr . william wold ) , and hamster pancreatic cancer hapt1 ( dsmz ) were used both in vitro and in vivo . all cell lines were maintained in rpmi 1640 or dmem supplemented with 10% fetal bovine serum ( fbs ) , 2 mm l - glutamine , 100 u / ml penicillin , and 100 g / ml streptomycin ( all from sigma - aldrich ) and cultured at + 37c and 5% co2 . the transgenes in ad5/3-e2f - d24-hil2 ( oad.il2 ) , ad5/3-e2f - d24-htnfa ( oad.tnfa ) , and ad5/3-e2f - d24-htnfa - ires - hil2 ( oad.tnfa-il2 , also known as tilt-123 ) were placed into the e3 region and they were generated with the bacterial artificial chromosome ( bac)-recombineering strategy based on the selection marker galk adapted from warming et al . , ruzsics et al . , and mck - husl et al . plasmids were propagated in electromax dh5-e competent cells ( thermo fisher scientific / invitrogen ) and the virus genomes were released from bacs with paci restriction enzyme ( thermo scientific ) . the genomes were transfected into a549 cells with lipofectamine 2000 reagent ( invitrogen ) according to the manufacturer s instructions . vp concentration was determined with optical density 260 ( od260 ) reading and infectious units by the tissue culture infectious dose ( tcid50 ) assay . 
the functionality of the viruses was confirmed by infecting human and hamster cancer cell lines and measuring cell viability with the mts assay by adding 10% celltiter 96 aqueous one solution ( promega ) for the cells and reading the absorbance at 490 nm after 2 hr . a549 , skov3-luc , hapt1 , and ddt1-mf2 cells were infected for 72 hr . human il-2 and tnf- were detected from cell supernatants using the bd cytometric bead array human soluble protein master buffer kit together with human il-2 and tnf- flex sets ( bd ) according to the manufacturer s instructions . the beads were detected with a bd accuri flow cytometer and the results were analyzed with fcap array software ( version 3.0.1 ; bd biosciences ) . biological activity of the cytokines was confirmed with il-2-dependent ctll-2 cells or tnf--sensitive l929 cells . ctll-2 cells were cultured with filtered supernatants or recombinant human il-2 ( r&d systems ) 10 ng / ml at + 37c and 5% co2 for 72 hr . for tnf- activity experiments , l929 cells were incubated with cell growth media supernatants or 0.5 ng / ml recombinant human tnf- ( r&d systems ) together with 2 g / ml actinomycin d for 24 hr . in both assays , was examined by injecting established hapt1 tumors with 1 10 vps 48 hr before collection . human il-2 and tnf- levels in serum and in homogenized tumors were quantified with a cytometric bead array and normalized to total protein content . subcutaneous hapt1 tumors established on syrian hamsters ( mesocricetus auratus , hsdhan : aura ; envigo ) were allowed to develop for 10 days . tumor fragments ( 13 mm ) were cultured in g - rex10 ( wilson wolf ) in the presence of 3,000 iu / ml human il-2 ( peprotech ) . half of the medium was replaced with fresh medium containing 1 g / ml concanavalin a ( con a ) ( sigma - aldrich ) after 5 days and every other day thereafter until day 10 . 
fresh tils were employed in animal experiments or in an ex vivo killing assay , where hapt1 cells were infected with 5,000 vps per cell for 72 hr before adding 2.5 10 tils extracted from hapt1 tumors . the experimental animal committee of the university of helsinki and the provincial government of southern finland approved the animal experiments performed in this study . immunocompromised female cb-17 scid mice ( janvier labs ) , aged 46 weeks , received 5 10 skov3-luc cells intraperitoneally . to investigate the effect of different dosing of the virus , the animals were divided to four groups ( n = 3 ) and treated with ad5/3-e2f - d24-htnfa - ires - hil2 in concentrations of 1 10 , 1 10 , or 1 10 vps in 300 l pbs intraperitoneally . animals were imaged once a week with an ivis 100 imaging system ( xenogen ) . three milligrams of d - luciferin ( synchem ) in 100 l pbs was administered intraperitoneally 8 minutes before bioluminescent imaging as previously described . to explore the differences in efficacy of the viruses in vivo , the mice were randomized into groups of five to seven mice and treated with 1 10 vps in 300 l pbs intraperitoneally or pbs alone once a week . subcutaneous hapt1 tumors ( 2 10 cells per tumor ) were established in 5- to 6-week - old immunocompetent male syrian hamsters . when the average tumor diameter reached 0.5 cm , the animals were randomized into groups of six to seven . intratumoral injection of 1 10 vps in 50 l pbs or pbs alone was performed . viral treatments were repeated on days 4 , 8 , 13 , and 19 . on day 2 , hamsters received intratumoral administration of either 4 10 hapt1-derived tils in 50 l plain rpmi or media only . twenty - five days after the treatments began , the animals were euthanized and tumors and selected organs were collected to evaluate the histopathological characteristics and immune cell subsets present . the experiment was repeated with reduced virus dosing ( 1 10 vps on days 1 and 8) . 
cured animals were re - challenged with the same tumors ( hapt1 ) or immunologically distinct tumors ( ddt1-mf2 ) ( 4 10 cells / tumor ) after a 2-week rest period . tumor growth was followed for 18 days until ddt1-mf2 tumors reached the maximum tolerated diameter ( 2 cm ) . cd8b , cd4 , and mhc class ii cells were detected from hamster tumors , spleens , and lymph nodes as described by siurala et al . polyclonal anti - asialo - gm1 af488 antibody ( ebioscience ) was used for detecting natural killer cells and a subset of monocytes / macrophages ( 0.5 g per reaction ) and anti - human / mouse galectin-3 ( mac-2 ) pe ( ebioscience ) for detecting macrophages and dendritic cells ( 0.2 g per reaction ) . in addition , hamster splenocytes from each treatment group were pooled and assessed for proliferation after 72 hr of cultivation . tumor samples along with tissue samples from the hamster heart , lung , liver , spleen , and kidney were collected for pathological evaluation . samples were fixed in 10% formalin for 48 hr and stored in 70% ethanol until histological processing . sections at 4-m thickness were cut from paraffin blocks and the slides were stained with hematoxylin and eosin . differences between groups were estimated with the two - tailed student s t test , non - parametric mann - whitney test , or anova with graphpad prism software ( version 6.05 ) . ibm ssps statistics software ( version 22.0.0.1 ) was utilized when analyzing log - transformed tumor volume data from hamster experiment using a linear mixed - effects model and for analyzing the curative effect of tils with the wilcoxon signed - rank test . , a.e . , and a.k . provided materials for the experiments that were performed by r.h . r.h . has been supported by the university of helsinki doctoral programme in clinical research . 
received grants from helsinki university central hospital ( huch ) research funds , the sigrid juselius foundation , biocentrum helsinki , biocenter finland , and finnish cancer organizations , and he is a jane and aatos erkko professor of oncology at the university of helsinki . in addition , a.h . is a shareholder in targovax asa and an employee and a shareholder in tilt biotherapeutics , ltd .","<S> adoptive cell therapy holds much promise in the treatment of cancer but results in solid tumors have been modest . </S> <S> the notable exception is tumor - infiltrating lymphocyte ( til ) therapy of melanoma , but this approach only works with high - dose preconditioning chemotherapy and systemic interleukin ( il)-2 postconditioning , both of which are associated with toxicities . to improve and broaden the applicability of adoptive cell transfer , we constructed oncolytic adenoviruses coding for human il-2 ( hil2 ) , tumor necrosis factor alpha ( tnf- ) , or both . </S> <S> the viruses showed potent antitumor efficacy against human tumors in immunocompromised severe combined immunodeficiency ( scid ) mice . in immunocompetent </S> <S> syrian hamsters , we combined the viruses with til transfer and were able to cure 100% of the animals . </S> <S> cured animals were protected against tumor re - challenge , indicating a memory response . </S> <S> arming with il-2 and tnf- increased the frequency of both cd4 + and cd8 + tils in vivo and augmented splenocyte proliferation ex vivo , suggesting that the cytokines were important for t cell persistence and proliferation . </S> <S> cytokine expression was limited to tumors and treatment - related signs of systemic toxicity were absent , suggesting safety . </S> <S> to conclude , cytokine - armed oncolytic adenoviruses enhanced adoptive cell therapy by favorable alteration of the tumor microenvironment . 
</S> <S> a clinical trial is in progress to study the utility of ad5/3-e2f - d24-htnfa - ires - hil2 ( tilt-123 ) in human patients with cancer . </S>"
4,"soybean meal is an ingredient of choice to supply energy and proteins to layers and broilers . because of that , costs and availability of soybean meal are strongly correlated with the price of agricultural commodities on the world market [ 1 , 2 ] . the trypsin inhibitors of soybean grain are well characterized and are an important determinant of nutritive value [ 3 , 4 ] . toasting is suggested within other heat processing procedures to reduce trypsin inhibitors in soybean grains or meals [ 6 , 7 ] . in benin , only jupiter variety of soybean is produced ; and the toasting method is adopted for processing grains and meal . on other side , laying performance of hens is lower when they are too fat . thus , soybean grains even toasted are used at lower rate in pullets and layers diets than broilers diets . farmers reject the utilization of toasted soybean grains in layers diet in benin ; but included efficiently up to 22% of toasted soybean grains in broiler diet . in tropical climate , important increase of dietary energy may be result a decrease of feed intake by poultry . toasted soybean grains have a high content of energy and protein , it is important to evaluate their optimal rate in diets of pullets and layer hens in hot and humid climate , mainly for birds from heavy lines often used in africa . the study was conducted in a poultry house ( 20 m 15 m ) . the house was divided into twelve ( 12 ) partitions of 25 m each . each partition had three feeders ( 1.5 m of length ) and two automatic drinkers . a total of 1000 harco ( rhode island red plymouth rock ) day - old chicks were imported from nigeria . they were vaccinated against newcastle disease , gumboro , infectious bronchitis , and avian pox . chicks were also treated regularly against helminthes and coccidiosis . at three - week - old ( starting of the experiment ) , the average weight of chicks was 206.5 2.69 g. chicks were divided into 12 groups of 81 chicks each . 
thus , at pullet and laying phases , there were 3.2 chicks / m . diets were formulated by phases . at starter ( 4 to 8 weeks - old ) , pullet ( 9 to 18 ) and laying ( 19 to 26 ) phases , respectively , four diets were formulated ( tables 1 , 2 , and 3 ) . in diets , soybean grains were included at 0% ( r0 , control ) , 5% ( r5 ) , 10% ( r10 ) , and 15% ( r15 ) . soybean grains were toasted before the processing of diets to reduce trypsin inhibitors effect . at each phase , the same quantity of feed was provided to each replicate and the birds consumed all the available feed . they are used to compare the efficiency of diets . the general linear model ( glm ) a significant effect of diets is stated when p value ( p ) is less than 0.05 . the effects of replication and of the interaction between diets and replications were not significant ( p > 0.05 ) . hence , the statistical model was \n ( 1)yi=+gi+i , \n where yi is the observation for dependent variables ; is the general mean ; gi is the fixed effect of soybean grains ; i is the residual error . the results are presented in two phases : the growth phase ( starting and pullet phases ) and the laying phase . performances during the five weeks of starting phase are shown in table 4 and figure 1 . no significant difference ( p > 0.05 ) was recorded on , daily weight gain , mortality and feed conversion ratio in spite of difference in feed composition between diets . the inclusion of toasted soybean grains at different rates in the diet did not affect significantly the growth and survivability of chicks . these results suggest a similar efficacy of diets at starter phase . also , during the pullet phase the growth performance was not significantly affected by the diet ( table 5 and figure 1 ) . however , the feed conversion ratio and the mortality rate increased at pullet compared to the starter phase . 
at starter and pullet phases , the prices of the formulated feeds decreased when the soybean grains rate increased in the diet ( table 6 ) . thus , at pullet phase , the feeding cost per kg of live body weight gain decreased in soybean - based diets compared to the control diet ( table 7 ) . however , at starter phase , due to the light decrease of live body weight gain in the soybean grains - based diets , the feeding cost increased ( table 6 ) . the soybean grains based diets are therefore more efficient at pullet phase than at starter phase . the feed efficiency evaluated at the end of pullet phase , demonstrated that for each unit of money invested in feed , the revenue from the selling of the live weight gain varied between 2.15 and 2.48 times ( table 7 ) . the efficiency of the diet increased significantly ( p < 0.05 ) with an increase of the toasted soybean grains rate . the laying phase was recorded until the peak of lay between 25 and 26 week - old ( figure 2 ) . while feed intake was equal between treatments , the results demonstrated an improvement of laying rate in layers fed r15 diet compared to the control diet . in the second month , the average laying rate in r15 was 67.8% versus 64.7% in control diet , without any significant difference between treatments . at the peak ( 26-week - old ) , laying rates were 73.9% , 68.3% , 67.1% , and 82.5% in r0 , r5 , r10 , and r15 diets , respectively . furthermore , the egg weight of layers fed r15 diet increased significantly ( p < 0.05 ) in the second month of laying ( table 8) . thus , during the two first months of laying the feed conversion of layers fed r15 was the lowest , and there was no significant effect of diet on mortality rate of hens ( p > 0.05 ) . thus , the feeding cost per egg decreased significantly during the first month of laying , but not later ( table 9 ) . 
respectively , in first and second months , the feeding cost in r15 diet represented about 32% and 86% of that in control diet , indicating a better efficiency of r15 diet compared to the control diet and the two other soybean grains - based diets . the similarity on growth performance of pullets up to 18-week - old demonstrates the efficiency of toasted soybean grains based diets in general and r15 diet in particular . the live body weight gain of pullets was higher than reported , and they grew regularly . thus , the final weights of pullets ( 1366 to 1456 g ) are in the range of 1341 to 1594 g recorded at 20-week - old in shika - brown pullets . the live weight of pullets at the starting of the laying period is one of more important criteria focused by farmers . in this study , at 18-week - old the live weight of pullets was very similar in r0 and r15 diets ( 1454 g versus 1456 g ) . during pullet phase the feed conversion ratios were lower than the 5.73 to 6.62 found between 8 and 20 weeks in hy - line pullets . furthermore , the light increase of mortality rate in soybean based diets compared to control diet was not significant . these results confirmed the efficacy of all the diets . up to 15% of toasted soybean grain up to peak , the laying rate in r15 diet was the highest , with a significant difference from 24 to 26 weeks of age . thus , efficiency of toasted soybean grains based diets was also effective during the first eight laying weeks . the average laying rate was higher than 63.868.4% , 64.0% , and 61.4% recorded respectively with harco and hy - line layers up to 28-week - old . at laying phase , the feed conversion ratio was also lower in r15 diet compared to the control diet . however , the feed conversion ratios were higher than 2.814.07 g feed / g egg . no significant diet effect was found on hens ' mortality . in the second month , the egg weights were significantly higher in r15 diet , but lower than the 60 g reported by other [ 1517 ] . 
in first month , the egg weights were in the range of 41.047.4 g . energy requirement of hens is lower in hot and humid climate than in temperate climate . soybean grains being very energetic , during the latest laying weeks , hens could get fat . that might reduce their laying performance . an evaluation of the laying performance during the whole laying period is therefore relevant for heavy layers breeds fed with whole toasted soybean grains in hot and humid climate . the incorporation of toasted soybean grains in diets reduces feed prices . at growth and laying phases , thus , feeding cost and feed efficiency improved significantly in soybean - based diets during the growth phase of pullets . the feeding of harco pullets with toasted soybean grains diets can be therefore recommended in hot and humid regions . the significant decrease of feeding cost from first to second laying months was due to the increase of laying rate until the peak . this study shows the efficiency of toasted soybean - based diets in harco pullets and hens feeding in hot and humid climate . the toasting processing used in benin to improved soybean grain efficacy in poultry diet is therefore suitable . however , the price of toasted soybean grains should be kept at a level where the energy and protein costs from these grains should be lower than those from other main energy and protein sources like soybean and fish meals .","<S> the aim of this paper was to evaluate the effects of toasted soybean grains on bioeconomic performance of pullets and layer hens in hot and humid environment . </S> <S> a total of 972 three - week - old harco chicks were divided into 12 groups . at starter , </S> <S> pullet and laying phases , birds were fed four diets containing 0% ( r0 ) , 5% ( r5 ) , 10% ( r10 ) , and 15% ( r15 ) of soybean grains . </S> <S> results showed similar feed intake , body weight gain , laying rate , feed conversion ratio , and mortality rate between dietary treatments at each phase . 
</S> <S> the egg weight increased significantly in diet r15 ( p < 0.05 ) . </S> <S> the use of soybean grains reduced the feed prices . </S> <S> feeding cost decreased significantly ( p < 0.05 ) during growth and laying phases in soybean grains added diets . feeds efficiency increased significantly ( p < 0.05 ) with the increase of dietary soybean grains rate . properly toasted soybean grains can be therefore included up to 15% in heavy line layer hens ' diet in tropical conditions . </S>"


The metric is an instance of [`datasets.Metric`](https://huggingface.co/docs/datasets/package_reference/main_classes.html#datasets.Metric):

In [13]:
metric

Metric(name: "rouge", features: {'predictions': Value(dtype='string', id='sequence'), 'references': Value(dtype='string', id='sequence')}, usage: """
Calculates average rouge scores for a list of hypotheses and references
Args:
    predictions: list of predictions to score. Each predictions
        should be a string with tokens separated by spaces.
    references: list of reference for each prediction. Each
        reference should be a string with tokens separated by spaces.
    rouge_types: A list of rouge types to calculate.
        Valid names:
        `"rouge{n}"` (e.g. `"rouge1"`, `"rouge2"`) where: {n} is the n-gram based scoring,
        `"rougeL"`: Longest common subsequence based scoring.
        `"rougeLSum"`: rougeLsum splits text using `"
"`.
        See details in https://github.com/huggingface/datasets/issues/617
    use_stemmer: Bool indicating whether Porter stemmer should be used to strip word suffixes.
    use_agregator: Return aggregates if this is set to True
Retu

You can call its `compute` method with your predictions and labels, which need to be lists of decoded strings:

In [14]:
fake_preds = ["hello there", "general kenobi"]
fake_labels = ["hello there", "general kenobi"]
metric.compute(predictions=fake_preds, references=fake_labels)

{'rouge1': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0)),
 'rouge2': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0)),
 'rougeL': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0)),
 'rougeLsum': AggregateScore(low=Score(precision=1.0, recall=1.0, fmeasure=1.0), mid=Score(precision=1.0, recall=1.0, fmeasure=1.0), high=Score(precision=1.0, recall=1.0, fmeasure=1.0))}
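
To build some intuition for the precision, recall and f-measure values above, here is a minimal sketch of a ROUGE-1 style score in plain Python. It is an illustration only, not the implementation behind `metric.compute`: the helper name `rouge1_sketch` is hypothetical, tokens are split on whitespace, and the stemming and low/mid/high aggregation done by the real metric are left out.

```python
from collections import Counter


def rouge1_sketch(prediction: str, reference: str) -> dict:
    """Unigram-overlap precision/recall/F1 for whitespace-separated tokens.

    Hypothetical helper for illustration; the real metric tokenizes and
    aggregates differently.
    """
    pred_tokens = prediction.split()
    ref_tokens = reference.split()
    # Clipped overlap: a token counts at most as often as it appears in both texts.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    precision = overlap / len(pred_tokens) if pred_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    if precision + recall == 0:
        fmeasure = 0.0
    else:
        fmeasure = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "fmeasure": fmeasure}


# Identical strings give a perfect score, as in the fake example above.
perfect = rouge1_sketch("hello there", "hello there")
# One extra predicted token lowers precision but leaves recall at 1.0.
partial = rouge1_sketch("hello there general", "hello there")
print(perfect)
print(partial)
```

With an extra token in the prediction, precision drops to 2/3 while recall stays at 1.0, which is why summaries that are too long tend to lose precision.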

## Preprocessing the data

Before we can feed those texts to our model, we need to preprocess them. This is done by a 🤗 `Transformers` `Tokenizer`, which will (as the name indicates) tokenize the inputs (including converting the tokens to their corresponding IDs in the pretrained vocabulary) and put them in the format the model expects, as well as generate the other inputs that the model requires.

To do all of this, we instantiate our tokenizer with the `AutoTokenizer.from_pretrained` method, which will ensure:

- we get a tokenizer that corresponds to the model architecture we want to use,
- we download the vocabulary used when pretraining this specific checkpoint.

That vocabulary will be cached, so it's not downloaded again the next time we run the cell.

In [15]:
from transformers import AutoTokenizer
    
tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)

Downloading:   0%|          | 0.00/26.0 [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/1.76k [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/878k [00:00<?, ?B/s]

Downloading:   0%|          | 0.00/446k [00:00<?, ?B/s]

By default, the call above will use one of the fast tokenizers (backed by Rust) from the 🤗 `Tokenizers` library.

You can directly call this tokenizer on one sentence or a pair of sentences:

In [16]:
tokenizer("Hello, this one sentence!")

{'input_ids': [0, 31414, 6, 42, 65, 3645, 328, 2], 'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1]}

Depending on the model you selected, you will see different keys in the dictionary returned by the cell above. They don't matter much for what we're doing here; just know they are required by the model we will instantiate later. You can learn more about them in [this tutorial](https://huggingface.co/transformers/preprocessing.html) if you're interested.

Instead of one sentence, we can pass along a list of sentences:

In [17]:
tokenizer(["Hello, this one sentence!", "This is another sentence."])

{'input_ids': [[0, 31414, 6, 42, 65, 3645, 328, 2], [0, 713, 16, 277, 3645, 4, 2]], 'attention_mask': [[1, 1, 1, 1, 1, 1, 1, 1], [1, 1, 1, 1, 1, 1, 1]]}

To prepare the targets for our model, we need to tokenize them inside the `as_target_tokenizer` context manager. This will make sure the tokenizer uses the special tokens corresponding to the targets:

In [18]:
with tokenizer.as_target_tokenizer():
    print(tokenizer(["Hello, this one sentence!", "This is another sentence."]))

{'input_ids': [[0, 31414, 6, 42, 65, 3645, 328, 2], [0, 713, 16, 277, 3645, 4, 2]], 'attention_mask': [[1, 1, 1, 1, 1, 1, 1, 1], [1, 1, 1, 1, 1, 1, 1]]}


If you are using one of the five T5 checkpoints, you have to prefix the inputs with "summarize:" (the model can also translate, and it needs the prefix to know which task it has to perform).

In [19]:
if model_checkpoint in ["t5-small", "t5-base", "t5-large", "t5-3b", "t5-11b"]:
    prefix = "summarize: "
else:
    prefix = ""

We can then write the function that will preprocess our samples. We just feed them to the `tokenizer` with the argument `truncation=True`. This ensures that an input longer than what the selected model can handle is truncated to the maximum length accepted by the model. The padding will be dealt with later on (in a data collator), so that we pad examples to the longest length in the batch and not in the whole dataset.
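
To make the difference between the two padding strategies concrete, here is a minimal pure-Python sketch. It does not use the tokenizer: the token ids, `PAD_ID` and `MAX_LENGTH` are made-up values for illustration, and the helpers `truncate` and `pad_batch` are hypothetical stand-ins for what truncation and the data collator do.

```python
# Toy token-id sequences of different lengths (made-up values).
dataset = [[5, 6, 7], [5, 6], [5, 6, 7, 8, 9, 10, 11, 12], [5]]
PAD_ID = 1       # assumed padding id, for illustration only
MAX_LENGTH = 6   # small stand-in for max_input_length


def truncate(ids, max_length):
    """Keep at most max_length tokens (special tokens are ignored in this sketch)."""
    return ids[:max_length]


def pad_batch(batch, pad_id):
    """Pad every sequence to the longest sequence in this batch only."""
    longest = max(len(ids) for ids in batch)
    return [ids + [pad_id] * (longest - len(ids)) for ids in batch]


truncated = [truncate(ids, MAX_LENGTH) for ids in dataset]

# Dynamic padding: split into batches of 2 and pad each batch separately.
batches = [truncated[i:i + 2] for i in range(0, len(truncated), 2)]
dynamic = [pad_batch(batch, PAD_ID) for batch in batches]
dynamic_tokens = sum(len(row) for batch in dynamic for row in batch)

# Global padding: pad everything to the longest sequence in the dataset.
global_padded = pad_batch(truncated, PAD_ID)
global_tokens = sum(len(row) for row in global_padded)

print(dynamic_tokens, global_tokens)
```

In this toy case, padding per batch processes fewer tokens than padding to the dataset-wide maximum, which is the reason for deferring padding to the collator.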

The max input length of `sshleifer/distilbart-cnn-6-6` is 1024, so `max_input_length = 1024`.

In [20]:
max_input_length = 1024
max_target_length = 256

def preprocess_function(examples):
    inputs = [prefix + doc for doc in examples["article"]]
    model_inputs = tokenizer(inputs, max_length=max_input_length, truncation=True)

    # Setup the tokenizer for targets
    with tokenizer.as_target_tokenizer():
        labels = tokenizer(examples["abstract"], max_length=max_target_length, truncation=True)

    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

This function works with one or several examples. In the case of several examples, the tokenizer will return a list of lists for each key:

In [21]:
preprocess_function(raw_datasets['train'][:2])

{'input_ids': [[0, 405, 11493, 11, 55, 87, 654, 207, 9, 1484, 8, 189, 1338, 1814, 207, 11, 1402, 3505, 9, 16640, 2156, 941, 11, 1484, 11793, 17930, 8, 73, 368, 13785, 5804, 4, 134, 41, 23249, 16, 6533, 25, 41, 15650, 17215, 672, 9, 23385, 43202, 36, 1368, 428, 4839, 36, 1368, 428, 28696, 316, 821, 1589, 385, 462, 4839, 8, 189, 16072, 25, 10, 898, 9, 5, 7482, 2199, 2156, 13162, 2156, 2129, 10894, 2156, 17930, 2156, 50, 13785, 5804, 479, 6104, 3218, 3608, 14, 7967, 8, 18327, 139, 111, 2174, 797, 71, 13785, 5804, 2156, 941, 11, 471, 8, 5397, 16640, 2156, 189, 28, 13969, 30, 41, 23249, 4, 1978, 41, 23249, 747, 41089, 1290, 5298, 215, 25, 16069, 2156, 8269, 2156, 8, 25599, 642, 22423, 2156, 8, 4634, 189, 33, 10, 2430, 1683, 15, 1318, 9, 301, 36, 2231, 1168, 4839, 8, 819, 2194, 11, 1484, 19, 1668, 479, 4634, 2156, 7, 1477, 2166, 13838, 2156, 2231, 1168, 2156, 8, 17618, 32444, 11, 1484, 19, 1668, 2156, 24, 74, 28, 5701, 7, 185, 10, 16300, 1548, 11, 9397, 9883, 54, 240, 1416, 13, 1668, 111, 30

To apply this function to all the examples in our dataset, we just use the `map` method of the `raw_datasets` object we created earlier. This will apply the function to all the elements of all the splits in `raw_datasets`, so our training, validation and testing data will be preprocessed in a single command.

In [22]:
tokenized_datasets = raw_datasets.map(preprocess_function, batched=True)

  0%|          | 0/8 [00:00<?, ?ba/s]

  0%|          | 0/2 [00:00<?, ?ba/s]

  0%|          | 0/2 [00:00<?, ?ba/s]

Even better, the results are automatically cached by the 🤗 `Datasets` library to avoid spending time on this step the next time you run your notebook. The 🤗 `Datasets` library is normally smart enough to detect when the function you pass to `map` has changed (and thus that the cached data should not be reused). For instance, it will properly detect if you change the task in the first cell and rerun the notebook. 🤗 `Datasets` warns you when it uses cached files; you can pass `load_from_cache_file=False` in the call to `map` to ignore the cached files and force the preprocessing to be applied again.

Note that we passed `batched=True` to encode the texts by batches together. This is to leverage the full benefit of the fast tokenizer we loaded earlier, which will use multi-threading to treat the texts in a batch concurrently.
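
As a rough illustration of what `batched=True` changes, the sketch below mimics the calling convention in pure Python: the function receives a dictionary of lists (one list per column) instead of a single example. `batched_map` and `toy_preprocess` are hypothetical helpers written for this illustration; this is not how 🤗 `Datasets` is implemented internally.

```python
def batched_map(rows, fn, batch_size=1000):
    """Simplified stand-in for map(..., batched=True): call fn on dicts of lists."""
    out = []
    for start in range(0, len(rows), batch_size):
        chunk = rows[start:start + batch_size]
        # Turn a list of row dicts into one dict of column lists.
        batch = {key: [row[key] for row in chunk] for key in chunk[0]}
        # fn returns new columns, each a list with one entry per example.
        batch.update(fn(batch))
        # Split the column lists back into one dict per row.
        out.extend({key: batch[key][i] for key in batch} for i in range(len(chunk)))
    return out


def toy_preprocess(examples):
    # One call handles the whole batch: lists in, lists out.
    return {"n_words": [len(text.split()) for text in examples["article"]]}


rows = [{"article": "a b c"}, {"article": "d e"}, {"article": "f"}]
mapped = batched_map(rows, toy_preprocess, batch_size=2)
print(mapped)
```

This is also why `preprocess_function` above iterates over `examples["article"]`: with batching, that key holds a list of documents rather than a single string.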

## Fine-tuning the model

Now that our data is ready, we can download the pretrained model and fine-tune it. Since our task is of the sequence-to-sequence kind, we use the `AutoModelForSeq2SeqLM` class. Like with the tokenizer, the `from_pretrained` method will download and cache the model for us.

In [23]:
from transformers import AutoModelForSeq2SeqLM, DataCollatorForSeq2Seq, Seq2SeqTrainingArguments, Seq2SeqTrainer

model = AutoModelForSeq2SeqLM.from_pretrained(model_checkpoint)

Downloading:   0%|          | 0.00/439M [00:00<?, ?B/s]

Note that we don't get a warning like in our classification example. This means we used all the weights of the pretrained model and there is no randomly initialized head in this case.

To instantiate a `Seq2SeqTrainer`, we will need to define three more things. The most important is the [`Seq2SeqTrainingArguments`](https://huggingface.co/transformers/main_classes/trainer.html#transformers.Seq2SeqTrainingArguments), which is a class that contains all the attributes to customize the training. It requires one folder name, which will be used to save the checkpoints of the model, and all other arguments are optional:

In [24]:
batch_size = 2
model_name = model_checkpoint.split("/")[-1]
args = Seq2SeqTrainingArguments(
    f"{model_name}-finetuned-pubmed",
    evaluation_strategy = "epoch",
    learning_rate=2e-5,
    per_device_train_batch_size=batch_size,
    per_device_eval_batch_size=batch_size,
    weight_decay=0.01,
    save_total_limit=3,
    num_train_epochs=5,
    predict_with_generate=True,
    fp16=True,
    push_to_hub=True,
    seed = 42,
)

Here we set the evaluation to be done at the end of each epoch, tweak the learning rate, use the `batch_size` defined at the top of the cell and customize the weight decay. Since the `Seq2SeqTrainer` will save the model regularly and our dataset is quite large, we tell it to make three saves maximum. Lastly, we use the `predict_with_generate` option (to properly generate summaries) and activate mixed precision training (to go a bit faster).

The `push_to_hub=True` argument sets everything up so we can push the model to the [Hub](https://huggingface.co/models) regularly during training. Remove it if you didn't follow the installation steps at the top of the notebook. If you want to save your model locally under a name that is different from the name of the repository it will be pushed to, or if you want to push your model under an organization rather than your own namespace, use the `hub_model_id` argument to set the repo name (it needs to be the full name, including your namespace: for instance `"sgugger/t5-finetuned-xsum"` or `"huggingface/t5-finetuned-xsum"`).
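
The naming logic can be traced with plain string operations. In the sketch below, `namespace` is a placeholder value, and the resulting `hub_model_id` string is what you would pass to the `hub_model_id` argument if you wanted to set the repository name explicitly.

```python
model_checkpoint = "sshleifer/distilbart-cnn-6-6"

# Drop the namespace of the original checkpoint, keep only the model name.
model_name = model_checkpoint.split("/")[-1]

# Local output folder; by default it is also used as the repository name.
output_dir = f"{model_name}-finetuned-pubmed"

# Full repository name: namespace plus repo name (placeholder namespace).
namespace = "your-username"
hub_model_id = f"{namespace}/{output_dir}"

print(output_dir)
print(hub_model_id)
```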

Then, we need a special kind of data collator, which will not only pad the inputs to the maximum length in the batch, but also the labels:

In [25]:
data_collator = DataCollatorForSeq2Seq(tokenizer, model=model)
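
To show what "padding the inputs and the labels" means, here is a simplified pure-Python sketch of the idea. The `collate` function is a hypothetical stand-in, not the library's implementation: it assumes an input padding id of 1, uses -100 for label padding (the value that `compute_metrics` later replaces before decoding), and leaves out anything else the real collator may do, such as preparing decoder inputs from the model. The label ids are made up.

```python
LABEL_PAD_ID = -100  # label positions with this value are ignored by the loss
PAD_ID = 1           # assumed input padding id, for illustration only


def collate(features, pad_id=PAD_ID, label_pad_id=LABEL_PAD_ID):
    """Pad inputs and labels to the longest sequence in this batch."""
    max_in = max(len(f["input_ids"]) for f in features)
    max_lab = max(len(f["labels"]) for f in features)
    batch = {"input_ids": [], "attention_mask": [], "labels": []}
    for f in features:
        n_in = max_in - len(f["input_ids"])
        n_lab = max_lab - len(f["labels"])
        batch["input_ids"].append(f["input_ids"] + [pad_id] * n_in)
        # The attention mask is 1 for real tokens and 0 for padding.
        batch["attention_mask"].append([1] * len(f["input_ids"]) + [0] * n_in)
        batch["labels"].append(f["labels"] + [label_pad_id] * n_lab)
    return batch


features = [
    {"input_ids": [0, 31414, 6, 42, 65, 3645, 328, 2], "labels": [0, 9, 2]},
    {"input_ids": [0, 713, 16, 277, 3645, 4, 2], "labels": [0, 9, 9, 9, 2]},
]
batch = collate(features)
print(batch["input_ids"][1])
print(batch["labels"][0])
```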

The last thing to define for our `Seq2SeqTrainer` is how to compute the metrics from the predictions. We need to define a function for this, which will just use the `metric` we loaded earlier, and we have to do a bit of pre-processing to decode the predictions into texts:

In [26]:
import nltk
import numpy as np

def compute_metrics(eval_pred):
    predictions, labels = eval_pred
    decoded_preds = tokenizer.batch_decode(predictions, skip_special_tokens=True)
    # Replace -100 in the labels as we can't decode them.
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    decoded_labels = tokenizer.batch_decode(labels, skip_special_tokens=True)
    
    # Rouge expects a newline after each sentence
    decoded_preds = ["\n".join(nltk.sent_tokenize(pred.strip())) for pred in decoded_preds]
    decoded_labels = ["\n".join(nltk.sent_tokenize(label.strip())) for label in decoded_labels]
    
    result = metric.compute(predictions=decoded_preds, references=decoded_labels, use_stemmer=True)
    # Extract a few results
    result = {key: value.mid.fmeasure * 100 for key, value in result.items()}
    
    # Add mean generated length
    prediction_lens = [np.count_nonzero(pred != tokenizer.pad_token_id) for pred in predictions]
    result["gen_len"] = np.mean(prediction_lens)
    
    return {k: round(v, 4) for k, v in result.items()}
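
For intuition about two of the steps above, here is a small pure-Python sketch. `replace_ignored` mirrors the `np.where` call on plain lists, and `rouge1_f` computes a simplified ROUGE-1 F-measure for a single prediction/reference pair. Both helpers are hypothetical: the real metric also applies stemming, its own tokenization and aggregation over the whole evaluation set, so the numbers will not match it exactly.

```python
from collections import Counter


def replace_ignored(labels, pad_id, ignore_id=-100):
    """Swap the ignore index for the pad id so the labels can be decoded."""
    return [[pad_id if tok == ignore_id else tok for tok in row] for row in labels]


def rouge1_f(prediction, reference):
    """Simplified ROUGE-1 F-measure: unigram overlap on whitespace tokens."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    # Clipped overlap: each token counts at most as often as in both texts.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


cleaned = replace_ignored([[0, 9, 2, -100, -100]], pad_id=1)
score = rouge1_f("the cat sat on the mat", "the cat lay on the mat")
print(cleaned)
print(round(score * 100, 4))
```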

Then we just need to pass all of this along with our datasets to the `Seq2SeqTrainer`:

In [27]:
trainer = Seq2SeqTrainer(
    model,
    args,
    train_dataset=tokenized_datasets["train"],
    eval_dataset=tokenized_datasets["validation"],
    data_collator=data_collator,
    tokenizer=tokenizer,
    compute_metrics=compute_metrics
)

Cloning https://huggingface.co/Kevincp560/distilbart-cnn-6-6-finetuned-pubmed into local empty directory.
Using amp half precision backend


We can now fine-tune our model by just calling the `train` method:

In [28]:
trainer.train()

The following columns in the training set  don't have a corresponding argument in `BartForConditionalGeneration.forward` and have been ignored: abstract, article. If abstract, article are not expected by `BartForConditionalGeneration.forward`,  you can safely ignore this message.
***** Running training *****
  Num examples = 8000
  Num Epochs = 5
  Instantaneous batch size per device = 2
  Total train batch size (w. parallel, distributed & accumulation) = 2
  Gradient Accumulation steps = 1
  Total optimization steps = 20000


Epoch,Training Loss,Validation Loss,Rouge1,Rouge2,Rougel,Rougelsum,Gen Len
1,2.2215,2.078066,37.2476,14.2852,22.6875,33.1607,141.97
2,2.0105,2.021676,37.8038,14.7869,23.2025,33.7069,141.918
3,1.8331,2.02426,39.0497,15.8077,24.2237,34.9371,141.822
4,1.6936,2.048705,38.7059,15.4364,23.8514,34.7771,141.878
5,1.5817,2.064848,39.2769,15.876,24.2306,35.267,141.8565


Saving model checkpoint to distilbart-cnn-6-6-finetuned-pubmed/checkpoint-500
Configuration saved in distilbart-cnn-6-6-finetuned-pubmed/checkpoint-500/config.json
Model weights saved in distilbart-cnn-6-6-finetuned-pubmed/checkpoint-500/pytorch_model.bin
tokenizer config file saved in distilbart-cnn-6-6-finetuned-pubmed/checkpoint-500/tokenizer_config.json
Special tokens file saved in distilbart-cnn-6-6-finetuned-pubmed/checkpoint-500/special_tokens_map.json
tokenizer config file saved in distilbart-cnn-6-6-finetuned-pubmed/tokenizer_config.json
Special tokens file saved in distilbart-cnn-6-6-finetuned-pubmed/special_tokens_map.json
Saving model checkpoint to distilbart-cnn-6-6-finetuned-pubmed/checkpoint-1000
Configuration saved in distilbart-cnn-6-6-finetuned-pubmed/checkpoint-1000/config.json
Model weights saved in distilbart-cnn-6-6-finetuned-pubmed/checkpoint-1000/pytorch_model.bin
tokenizer config file saved in distilbart-cnn-6-6-finetuned-pubmed/checkpoint-1000/tokenizer_config

TrainOutput(global_step=20000, training_loss=1.8993048187255859, metrics={'train_runtime': 17182.6976, 'train_samples_per_second': 2.328, 'train_steps_per_second': 1.164, 'total_flos': 4.32583651196928e+16, 'train_loss': 1.8993048187255859, 'epoch': 5.0})
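
The figures in the log and in `TrainOutput` can be checked by hand. The sketch below redoes the arithmetic from the values printed above, assuming a single device (which is what the reported total batch size of 2 suggests). The checkpoint interval of 500 steps is inferred from the `checkpoint-500` and `checkpoint-1000` folder names in the log.

```python
import math

# Values copied from the training log above.
num_examples = 8000
batch_size = 2
grad_accum_steps = 1
num_epochs = 5
train_runtime = 17182.6976  # seconds

steps_per_epoch = math.ceil(num_examples / (batch_size * grad_accum_steps))
total_steps = steps_per_epoch * num_epochs

steps_per_second = round(total_steps / train_runtime, 3)
samples_per_second = round(num_examples * num_epochs / train_runtime, 3)

# Assumed save interval, inferred from the checkpoint folder names.
save_steps = 500
checkpoints_written = total_steps // save_steps

print(total_steps, steps_per_second, samples_per_second, checkpoints_written)
```

With `save_total_limit=3`, only the three most recent of these checkpoints are kept on disk at any time.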

You can now upload the result of the training to the Hub by executing this instruction:

In [29]:
trainer.push_to_hub()

Saving model checkpoint to distilbart-cnn-6-6-finetuned-pubmed
Configuration saved in distilbart-cnn-6-6-finetuned-pubmed/config.json
Model weights saved in distilbart-cnn-6-6-finetuned-pubmed/pytorch_model.bin
tokenizer config file saved in distilbart-cnn-6-6-finetuned-pubmed/tokenizer_config.json
Special tokens file saved in distilbart-cnn-6-6-finetuned-pubmed/special_tokens_map.json
Several commits (2) will be pushed upstream.
The progress bars may be unreliable.


Upload file pytorch_model.bin:   0%|          | 3.37k/877M [00:00<?, ?B/s]

Upload file runs/Mar04_12-48-38_a11fcb1b1ace/events.out.tfevents.1646398165.a11fcb1b1ace.77.0:  25%|##4       …

To https://huggingface.co/Kevincp560/distilbart-cnn-6-6-finetuned-pubmed
   7599b30..1d44d32  main -> main

To https://huggingface.co/Kevincp560/distilbart-cnn-6-6-finetuned-pubmed
   1d44d32..e0ae1c7  main -> main



'https://huggingface.co/Kevincp560/distilbart-cnn-6-6-finetuned-pubmed/commit/1d44d3207702fe480ff1ac485db7373a44f23de3'

You can now share this model with all your friends, family, and favorite pets: they can all load it with the identifier `"your-username/the-name-you-picked"`, for instance:

```python
from transformers import AutoModelForSeq2SeqLM

model = AutoModelForSeq2SeqLM.from_pretrained("sgugger/my-awesome-model")
```