Introduction

This project is for a competition on identifying privacy-sensitive spans in doctor-patient dialogues.

Model Structure

The doctor-patient dialogue (split by speaker) is fed through a language model.
Each word embedding then goes through two separate feed-forward networks, which predict a BIO type and a privacy type.
- The BIO type determines which spans are output.
- The privacy type determines which category of privacy-sensitive information is assigned to each output span.
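
As an illustration of how the two predictions could be combined, here is a minimal sketch in plain Python. It is not taken from this repository: the function name `decode_spans`, the label strings (`"location"`, `"time"`, `"none"`), and the rule that a span takes the privacy type predicted for its first word are all assumptions made for the example.

```python
from typing import List, Tuple


def decode_spans(
    tokens: List[str], bio_tags: List[str], type_tags: List[str]
) -> List[Tuple[str, str]]:
    """Merge per-word BIO tags and privacy types into (text, type) spans.

    Assumption: a span's privacy type is the type predicted at its "B" word.
    Assumption: an "I" tag with no open span is ignored.
    """
    bounds = []   # (start, end, privacy_type), with end exclusive
    start = None  # index of the "B" word of the currently open span

    for i, tag in enumerate(bio_tags):
        if tag == "B":
            # A new span begins; close the one that was open, if any.
            if start is not None:
                bounds.append((start, i, type_tags[start]))
            start = i
        elif tag == "I":
            # Continue the open span; a stray "I" is skipped.
            continue
        else:
            # "O" closes the open span, if any.
            if start is not None:
                bounds.append((start, i, type_tags[start]))
                start = None

    # Close a span that runs to the end of the sequence.
    if start is not None:
        bounds.append((start, len(bio_tags), type_tags[start]))

    return [(" ".join(tokens[s:e]), t) for s, e, t in bounds]


# Hypothetical example sentence and predictions.
tokens = ["I", "live", "in", "Taipei", "City", "since", "2010"]
bio_tags = ["O", "O", "O", "B", "I", "O", "B"]
type_tags = ["none", "none", "none", "location", "location", "none", "time"]

print(decode_spans(tokens, bio_tags, type_tags))
# [('Taipei City', 'location'), ('2010', 'time')]
```

Other ways of choosing the span type are possible, for example a majority vote over the words in the span; the first-word rule is used here only because it is the simplest.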

Thanks to @eric88525 for the data preprocessing (preprocess_to_json.py) and to @HongYun0901 for pretraining the language model.
