sai4july/CusText

Introduction

This repo contains the source code for "A Customized Text Sanitization Mechanism with Differential Privacy" (accepted to the Findings of ACL 2023).

How to get the MedSTS dataset?

The MedSTS dataset comes from the paper "MedSTS: A Resource for Clinical Semantic Textual Similarity" (https://arxiv.org/pdf/1808.09397.pdf).
Because it is a medical dataset, it cannot be downloaded directly. Contact the first author of the paper to request access.

How to run CusText?

python main.py \
    --dataset sst2 \
    --eps 1.0 \
    --top_k 20 \
    --embedding_type ct_vectors

How to run CusText+?

python main.py \
    --dataset sst2 \
    --eps 1.0 \
    --top_k 20 \
    --embedding_type ct_vectors \
    --save_stop_words True
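
To repeat these runs with several privacy budgets, the commands can be assembled programmatically. The following is a minimal sketch, not part of the repository: the helper name `build_command` and the list of `eps` values are hypothetical, and it uses only the flags shown in this README. It prints the commands rather than running them.

```python
import shlex


def build_command(dataset, eps, top_k, embedding_type, save_stop_words=False):
    """Build the argument list for one main.py run (hypothetical helper).

    Only the flags documented in this README are used. Passing
    save_stop_words=True appends the extra flag from the CusText+ command.
    """
    cmd = [
        "python", "main.py",
        "--dataset", str(dataset),
        "--eps", str(eps),
        "--top_k", str(top_k),
        "--embedding_type", str(embedding_type),
    ]
    if save_stop_words:
        cmd += ["--save_stop_words", "True"]
    return cmd


if __name__ == "__main__":
    # The eps values below are illustrative assumptions, not recommended settings.
    for eps in (1.0, 2.0, 3.0):
        for plus in (False, True):  # False: CusText, True: CusText+
            cmd = build_command("sst2", eps, 20, "ct_vectors", save_stop_words=plus)
            # Print a shell-ready command; to execute it instead, pass `cmd`
            # to subprocess.run(cmd, check=True) from the repository root.
            print(shlex.join(cmd))
```

Returning a list instead of a single string keeps each argument separate, so the same value can be printed for inspection or passed to `subprocess.run` without shell quoting issues.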
