Email Datasets can be found here
Updated Jan 21, 2020 (Python)
The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
The fraud identification models were built using Python's scikit-learn machine-learning library.
A project on Extract-Transform-Load (ETL) operations performed on the emails from the infamous Enron corpus.
Enron Email Analysis
📧 A data engineering exercise
Identifying and cleaning the outliers of the Enron Dataset.
LT2212 V20 Assignment 3: Same-author classification via feed-forward neural networks. Transforms email text (Enron) into a machine-readable representation and builds a classifier that determines whether two texts were written by the same person.
Machine learning algorithms used to identify people possibly involved in the Enron fraud (Udacity project).
Predict whether an individual is a person of interest based on their Enron emails.