Skip to content

Topic modelling and Text summarization project for the Text mining and search exam, Data Science Msc @ University of Milan Bicocca

License

Notifications You must be signed in to change notification settings

grusso98/Text-Mining

Repository files navigation

alt text

Text-Mining

Topic modelling and Text summarization project for the Text mining and search exam, Data Science Msc @ University of Milan Bicocca

NIPS Papers: Topic Modelling and Text Summarization

Authors:

Gaetano Chiriaco

Riccardo Porcedda

Gianmarco Russo



MSc Data Science

How to navigate the repo

WARNING:The original work was conducted in a different folder. Before executing the notebooks, please, modify all paths in the import.

  • Papers_Preproc.ipynb: contains all papers preprocessing and feature engineering described in the report

  • LDA.ipynb: Topic modeling with Latent Dirichlet Allocation

  • PLSA.ipynb: Topic modeling with probabilistic Latent Semantic Analysis

  • DocSignal.ipynb: Our 1D U-net Text Summarizer as described in the report

  • Sumy.ipynb: Text Summarization with Sumy TextRank for comparison

About

Topic modelling and Text summarization project for the Text mining and search exam, Data Science Msc @ University of Milan Bicocca

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published