
Pre-processing Text Data in Multiple Languages (tested in Nepali).


acharyabi/Text-Data-Cleaning


This code can be used to filter text data using regex and basic data frame techniques. In our project, Nepali Paraphrasing Using Transformer, we used it for the following on our custom dataset:
- Metrics-based filtering
- Removal of English text and numerals (the whole row is removed; regex can instead be used to strip them from a sentence without removing the row)
- Removal of hyperlinks and unwanted characters
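
The three steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it assumes pandas, a hypothetical column named `sentence`, a word-count range as the "metric", and hand-picked regex patterns. The names `strip_urls`, `remove_unwanted`, `filter_rows`, `min_words`, and `max_words` are made up for this example.

```python
import re

import pandas as pd

# Assumed patterns for illustration; adjust them to your own dataset.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
# ASCII letters and ASCII digits (Devanagari digits are NOT matched here).
LATIN_OR_DIGIT_RE = re.compile(r"[A-Za-z0-9]")
# Anything outside the Devanagari block (U+0900-U+097F), whitespace,
# ZWNJ/ZWJ joiners, and a few punctuation marks counts as unwanted.
UNWANTED_RE = re.compile(r"[^\u0900-\u097F\u200c\u200d\s?!,.\-]")


def strip_urls(text: str) -> str:
    """Replace hyperlinks with a space."""
    return URL_RE.sub(" ", text)


def remove_unwanted(text: str) -> str:
    """Drop unwanted characters, then collapse repeated whitespace."""
    text = UNWANTED_RE.sub(" ", text)
    return re.sub(r"\s+", " ", text).strip()


def filter_rows(df: pd.DataFrame, column: str = "sentence",
                min_words: int = 3, max_words: int = 50) -> pd.DataFrame:
    """Clean one text column and drop rows that fail the filters."""
    out = df.copy()
    # 1. Remove hyperlinks first, so a URL alone does not discard a row.
    out[column] = out[column].astype(str).map(strip_urls)
    # 2. Drop rows that still contain English letters or ASCII digits.
    has_latin = out[column].map(lambda s: bool(LATIN_OR_DIGIT_RE.search(s)))
    out = out[~has_latin].copy()
    # 3. Remove unwanted characters from the remaining sentences.
    out[column] = out[column].map(remove_unwanted)
    # 4. Metric-based filter: keep sentences within the word-count range.
    n_words = out[column].str.split().str.len()
    out = out[n_words.between(min_words, max_words)]
    return out.reset_index(drop=True)


df = pd.DataFrame({"sentence": [
    "म घर जान्छु ।",
    "यो hello हो",                            # English text: row dropped
    "थप जानकारी यहाँ छ https://example.com",  # link stripped, row kept
    "ठीक छ",                                  # too short: row dropped
    "मेरो उमेर 25 वर्ष हो",                    # digits: row dropped
]})
cleaned = filter_rows(df)
print(cleaned)
```

To strip English text and digits from a sentence instead of dropping its row, skip step 2 and rely on `remove_unwanted`, which already discards every character outside the allowed set.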

Prerequisites:
- Basic knowledge of Python, data frames, and regex.

Hope this helps in your project.

Regards,
Abinash.

Contact: acharyabinash@gmail.com
