Skip to content

generateNscore/excelcleaner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 

Repository files navigation

excelcleaner

What is it?

A package that helps you clean and recover columns of Excel data converted from PDF files.

Where to get it

pip install excelcleaner

Dependencies

Changes

  • Version 0.0.4
  • import excelcleaner as xl
    
    xlpath='(excel file name with extension)'
    xl.sheet(xlpath)
    • A tksheet window will be opened with the contents of excel file.
    • Here are recommending orders of actions to try:
      1. Remove unwanted rows (F7)
      2. Remove unwanted columns (F7) or combine columns (F5)
      3. Click a cell to insert (F1) or delete the cell (F3)

    • If the same pattern or kind of selected rows/columns is found at other locations, the same action is repeated for the found rows/columns.
      • Thus, it would better try one action at the top
      • To remove rows, click the row names.
      • To remove or combine columns, click the column names.

    • After every action, the contents will be saved to a file that is the original name with a string "_cleaned".
    • When a cell is inserted, all the cells at right will be shifted to the right and an extra column will be created with empty for all other rows that are not changed.
    • When a cell is deleted, all the cells at right will be shifted to the left and the cell at the last column will be empty.
    • If one action is done by mistake, press function-key F12 to restore the the data before he last action

    • After each action is completed,
      • the number of columns and rows and the number of patterns of rows will be displayed
      • either a few columns or muliple rows are highlighted, which are indicative of ok to be deleted.

About

A package that helps you recover columns of Excel data converted from PDF files.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published