A package that helps you clean and recover columns of Excel data converted from PDF files.
pip install excelcleaner
- Version 0.0.4
- A tksheet window opens with the contents of the Excel file.
- Recommended order of actions to try:
- Remove unwanted rows (F7)
- Remove unwanted columns (F7) or combine columns (F5)
- Click a cell, then insert a new cell (F1) or delete the clicked cell (F3)
- If rows or columns with the same pattern or kind as the selected ones are found at other locations, the same action is repeated for them.
- It is therefore best to try each action once, near the top of the sheet.
- To remove rows, click the row names.
- To remove or combine columns, click the column names.
- After every action, the contents are saved to a file named after the original file with the string "_cleaned" added.
- When a cell is inserted, all the cells to its right are shifted to the right, and an extra column is created that is left empty for all the rows that are not changed.
- When a cell is deleted, all the cells to its right are shifted to the left, and the cell in the last column becomes empty.
- If an action is done by mistake, press the function key F12 to restore the data as it was before the last action.
- After each action is completed,
- the number of columns and rows and the number of patterns of rows will be displayed
- either a few columns or multiple rows are highlighted, indicating that they are probably safe to delete.
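import excelcleaner as xl

# Path to the Excel file to clean, including its extension
xlpath = '(excel file name with extension)'
# Open the file in a tksheet window for interactive cleaning
xl.sheet(xlpath)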
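
The cell insert (F1) and delete (F3) behaviour described above can be illustrated with a minimal sketch on a plain list of lists. This is not the package's own code: `insert_cell` and `delete_cell` are hypothetical helper names, the table is assumed to be rectangular, and the clicked cell is assumed to move right together with the cells after it when a cell is inserted.

```python
def insert_cell(rows, r, c, value=""):
    """Insert a cell at (r, c) in a copy of rows.

    Row r is shifted right from column c onward. Every other row gets
    an empty cell appended so the table stays rectangular.
    (Hypothetical helper, not part of excelcleaner.)
    """
    out = [list(row) for row in rows]
    out[r].insert(c, value)
    for i, row in enumerate(out):
        if i != r:
            row.append("")
    return out


def delete_cell(rows, r, c):
    """Delete the cell at (r, c) in a copy of rows.

    The cells to its right shift left, and the last column of row r
    becomes empty. (Hypothetical helper, not part of excelcleaner.)
    """
    out = [list(row) for row in rows]
    del out[r][c]
    out[r].append("")
    return out


table = [["a", "b", "c"],
         ["d", "e", "f"]]

inserted = insert_cell(table, 0, 1)
deleted = delete_cell(table, 1, 0)
print(inserted)
print(deleted)
```

In this sketch an insert widens the table by one column, while a delete keeps the width and leaves an empty cell at the end of the affected row, matching the description in the list above.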