xls2csv #1480

ashishpriyadarshiCIC · 2020-06-15T20:19:13Z

xls to CSV conversion function

apoorvaeternity · 2020-06-24T14:18:56Z

retriever/lib/engine_tools.py

+    dire = book.rstrip(".xlsx")
+    if not os.path.exists(dire):
+        os.makedirs(dire)
+    else:


Is this else required?

ashishpriyadarshiCIC · 2020-06-26

I was using it only to avoid the "path already exists" error. I'll remove it and pass exist_ok=True to os.makedirs() instead.
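
A minimal sketch of the change discussed here, assuming the output directory is named after the workbook file without its extension. The helper name `make_output_dir` is hypothetical and not part of the PR. The sketch also swaps `rstrip(".xlsx")` for `os.path.splitext`, because `str.rstrip` removes a set of trailing characters rather than a suffix.

```python
import os


def make_output_dir(book):
    """Create (if needed) a directory named after the workbook, minus its extension."""
    # splitext removes only the final extension; rstrip(".xlsx") would strip
    # any trailing '.', 'x', 'l' or 's' characters, so "sales.xlsx" becomes "sale".
    dire, _ext = os.path.splitext(book)
    # exist_ok=True makes the call a no-op when the directory already exists,
    # so neither the os.path.exists() check nor the empty else branch is needed.
    os.makedirs(dire, exist_ok=True)
    return dire
```

Calling the helper twice with the same path is safe, which is the behaviour the original `if`/`else` was trying to achieve.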

apoorvaeternity · 2020-06-24T14:21:10Z

retriever/lib/engine_tools.py

+        pass
+    os.chdir(dire)
+    res = len(workbook.sheet_names())
+    for sheet in range(0, res):


I think we can omit the 0 and just use range(res).
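
As a small illustration of this suggestion: `range(n)` already starts at 0, so the explicit start is redundant. The list below is a hypothetical stand-in for `workbook.sheet_names()`; `enumerate` is shown as one possible alternative when both the position and the name are needed.

```python
# Hypothetical stand-in for workbook.sheet_names()
sheet_names = ["2019", "2020", "summary"]
res = len(sheet_names)

# range(res) and range(0, res) produce the same indices.
same_indices = list(range(0, res)) == list(range(res))

# enumerate yields (position, name) pairs without manual indexing.
indexed = [(position, name) for position, name in enumerate(sheet_names)]
```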

apoorvaeternity · 2020-06-24T14:21:54Z

retriever/lib/engine_tools.py

+        for index in range(worksheet.nrows):
+            df.loc[len(df)] = worksheet.row_values(1)
+        table_name = workbook.sheet_names()
+        df.to_csv(table_name[sheet] + '.csv', index_label='index')


We can make use of Python f-strings here.
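
For reference, a sketch of what the f-string form of the file name could look like. The sheet names here are made up for illustration; in the PR they come from `workbook.sheet_names()`.

```python
# Hypothetical sheet names; the PR obtains these from workbook.sheet_names()
table_name = ["plots", "species"]
sheet = 1

# String concatenation, as in the diff
concatenated = table_name[sheet] + '.csv'

# Equivalent f-string
formatted = f"{table_name[sheet]}.csv"
```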

apoorvaeternity · 2020-06-24T14:25:00Z

retriever/lib/engine_tools.py

+            df.loc[len(df)] = worksheet.row_values(1)
+        table_name = workbook.sheet_names()
+        df.to_csv(table_name[sheet] + '.csv', index_label='index')
+    os.chdir("..")


Is there a cleaner way to do this? I am not sure whether this will always be compatible across operating systems; it may be. It would be great if we could pass an absolute path to os.chdir() instead.
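
One possible way to address this is to avoid `os.chdir()` altogether and build each output path explicitly, so the process working directory never changes. The sketch below is an assumption about how that could look, not the code from the PR: `write_sheets` is a hypothetical helper, and the `sheets` mapping of sheet name to rows stands in for the data read from the workbook. It uses only the standard library `csv` module.

```python
import csv
import os


def write_sheets(out_dir, sheets):
    """Write each sheet's rows to <out_dir>/<sheet name>.csv and return the paths."""
    # Resolve to an absolute path once, so results do not depend on the
    # current working directory later on.
    out_dir = os.path.abspath(out_dir)
    os.makedirs(out_dir, exist_ok=True)
    written = []
    for name, rows in sheets.items():
        # os.path.join uses the correct separator for the host OS.
        path = os.path.join(out_dir, f"{name}.csv")
        # newline="" lets the csv module control line endings itself.
        with open(path, "w", newline="") as handle:
            csv.writer(handle).writerows(rows)
        written.append(path)
    return written
```

Because no directory change happens, there is nothing to undo with `os.chdir("..")`, and an exception partway through cannot leave the caller in an unexpected directory.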

apoorvaeternity · 2020-07-17T08:57:39Z

@ashishpriyadarshiCIC Any updates on this?

apoorvaeternity · 2020-08-24T14:24:47Z

retriever/lib/engine_tools.py

@@ -162,6 +164,21 @@ def xml2csv(input_file, outputfile=None, header_values=None, row_tag="row"):
    return outputfile


+def xlsxcsv(book):


The name should be xlsx2csv to be consistent with the other functions, such as xml2csv. It would also be worth adding some comments to the code here.

ashishpriyadarshiCIC · 2020-08-24T16:46:22Z

Hi @apoorvaeternity, I have moved all the work on the XLSX-to-CSV conversion function to #1506.

henrykironde · 2020-08-26T05:08:17Z

Ref: #1506

conversion code with fixed style

0bd0424

ashishpriyadarshiCIC force-pushed the xls2csv branch from e6c54e6 to 0bd0424 on June 15, 2020 21:54

henrykironde requested a review from apoorvaeternity June 22, 2020 12:46

apoorvaeternity reviewed Jun 24, 2020

ashishpriyadarshiCIC added 2 commits July 24, 2020 23:10

suggestion applied

5543f77

suggestions applied

cd6427a

apoorvaeternity reviewed Aug 24, 2020

henrykironde closed this Aug 26, 2020
