Fixes for records() speedup #67

Closed · wants to merge 4 commits
Conversation

karimbahgat (Collaborator)

This fixes the issues and doctest failures introduced by the records() speedup in #62; see issue #66.

The problem was that I assumed all row values were unpacked with the correct format, but they are actually unpacked as strings and have to be manually parsed into their appropriate types. I also forgot to account for the DeleteFlag field when grouping the flat row list into rows.

To fix these issues, this PR moves the row value type parsing into a separate method and calls it where necessary.
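
To illustrate both problems, here is a minimal standalone sketch (the field tuples, sample data, and helper name are illustrative, not the PR's actual code): raw DBF values arrive as strings and must be converted per field type, and each stored row carries one extra deletion-flag value that must be skipped when grouping:

    def parse_value(field_type, raw):
        """Convert a raw DBF string value into a Python type (sketch)."""
        raw = raw.strip()
        if field_type in ("N", "F"):        # numeric/float fields
            if not raw:
                return None
            return float(raw) if "." in raw else int(raw)
        if field_type == "L":               # logical field
            return raw in ("T", "t", "Y", "y")
        return raw                          # character fields stay as strings

    # Simplified (name, type) descriptors; the flat list as read from the
    # file has one deletion-flag value in front of each row's field values.
    fields = [("NAME", "C"), ("POP", "N")]
    flat = [" ", "Springfield", " 30720", " ", "Shelbyville", " 12650"]

    row_width = len(fields) + 1             # +1 slot for the deletion flag
    rows = [flat[i + 1:i + row_width]       # skip the flag when grouping
            for i in range(0, len(flat), row_width)]
    records = [[parse_value(ftype, v) for (_, ftype), v in zip(fields, row)]
               for row in rows]
    print(records)  # [['Springfield', 30720], ['Shelbyville', 12650]]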

karimbahgat and others added 4 commits August 24, 2016 22:16
The previous records() speedup forgot that row values had to be parsed from
strings into their appropriate field types, so the record parsing was moved
into a separate method that is called where needed.

It also forgot that the delete flag counts as a field when grouping the flat
row list into rows; changed to the correct length.

All doctests pass again after these fixes.
@karimbahgat (Collaborator, Author)

Actually, it appears the 20x speedup came almost entirely from not having to parse the value types. After this fix, the batch records() method shows no noticeable speedup over the original, so I'm not sure the changes to the internal API are worth it.

Closing this. I suggest just changing records() back to its pre-speedup state.
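
A rough way to see why the gain disappears is a standalone micro-benchmark (not code from this PR; absolute times and the ratio will vary by machine) comparing leaving raw values as strings against converting each one:

    import timeit

    raw = [" 30720"] * 100000  # raw numeric strings, as read from a .dbf

    # Leaving values as strings (what the broken speedup effectively did).
    t_raw = timeit.timeit(lambda: [v for v in raw], number=10)

    # Parsing each value into its field type (what correctness requires).
    t_parsed = timeit.timeit(lambda: [int(v) for v in raw], number=10)

    print("raw: %.3fs  parsed: %.3fs  ratio: %.1fx"
          % (t_raw, t_parsed, t_parsed / t_raw))

The per-value conversion, not the batched file reading, is where the time goes, which matches the observation above.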

@karimbahgat (Collaborator, Author)

I.e., this should restore it to normal:

    def records(self):
        """Returns all records in a dbf file."""
        if not self.numRecords:
            self.__dbfHeader()
        records = []
        f = self.__getFileObj(self.dbf)
        f.seek(self.__dbfHeaderLength())
        for i in range(self.numRecords):
            r = self.__record()
            if r:
                records.append(r)
        return records
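
For context, this is the public call path the revert affects. Reader and records() are pyshp's existing API; the shapefile path here is only a placeholder:

    import shapefile

    sf = shapefile.Reader("shapefiles/blockgroups")  # placeholder path
    recs = sf.records()           # one list of parsed field values per record
    print(len(recs), recs[0])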

@micahcochran (Contributor)

@GeospatialPython
How do you want to handle this? Does someone need to put a PR together to restore the records() function?

@karimbahgat (Collaborator, Author)

I've put together PR #68, which reverts records() to the original; that should do it. All tests pass.
