Simplify deltacode output #16

steven-esser · 2017-11-07T16:31:18Z

For the csv output, it would be a better presentation if we simply included a single path value, instead of empty or repeated paths that are redundant.

steven-esser · 2017-11-16T23:21:08Z

@johnmhoran Plan of attack:

Review + merge PR 19 fix to dict #20
Modify Delta object to_dict function to return only minimal info (category and path for the most part)
Modify CSV ouput function to handle new data structure
Modify JSON output to Add header information that has been removed from DeltaCode.to_dict()

steven-esser · 2017-11-16T23:22:33Z

Feel free to push your changes in a branch without opening a PR. This is easier and less noisy than constantly updating a PR.

If you have questions or would like me to expand in detail on any of the above items, let me know.

johnmhoran · 2017-11-18T00:40:41Z

@MaJuRG I've modified Delta.to-dict() and the CSV output to handle the new data structure, and fixed the 9 failing tests as well. About to add missing headers to the JSON output. Here's an excerpt from the current JSON output file testing a set of test scans with 1 added file:

{
    "added": [
        {
            "category": "added", 
            "path": "a/a5.py"
        }
    ], 
    "removed": [], 
    "modified": [], 
    "unmodified": [
        { . . .

How do we want to handle the redundant category information? Replace the 4 category keys with a single key named deltas and a value containing a list of category/path key/value pairs? Example:

{
    "deltas": [
        {
            "category": "added", 
            "path": "a/a5.py"
        },
        {
            "category": "unmodified", 
            "path": "a/a1.py"
        } . . .

Also, I assume we want the missing header info added to the top of the JSON output file. If so, I think that means adding values to the top of the incoming OrderedDict created by DeltaCode.to_dict(). Do I have that right??

* Modify Delta object to_dict function to return only minimal info (category and path). * Modify CSV ouput function to handle new data structure. * Refactor 9 failing tests and related test files. Signed-off-by: John M. Horan <johnmhoran@gmail.com>

johnmhoran · 2017-11-18T00:58:56Z

@MaJuRG I just committed and pushed my work to date so you can vet including in connection with my recent questions.

steven-esser · 2017-11-18T01:04:36Z

We can keep the redundant categories around for now. If you recall our conversation yesterday, the delta object's category field will no longer match the Deltas dict category keys once we add license/copyright information.

johnmhoran · 2017-11-18T01:11:39Z

I do recall, e.g., license_change. Thanks @MaJuRG . When you have a chance, can you sketch out an example excerpt of how you see the future JSON structure?

Re the 2nd question, do we want to add the missing header info to the top of the incoming OrderedDict. If yes, how does one do that? I've seen 2 approaches: rewrite the OrderedDict -- said to be slow but relatively straightforward -- or write a function to prepend.

Finally, once I finish this, what shall I tackle next?

johnmhoran · 2017-11-18T01:39:43Z

@MaJuRG I can add the version header from inside the JSON function by moving the variable from __init__.py to cli.py, but adding the stats from DeltaCode.get_stats() has eluded me so far. Do we need to restore that to the DeltaCode.to_dict() method in order to add it to the JSON output file?

johnmhoran · 2017-11-18T02:02:08Z

@MaJuRG I'm able to add the stats by passing new and old to generate_json and calling the stats like this: data['deltacode_stats'] = DeltaCode(new, old).get_stats(). The 2 headers (version and stats) have been added to the JSON file, though they appear at the bottom rather than the top (per my question above re adding to the top).

steven-esser · 2017-11-18T02:10:12Z

a = OrderedDict([
    ('header', header_info),
    ('deltacode_stats' deltacode.get_stats()),
    ('deltas', deltacode.to_dict())
])

pass the deltacode object into generate_json instead of the data dict and just call to_dict inside the csv function at the right place.

steven-esser · 2017-11-18T02:12:39Z

Shouldnt need to move version to cli.py. Just import deltacode at the top and reference the version like: deltacode.__version__

johnmhoran · 2017-11-18T03:16:33Z

Very nice. Thanks, @MaJuRG .

My only open issue: the list comprehension you suggested -- still digging into how to include the 3 variable assignments and the tuple_list.append() operation currently inside the 2nd of the nested for loops.

johnmhoran · 2017-11-18T17:53:17Z

@MaJuRG I believe I've answered my last question. Spent time yesterday trying to figure out how to address the double-nested multiple variable assignments and the append operation inside a list comprehension. Nothing seemed to work, no hints from my research.

Took a fresh look this morning and realized that I've seen this before in simpler form: initializing variables earlier than necessary. When all is said and done, it's just an append operation.

And this:

    tuple = ()
    tuple_list = []
    deltas = data

    for delta in deltas:
        category = delta
        for f in deltas[delta]:
            category = f['category']
            path = f['path']
            tuple = (category, path)
            tuple_list.append(tuple)

. . . can be replaced with this:

    deltas = data
    tuple_list = [(f['category'], f['path']) for delta in deltas for f in deltas[delta]]

All 79 tests pass. I want to do a little command-line testing just to be sure, and if all looks good, will clean up the code, commit, push and open a PR.

Signed-off-by: John M. Horan <johnmhoran@gmail.com>

* Remove call to Delta.to_dict(). Signed-off-by: John M. Horan <johnmhoran@gmail.com>

steven-esser · 2017-11-28T04:42:06Z

#21 merged, closing this.

Add basic docs

johnmhoran self-assigned this Nov 16, 2017

steven-esser mentioned this issue Nov 16, 2017

Fix to_dict() method #19

Closed

steven-esser added enhancement refactoring labels Nov 16, 2017

johnmhoran added a commit that referenced this issue Nov 20, 2017

Refactor JSON- and CSV-output functions #16

054888e

Signed-off-by: John M. Horan <johnmhoran@gmail.com>

johnmhoran added a commit that referenced this issue Nov 21, 2017

Refactor generate_csv() #16

5a42628

* Remove call to Delta.to_dict(). Signed-off-by: John M. Horan <johnmhoran@gmail.com>

steven-esser closed this as completed Nov 28, 2017

steven-esser added a commit that referenced this issue Feb 17, 2021

Merge pull request #16 from AyanSinhaMahapatra/add-basic-docs

b6ef568

Add basic docs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify deltacode output #16

Simplify deltacode output #16

steven-esser commented Nov 7, 2017

steven-esser commented Nov 16, 2017

steven-esser commented Nov 16, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

steven-esser commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

steven-esser commented Nov 18, 2017 •

edited

Loading

steven-esser commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

steven-esser commented Nov 28, 2017

Simplify deltacode output #16

Simplify deltacode output #16

Comments

steven-esser commented Nov 7, 2017

steven-esser commented Nov 16, 2017

steven-esser commented Nov 16, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

steven-esser commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

steven-esser commented Nov 18, 2017 • edited Loading

steven-esser commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

johnmhoran commented Nov 18, 2017

steven-esser commented Nov 28, 2017

steven-esser commented Nov 18, 2017 •

edited

Loading