analysis tests #62

kosovan · 2024-05-17T16:24:48Z

Provides unit tests for:

lib.analysis.get_params_from_file_name()
lib.analysis.block_analyze()
lib.analysis.get_dt()
lib.analysis.split_dataframe_in_equal_blocks()
lib.analysis.split_dataframe()
lib.analysis.add_data_to_df()
lib.analysis.analyze_time_series()

Deprecates:

lib.analysis.do_binning_analysis()
lib.analysis.merge_time_series_dfs()
lib.analysis.get_time_series_from_average_df()
lib.analysis.read_csv_file()
lib.analysis.get_distribution_from_df()
lib.analysis.create_histogram_df_from_distribution_list()
lib.analysis.find_index_with_value_in_df()

Coverage of analysis.py: 100%. This increases the total coverage of the module to 83%.

kosovan · 2024-05-17T16:26:53Z

@pm-blanco Are these analysis functions actually used by pyMBE? Maybe we should consider removing or at least updating some of them?

pm-blanco · 2024-05-17T16:44:28Z

@kosovan block_analyze and the functions called therein are currently used by the sample scripts samples/branched_polyampholyte.py, samples/peptide_mixture_grxmc_ideal.py and samples/peptide.py, and also by the script that we use to reproduce the data of our manuscript, samples/Beyer2024/create_paper_data.py, via analysis.analyze_time_series(). I am not sure whether all functions in lib.analysis are actually called within pyMBE; it would definitely be worth checking whether we can clean some of them up. Updating them to their current counterparts in our private repository is a good idea. Would you take care of that?

pm-blanco · 2024-06-05T11:48:30Z

@kosovan I took the liberty of finishing this PR. I removed from lib/analysis.py all the functions that are not used within pyMBE and updated the remaining functions to follow our current standards. I have also provided unit tests for all the functions in the library, which cover 100% of the code.

pm-blanco · 2024-06-05T15:35:42Z

@paobtorres this PR is now ready for review

paobtorres · 2024-06-06T17:11:05Z

lib/analysis.py

There are a few inconsistencies in the docstrings of this file. For example, in analyze_time_series() the Args entries have no item symbol (-) and Returns provides only the format of the output, whereas in block_analyze() each Args entry has an item symbol (-) and Returns contains the variable name of the output.

@paobtorres thank you for your review and for bringing up this point. I have reviewed the docstrings of analysis.py and pyMBE.py to try to standardize the format and, in the process, I fixed numerous inaccuracies and ambiguities in variable types. Regarding the format of the Returns entries, I think that we should not require a variable name in the docstrings, because some functions, like pmb.check_if_df_cell_has_a_value or check_if_name_is_defined_in_df, do not store their output in a variable but simply return it, and the user can then assign it to any variable name. I would suggest either dropping all variable names for return values (because they are not important for users anyway) or adding the variable name only when it is actually defined in the function.

@pm-blanco I think that depends on which Python docstring guideline we follow. I believe that when there is a variable name we should include it, because it helps the user to better understand how each function works by reading the docstring.

@paobtorres we follow the Google Style Python Docstrings. In that standard, variable names are not provided for return values. However, I do not think the standard forbids providing them, so we can agree to provide variable names for return values where it helps users reading the code, without enforcing that every function must have a return variable name.
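To make the agreed convention concrete, here is a minimal sketch of the two cases discussed above. Both functions are hypothetical illustrations and are not taken from pyMBE: is_value_defined() returns an expression directly, so its Returns entry gives only the type, while block_means() stores its result in a named variable, so the name is included before the type.

```python
def is_value_defined(value):
    """Checks whether a value is defined.

    Args:
        value (object): Value to check.

    Returns:
        bool: True if `value` is not None, False otherwise.
    """
    # The result is returned directly, so there is no variable name to document.
    return value is not None


def block_means(samples, block_size):
    """Averages a time series over consecutive blocks of equal size.

    Args:
        samples (list of float): Time series to average.
        block_size (int): Number of samples per block.

    Returns:
        means (list of float): Mean of each complete block.
    """
    # Trailing samples that do not fill a complete block are discarded.
    n_blocks = len(samples) // block_size
    means = [
        sum(samples[i * block_size:(i + 1) * block_size]) / block_size
        for i in range(n_blocks)
    ]
    # The result is stored in `means`, so the docstring names that variable.
    return means
```

Either form keeps the Args and Returns sections in the same layout, which is the consistency issue raised in the review.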

paobtorres

I think the docstrings in analysis.py should be changed to keep a consistent format. Other than that, I did some testing and everything ran successfully.

paobtorres

I reviewed the changes and quickly ran a test; everything is OK.

test_get_dt first draft

0df2219

make pylint happy

afbfb2a

pm-blanco assigned kosovan May 17, 2024

pm-blanco added the ci-improvement label May 17, 2024

pm-blanco added this to the first stable version milestone May 17, 2024

pm-blanco added 5 commits June 4, 2024 11:25

merge main

288b99d

deprecate unused methods in analysis

91da8c7

solved bug in get_dt for dataframes with repeated entries, improve co…

be1c5fe

…verage of the same function

adopt new formatting for filenames, add unit testing for get_params_f…

dc3a0d4

…rom_file_name

solve code quality issues

2907bde

pm-blanco self-requested a review June 4, 2024 14:04

pm-blanco added 4 commits June 4, 2024 17:52

add unit tests for block_analyze, deprecate unused method

f98287d

remove dependency on deprecated function

dea2b3a

add missing testing data

8296fad

Add unit tests for all analysis functions

ffdc225

pm-blanco added 4 commits June 5, 2024 13:56

shorten the time series for testing

53281e7

add missing file, fix format of the filename

c15b5bf

sort columns before testing

85c605b

ignore different types

786e099

pm-blanco requested a review from paobtorres June 5, 2024 14:41

pm-blanco added 2 commits June 5, 2024 17:29

Merge branch 'main' into analysis_tests

8d0eedf

Add license to new unit test

cf8b1ae

paobtorres reviewed Jun 6, 2024

paobtorres requested changes Jun 6, 2024

pm-blanco added 2 commits June 7, 2024 11:47

docstrings: fix format inconsistencies and variable type ambiguities

5fa05a4

fix some remamaining formatting issues

b7230b9

pm-blanco removed their request for review June 7, 2024 13:55

paobtorres marked this pull request as ready for review June 7, 2024 14:09

paobtorres approved these changes Jun 7, 2024

pm-blanco merged commit 5e98340 into pyMBE-dev:main Jun 7, 2024
3 checks passed
