Re-worked cutoff_error parameter for automatic linestrength cutoff #646
base: develop
Conversation
I had accidentally changed the docstring in my previous commit; changed it back to a raw string.
Seems good to me, but I would like @erwanp's feedback. We are also waiting for Travis to be back for the automatic tests.
Rebased on the latest develop version (with fixed tests), and triggering the test suite to test this PR.
Codecov Report

Attention: Patch coverage is

@@            Coverage Diff             @@
##           develop      #646      +/-   ##
===========================================
- Coverage    73.16%    72.95%    -0.22%
===========================================
  Files          137       148       +11
  Lines        18864     21230     +2366
===========================================
+ Hits         13802     15488     +1686
- Misses        5062      5742      +680
I've tried fixing the code so that it currently supports Pandas DataFrames. What would be the best approach for supporting Vaex, as it doesn't seem to have a
If we convert to Numpy we will lose all the performance benefit of Vaex. We use Vaex when arrays cannot be loaded in RAM, so there is no straightforward solution. You could try to benchmark a few things on the side; for instance, create a function in radis.misc.arrays to find a cumsum threshold on a >1e10-element Vaex array. Make sure that the array doesn't fit in memory, so that you are dealing with the worst-case scenario. Then it is probably possible to use Vaex to sort and do a binning, and use Pandas cumsum() on the binned array (which will clearly reduce the memory usage).
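To make the cumsum-threshold idea above concrete, here is a minimal sketch of finding a linestrength cutoff from a cumulative sum. The function name `find_cutoff` and its exact semantics (drop all lines strictly weaker than the returned value, discarding at most a `cutoff_error` fraction of the total linestrength) are my assumptions for illustration, not the actual RADIS implementation:

```python
import numpy as np

def find_cutoff(linestrengths, cutoff_error):
    """Hypothetical helper: largest cutoff such that dropping all lines with
    linestrength strictly below it removes at most a `cutoff_error` fraction
    of the total integrated linestrength."""
    s = np.sort(np.asarray(linestrengths, dtype=float))  # weakest lines first
    cum = np.cumsum(s)                                   # strength discarded so far
    total = cum[-1]
    # number of weakest lines whose combined strength stays within the budget
    n_drop = np.searchsorted(cum, cutoff_error * total, side="right")
    if n_drop >= len(s):
        return np.inf          # degenerate: everything may be discarded
    # drop lines strictly weaker than the first line we must keep
    return s[n_drop]
```

With linestrengths [1, 2, 3, 4, 90] and a 5% budget this returns 3 (dropping the lines of strength 1 and 2 removes 3% of the total). A Vaex version would replace the sort/cumsum with the binned approach described above.
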
I chose 100,000 as the limit on the length of a DataFrame to be processed in memory, which I tested on a dataset with >3e9 rows, but it might be worth testing further.
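One way to sketch the in-memory limit discussed here: compute the cumulative sum in one shot for small arrays, and chunk by chunk for large ones so only a bounded slice is materialized at a time. The constant name `IN_MEMORY_LIMIT` and the function below are illustrative assumptions, not the PR's actual code:

```python
import numpy as np

# Hypothetical constant mirroring the 100,000-row limit discussed above.
IN_MEMORY_LIMIT = 100_000

def cumsum_threshold_index(values, threshold, chunk_size=IN_MEMORY_LIMIT):
    """Number of leading elements whose running sum stays <= threshold.

    Short arrays: one vectorized cumsum.  Long arrays: accumulate chunk by
    chunk, so at most `chunk_size` elements are held at once."""
    values = np.asarray(values, dtype=float)
    if len(values) <= chunk_size:
        return int(np.searchsorted(np.cumsum(values), threshold, side="right"))
    running = 0.0   # total of all chunks processed so far
    count = 0       # elements counted under the threshold so far
    for start in range(0, len(values), chunk_size):
        chunk = values[start:start + chunk_size]
        cum = running + np.cumsum(chunk)
        idx = int(np.searchsorted(cum, threshold, side="right"))
        count += idx
        if idx < len(chunk):          # threshold crossed inside this chunk
            return count
        running = cum[-1]
    return count                      # threshold never crossed
```

Both paths give the same answer; the chunked path trades a small Python-loop overhead for bounded memory, which is the trade-off being benchmarked here.
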
Seems good. Could you provide an example that raises a warning? |
For an example that both adjusts the cutoff and raises the warning:
or set |
Description
Addresses issue #268, building on PR #451.
I adjusted some of the docstrings and printed messages to be a bit more informative by clarifying what the function does with the input arguments, i.e. that the user-input cutoff is being adjusted to conform to the user-input cutoff_error. I chose to stick to pandas methods for simplicity rather than using numpy. But is compatibility with vaex dataframes also sought in this issue?
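The cutoff-adjustment behaviour described above can be sketched with pandas only. The function name `apply_cutoff`, the column name "S", and the adjustment rule (lower the cutoff until the discarded linestrength fits within the error budget) are assumptions for illustration, not the PR's exact code:

```python
import pandas as pd

def apply_cutoff(df, cutoff, cutoff_error):
    """Hypothetical sketch: drop lines with linestrength S below `cutoff`,
    lowering the cutoff first if it would discard more than a `cutoff_error`
    fraction of the total linestrength."""
    total = df["S"].sum()
    budget = cutoff_error * total
    s_sorted = df["S"].sort_values()   # weakest lines first (pandas methods only)
    cum = s_sorted.cumsum()            # strength discarded so far
    dropped = cum[s_sorted < cutoff]
    if len(dropped) and dropped.iloc[-1] > budget:
        # largest cutoff whose discarded strength stays within the budget
        ok = cum[cum <= budget]
        if len(ok) < len(s_sorted):
            cutoff = s_sorted.iloc[len(ok)]
        print(f"Input cutoff adjusted to {cutoff:.3g} to keep the "
              f"linestrength error within {cutoff_error:.1%}")
    return df[df["S"] >= cutoff]
```

For example, with linestrengths [1, 2, 3, 4, 90], a requested cutoff of 5, and a 5% error budget, the requested cutoff would discard 10% of the total, so it is lowered to 3 and only the two weakest lines are dropped.
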