Add audinterface.Segment.process_table() #172

maxschmitt · 2024-04-29T16:15:44Z

New method Segment.process_table().
Added Usage instructions in Usage.rst and tests.

From usage documentation:

maxschmitt · 2024-04-29T17:46:03Z

I don't get why we run into a

E           soundfile.LibsndfileError: Error opening '/home/runner/work/audinterface/audinterface/file.wav': System error.

for most of the test when running process_table(). The method works when testing locally.

Concerning Linter, this is also not clear to me:

ruff-format..............................................................Failed
- hook id: ruff-format
- files were modified by this hook

I checked all files with ruff and there were no issues.

hagenw · 2024-04-30T07:10:18Z

Regarding ruff, we use two command (linting with automatic fix, and automatic code formatting, compare https://github.com/audeering/audinterface/blob/main/CONTRIBUTING.rst#coding-convention).

The easiest solution locally would be to run pre-commit as well, e.g. (after installing it)

$ pre-commit install
$ pre-commit run --all-files

When running it the first time it will fail as it had to make changes, but when running it again you will see that it passes.
You can then inspect (and commit) the changes it did.

hagenw · 2024-04-30T07:15:02Z

The test fails with the same error locally for me. /home/hwierstorf/git/audeering/audinterface/file.wav does also not look like a correct path, as usually we store the files for testing inside a tempdir, so maybe you are handing the wrong path to the processing function?

tests/test_segment.py

… coverage)

codecov · 2024-05-07T10:28:30Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.0%. Comparing base (bdb078c) to head (b9b6ee2).

Additional details and impacted files

Files	Coverage Δ
audinterface/core/segment.py	`100.0% <100.0%> (ø)`

…gmentation returning additional segments

maxschmitt · 2024-05-07T17:31:35Z

$ pre-commit install
$ pre-commit run --all-files

thanks

maybe you are handing the wrong path to the processing function?

yes, I forgot root=root in the test for process_table()

This seems not to test if the labels are correctly assigned if the number of segments increases.

I added a test here ll. 333-.

Generally, handling all edge cases and ensuring that the dtype of each column is correctly transferred is a bit cumbersome. I'm not sure if there is a better solution than the one implemented.
process_table() ll. 577-

Otherwise, all tests are running without errors, now. @hagenw

maxschmitt · 2024-05-07T18:47:24Z

8bd24aa resolved a bug for CategoricalDtype columns.

  File "xxx/audinterface/core/segment.py", line 596, in <dictcomp>
    col: labels[:, icol].astype(dtypes[icol])
TypeError: Cannot interpret 'CategoricalDtype(categories=['anger', 'boredom', 'disgust', 'fear', 'happiness',
                  'sadness', 'neutral'],
, ordered=False, categories_dtype=object)' as a data type

hagenw

Very nice, thanks for adding this.

Besides the comments I made, it would be nice to have a test that shows the new method behaves well if the segment function returns overlapping segments, e.g.

start,end
0,2
1,3
2,4
...

audinterface/core/segment.py

docs/usage.rst

tests/test_segment.py

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

maxschmitt · 2024-05-15T10:29:15Z

Besides the comments I made, it would be nice to have a test that shows the new method behaves well if the segment function returns overlapping segments, e.g.

This was now added by: e393dc8

All other suggestions were adopted.

Some tests are failing now (especially python 3.8, everything seems to be working for newer versions), but the errors are likely to originate from code that has not been touched.
@hagenw Do you know why it fails?

hagenw · 2024-05-16T08:49:01Z

It's indeed interesting that the tests for Python 3.8 fail due to audb. As in https://github.com/audeering/audb the tests for Python 3.8 are not failing.

hagenw · 2024-05-16T08:57:41Z

I can also not reproduce the failing test locally with Python 3.8.
My guess is that it has something to do with the audb cache we use during the tests.
The code for which audb fails is:

        try:
            deps = Dependencies()
            deps.load(cached_deps_file)
        except (AttributeError, FileNotFoundError, ValueError, EOFError):
            # If loading cached file fails, load again from backend
            backend_interface = utils.lookup_backend(name, version)
            deps = download_dependencies(backend_interface, name, version, verbose)
            # Store as pickle in cache
            deps.save(cached_deps_file)

So it seems we need to catch KeyError there as well.

hagenw · 2024-05-16T09:33:04Z

I proposed a fix for audb in audeering/audb#411.

hagenw · 2024-05-16T13:33:56Z

The test for Python 3.8 is now passing, but the test under Windows fails for the same reason as before the test under Python 3.8. We will discuss in audb how to better handle this.

maxschmitt · 2024-05-16T13:40:29Z

Now, only the tests on Windows are failing due to pyarrow:
https://github.com/audeering/audinterface/actions/runs/9094046905/job/25054683122?pr=172

hagenw · 2024-05-16T14:10:23Z

Yes, but the reason is basically the same as before under Python 3.8, see audeering/audb#411 (comment).

So we need again to fix it in audb.

hagenw

As the audb error is related to the cache, I managed to get the tests pass by deleting the existing cache and re-running the tests. The underlying problem, and how to solve it in audb is discussed in audeering/audb#413.

I have updated the description of the pull request by adding two screenshots of the new documentation, as it is always helpful to have the description as documentation on what was added by this pull request. Otherwise, I have just one other suggestion, and we should be fine to go here.

docs/usage.rst

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

maxschmitt · 2024-05-17T11:01:21Z

The Windows test (at least for Python 3.9) is still failing. Can we delete the cache also for this one?

hagenw · 2024-05-17T12:36:29Z

As I understood it, it should have started with creating a new cache already. Which means for some reason the new audb can no longer be shared between Windows and other platforms.

I could delete the cache and start the pipeline again, but then we would most likely need to do that again after merging.

If you don't need this feature next week, I would propose to postpone until we have found a better solution in audb.

hagenw · 2024-05-17T12:42:10Z

Just for the record:

The cache did not existed before the last run, and was created and uploaded by the Ubuntu Python 3.10 job
This cache was then downloaded by the failing Windows Python 3.9 job

maxschmitt · 2024-05-17T13:06:06Z

If you don't need this feature next week, I would propose to postpone until we have found a better solution in audb.

Makes sense, no rush!

hagenw · 2024-06-04T07:26:01Z

Good news, with audb 1.7.3 all tests are passing.

hagenw

All fine for merging here.

Maximilian Schmitt added 4 commits April 29, 2024 12:58

Initial implementation of Segment.process_table()

fd35a83

Merge branch 'main' into process_table

75e33bd

fix

717fe92

Adding notes on usage and tests for process_table().

31a3734

maxschmitt requested a review from hagenw April 29, 2024 16:15

maxschmitt assigned hagenw Apr 29, 2024

fix

450f91e

hagenw reviewed Apr 30, 2024

View reviewed changes

tests/test_segment.py Outdated Show resolved Hide resolved

Maximilian Schmitt added 4 commits May 6, 2024 12:15

code formatting fixed by pre-commit

3b47dfe

Fixing tests for process_table() with relative path

f7bea4e

renaming test

f4b391b

Adding test for calling process_table with an index (ValueError, code…

d3c5693

… coverage)

Maximilian Schmitt added 9 commits May 7, 2024 14:08

Test assignment of labels for dataframe with a segmented index and se…

6219510

…gmentation returning additional segments

fix

589ed6e

Fixing transfer of dtype and corresponding test

df02e33

Fixing other tests not to expect a different dtype

3c86379

fix

2980bc3

fix if processing function returns empty table

79b4833

better fix and 1D-dataframe test

da4d433

fixing dtype for empty segments test

187fe10

trying to resolve file loading issue in documentation

faa9968

Fixing issue for category type columns

8bd24aa

hagenw reviewed May 15, 2024

View reviewed changes

Update audinterface/core/segment.py

ee4b742

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

Maximilian Schmitt and others added 10 commits May 15, 2024 11:17

moving dtypes and adding description

0d47e90

check error

fe6f86e

revert

5ef1f7e

Update audinterface/core/segment.py

991d8a2

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

chaning indexes to usual convention

607888d

Update docs/usage.rst

b6aa70c

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

Update tests/test_segment.py

dfdd96e

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

Update docs/usage.rst

f4b71d7

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

adapt header of usage.rst to make new example work

4c197b9

Adding test for overlapping segments

e393dc8

hagenw mentioned this pull request May 16, 2024

Ensure dependency loading from cache does not fail audeering/audb#411

Merged

hagenw reviewed May 17, 2024

View reviewed changes

docs/usage.rst Outdated Show resolved Hide resolved

Update docs/usage.rst

b9b6ee2

Co-authored-by: Hagen Wierstorf <hwierstorf@audeering.com>

hagenw mentioned this pull request May 17, 2024

Test backward compatibility audeering/audb#413

Closed

hagenw changed the title ~~Process table~~ Add audinterface.Segment.process_table() Jun 4, 2024

hagenw approved these changes Jun 4, 2024

View reviewed changes

maxschmitt merged commit f094aaf into main Jun 4, 2024
16 checks passed

maxschmitt deleted the process_table branch June 4, 2024 07:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add audinterface.Segment.process_table() #172

Add audinterface.Segment.process_table() #172

maxschmitt commented Apr 29, 2024 •

edited by hagenw

Loading

maxschmitt commented Apr 29, 2024

hagenw commented Apr 30, 2024

hagenw commented Apr 30, 2024

codecov bot commented May 7, 2024 •

edited

Loading

maxschmitt commented May 7, 2024

maxschmitt commented May 7, 2024

hagenw left a comment

maxschmitt commented May 15, 2024

hagenw commented May 16, 2024

hagenw commented May 16, 2024

hagenw commented May 16, 2024

hagenw commented May 16, 2024

maxschmitt commented May 16, 2024

hagenw commented May 16, 2024

hagenw left a comment

maxschmitt commented May 17, 2024

hagenw commented May 17, 2024

hagenw commented May 17, 2024

maxschmitt commented May 17, 2024

hagenw commented Jun 4, 2024

hagenw left a comment

Add audinterface.Segment.process_table() #172

Add audinterface.Segment.process_table() #172

Conversation

maxschmitt commented Apr 29, 2024 • edited by hagenw Loading

maxschmitt commented Apr 29, 2024

hagenw commented Apr 30, 2024

hagenw commented Apr 30, 2024

codecov bot commented May 7, 2024 • edited Loading

Codecov Report

maxschmitt commented May 7, 2024

maxschmitt commented May 7, 2024

hagenw left a comment

Choose a reason for hiding this comment

maxschmitt commented May 15, 2024

hagenw commented May 16, 2024

hagenw commented May 16, 2024

hagenw commented May 16, 2024

hagenw commented May 16, 2024

maxschmitt commented May 16, 2024

hagenw commented May 16, 2024

hagenw left a comment

Choose a reason for hiding this comment

maxschmitt commented May 17, 2024

hagenw commented May 17, 2024

hagenw commented May 17, 2024

maxschmitt commented May 17, 2024

hagenw commented Jun 4, 2024

hagenw left a comment

Choose a reason for hiding this comment

maxschmitt commented Apr 29, 2024 •

edited by hagenw

Loading

codecov bot commented May 7, 2024 •

edited

Loading