feat: add audio narration (updated) #346

angelala3252 · 2023-07-03T20:47:12Z

Summary

Updated version of PR #195 due to deleting previous fork. This feature is blocked until MacOS accessibility issues are fixed.

(Addresses #164)

Checklist

My code follows the style guidelines of OpenAdapt
I have performed a self-review of my code
If applicable, I have added tests to prove my fix is functional/effective
I have linted my code locally prior to submission
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation (e.g. README.md, requirements.txt)
New and existing unit tests pass locally with my changes

…conversion

…udio_narration # Conflicts: # requirements.txt

openadapt/record.py

abrichr · 2023-08-24T18:45:48Z

@0dm can you please comment about what would be required to get an analogous implementation of this for Mac? 🙏

abrichr · 2023-08-24T18:49:54Z

openadapt/record.py

+        audio_frames.append(indata.copy())
+
+    # open InputStream and start recording while ActionEvents are recorded
+    audio_stream = sounddevice.InputStream(


@angelala3252 @0dm what is the easiest way to implement a MacOS-compatible analog of this? Can we re-use existing code in other PRs?

#362 has a good method of getting audio devices via Apple AVFoundation, I'm sure it can be used here with minimal issue. I'm not sure if it'll be plug & play with my PR though, would need some changes depending on the implementation.

0dm · 2023-08-24T22:29:13Z

@0dm can you please comment about what would be required to get an analogous implementation of this for Mac? 🙏

The issue with not being able to capture audio with openadapt.record running should be fixed after the changes in _macos a while ago.

abrichr · 2023-08-30T20:46:50Z

openadapt/record.py

@@ -24,6 +24,11 @@



Please update module documentation with optional flag example

abrichr · 2023-08-30T20:49:12Z

@angelala3252 can you please fix merge commits? I believe you will need to re-generate the alembic scripts as well.

Please also confirm this works with poetry. After checking out your branch and running poetry install followed by python -m openadapt.record foo --enable_audio, I get:


Traceback (most recent call last):
  File "/usr/local/Cellar/python@3.10/3.10.11/Frameworks/Python.framework/Versions/3.10/lib/python3.10/runpy.py", line 196, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/usr/local/Cellar/python@3.10/3.10.11/Frameworks/Python.framework/Versions/3.10/lib/python3.10/runpy.py", line 86, in _run_code
    exec(code, run_globals)
  File "/Users/abrichr/oa/src/OpenAdapt/openadapt/record.py", line 27, in <module>
    import sounddevice
ModuleNotFoundError: No module named 'sounddevice'

abrichr · 2023-08-30T20:51:14Z

requirements.txt

Is this still the recommended way to install whisper?

I also found https://pypi.org/project/openai-whisper/

poetry add openai-whisper isn't working for me as I get issues building wheel for llvmlite and numba, and also Unable to find installation candidates for triton (2.0.0). The fixes online all suggest that this is because pip is older than version 19.0, but our pip is version 23.1.2. poetry add git+https://github.com/openai/whisper.git works so for now I'm just going with that.

angelala3252 · 2023-08-31T11:48:31Z

After pulling from main, for some reason importing sounddevice breaks recordings. I can record audio just fine, but nothing else can be recorded and no window events can be registered and the recording hangs indefinitely after Ctrl+C. This even happens when I'm not recording audio and I can't figure out why. I'm using the same version (0.4.6) I was when everything still worked.

abrichr · 2024-03-02T23:06:50Z

@angelala3252 this may be due to openadapt.capture

@0dm any ideas? 🙏

abrichr · 2024-06-18T03:27:26Z

Implemented in #673

@angelala3252 thank you for blazing the trail here!! 🙏

angelala3252 added 25 commits May 26, 2023 17:40

added sounddevice to optionally record narration

351d87b

added sounddevice to optionally record narration and initial whisper …

f19a84a

…conversion

updated requirements for audio narration

e143767

small changes

6f07b93

fixed issue with created audio file being really slow

d3ef09a

updated to save audio data and transcribed text in database

9e86193

pull from main

87a814f

new alembic migration

ce84a1b

edited audio tables

5c584b2

convert audio array to required format for whisper

802c8a2

visualize audio info

aca8cdc

FLAC compression before storing

42b1007

store word by word timestamps

9f4c280

style changes

20d29e1

Merge branch 'main' into feat/audio_narration

109ffe0

changed tiktoken version

8d27b4f

removed unused tiktoken code

d631b2d

Merge branch 'main' into feat/audio_narration

ab0805e

alphabetic order, removed redundant dependencies

e30538b

merged AudioInfo and AudioFile

9469043

Merge remote-tracking branch 'audio/feat/audio_narration' into feat/a…

47bf845

…udio_narration # Conflicts: # requirements.txt

move audio recording into record_audio function

e9f2d36

use thread-local scoped_session

9293b0b

Merge branch 'main' into feat/audio_narration

a66acbc

remove redundant requirement

888d335

angelala3252 mentioned this pull request Jul 3, 2023

Audio narration #195

Closed

KrishPatel13 reviewed Aug 3, 2023

View reviewed changes

openadapt/record.py Show resolved Hide resolved

KrishPatel13 reviewed Aug 3, 2023

View reviewed changes

openadapt/record.py Show resolved Hide resolved

KrishPatel13 reviewed Aug 3, 2023

View reviewed changes

openadapt/record.py Outdated Show resolved Hide resolved

abrichr reviewed Aug 24, 2023

View reviewed changes

abrichr reviewed Aug 30, 2023

View reviewed changes

openadapt/record.py Outdated

@@ -24,6 +24,11 @@

Copy link

Member

abrichr Aug 30, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please update module documentation with optional flag example

abrichr reviewed Aug 30, 2023

View reviewed changes

angelala3252 added 8 commits August 30, 2023 18:11

pull from main

e1a3a18

pull from main

d7c54f2

remove unused tiktoken function

3eaa3a8

add audio dependencies

05834c4

style changes

a6e45bd

new alembic file

f23df51

delete old requirements.txt

f6cdbc0

added audio dependencies

873cf6d

KIRA009 mentioned this pull request May 15, 2024

Audio narration #673

Merged

7 tasks

abrichr closed this Jun 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add audio narration (updated) #346

feat: add audio narration (updated) #346

angelala3252 commented Jul 3, 2023 •

edited by abrichr

Loading

abrichr commented Aug 24, 2023

abrichr Aug 24, 2023

0dm Aug 24, 2023

0dm commented Aug 24, 2023

abrichr Aug 30, 2023

abrichr commented Aug 30, 2023

abrichr Aug 30, 2023

angelala3252 Aug 31, 2023

angelala3252 commented Aug 31, 2023

abrichr commented Mar 2, 2024

abrichr commented Jun 18, 2024 •

edited

Loading

feat: add audio narration (updated) #346

feat: add audio narration (updated) #346

Conversation

angelala3252 commented Jul 3, 2023 • edited by abrichr Loading

abrichr commented Aug 24, 2023

abrichr Aug 24, 2023

Choose a reason for hiding this comment

0dm Aug 24, 2023

Choose a reason for hiding this comment

0dm commented Aug 24, 2023

abrichr Aug 30, 2023

Choose a reason for hiding this comment

abrichr commented Aug 30, 2023

abrichr Aug 30, 2023

Choose a reason for hiding this comment

angelala3252 Aug 31, 2023

Choose a reason for hiding this comment

angelala3252 commented Aug 31, 2023

abrichr commented Mar 2, 2024

abrichr commented Jun 18, 2024 • edited Loading

angelala3252 commented Jul 3, 2023 •

edited by abrichr

Loading

abrichr commented Jun 18, 2024 •

edited

Loading