Skip to content

feat(expressions, embedder, wavtool): add Breathiness & Voicing param…#30

Merged
NewComer00 merged 1 commit into
mainfrom
feat/breathiness-voicing-mhubert
Apr 30, 2026
Merged

feat(expressions, embedder, wavtool): add Breathiness & Voicing param…#30
NewComer00 merged 1 commit into
mainfrom
feat/breathiness-voicing-mhubert

Conversation

@NewComer00
Copy link
Copy Markdown
Owner

@NewComer00 NewComer00 commented Apr 30, 2026

…s, mHuBERT backend

feat(expressions): add BrecLoader and VoicLoader for Breathiness and Voicing

  • Expression parameter extraction using HPSS-based breath/voice index analysis feat(wavtool): add extract_wav_breath_voice() for HPSS frequency-band RMS
    extraction; add extract_wav_embeddings() with mHuBERT/PCA support and caching

feat(embedder): introduce mHuBERT as advanced speech feature extraction backend
for improved cross-language phoneme alignment

refactor(expressions): rename pitd cache dir from "pitd" to "f0"; remove
add_cuda_to_path import in crepe branch; force RMVPE to device="cpu"

docs(readme): restructure ML models section with Pitch Extraction and
mHuBERT Feature Extractor subsections; add CC-BY-NC-SA-4.0 license warning

docs(readme): add Get Model Weights section with HF mirror tip and
separate RMVPE/mHuBERT download instructions

docs(readme): add Breathiness and Voicing to supported params and features list;
add BrecLoader/VoicLoader subgraphs to algorithm workflow diagram;
fold v0.5.0/v0.6.0 version tips into inline bullet text; remove v0.9.1 tag

examples: add demo scripts for mHuBERT embedder and Breathiness/Voicing expressions

  • examples/mhubert_embedder_demo.py: minimal usage of mHuBERT feature extraction
  • examples/breath_voicing_control.ipynb: interactive tutorial for expression parameter tuning with visualization and audio preview

Acknowledgements:

License Reminder:

  • Model weights, pretrained assets, and derived features are distributed under CC-BY-NC-SA-4.0. Commercial use requires separate authorization. Please review LICENSE and README.md for full terms.

…s, mHuBERT backend

feat(expressions): add BrecLoader and VoicLoader for Breathiness and Voicing
  - Expression parameter extraction using HPSS-based breath/voice index analysis
feat(wavtool): add extract_wav_breath_voice() for HPSS frequency-band RMS
  extraction; add extract_wav_embeddings() with mHuBERT/PCA support and caching
feat(embedder): introduce mHuBERT as advanced speech feature extraction backend
  for improved cross-language phoneme alignment
refactor(expressions): rename pitd cache dir from "pitd" to "f0"; remove
  add_cuda_to_path import in crepe branch; force RMVPE to device="cpu"
docs(readme): restructure ML models section with Pitch Extraction and
  mHuBERT Feature Extractor subsections; add CC-BY-NC-SA-4.0 license warning
docs(readme): add Get Model Weights section with HF mirror tip and
  separate RMVPE/mHuBERT download instructions
docs(readme): add Breathiness and Voicing to supported params and features list;
  add BrecLoader/VoicLoader subgraphs to algorithm workflow diagram;
  fold v0.5.0/v0.6.0 version tips into inline bullet text; remove v0.9.1 tag
examples: add demo scripts for mHuBERT embedder and Breathiness/Voicing expressions
  - examples/mhubert_embedder_demo.py: minimal usage of mHuBERT feature extraction
  - examples/breath_voicing_control.ipynb: interactive tutorial for expression
    parameter tuning with visualization and audio preview

Acknowledgements:
- Special thanks to @ma0shu for foundational work on expressive parameter
  extraction: ma0shu@a15858d

License Reminder:
- Model weights, pretrained assets, and derived features are distributed under
  CC-BY-NC-SA-4.0. Commercial use requires separate authorization.
  Please review LICENSE and README.md for full terms.
@codecov
Copy link
Copy Markdown

codecov Bot commented Apr 30, 2026

Codecov Report

❌ Patch coverage is 72.34637% with 99 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
expressions/voic.py 31.03% 40 Missing ⚠️
expressions/brec.py 31.57% 39 Missing ⚠️
expressions/pitd.py 33.33% 10 Missing ⚠️
utils/embedder.py 93.87% 9 Missing ⚠️
expressions/tenc.py 50.00% 1 Missing ⚠️

📢 Thoughts on this report? Let us know!

@NewComer00 NewComer00 merged commit 55fe4f4 into main Apr 30, 2026
3 of 4 checks passed
@NewComer00 NewComer00 deleted the feat/breathiness-voicing-mhubert branch April 30, 2026 13:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant