feat(expressions, embedder, wavtool): add Breathiness & Voicing param…#30
Merged
Conversation
…s, mHuBERT backend
feat(expressions): add BrecLoader and VoicLoader for Breathiness and Voicing
- Expression parameter extraction using HPSS-based breath/voice index analysis
feat(wavtool): add extract_wav_breath_voice() for HPSS frequency-band RMS
extraction; add extract_wav_embeddings() with mHuBERT/PCA support and caching
feat(embedder): introduce mHuBERT as advanced speech feature extraction backend
for improved cross-language phoneme alignment
refactor(expressions): rename pitd cache dir from "pitd" to "f0"; remove
add_cuda_to_path import in crepe branch; force RMVPE to device="cpu"
docs(readme): restructure ML models section with Pitch Extraction and
mHuBERT Feature Extractor subsections; add CC-BY-NC-SA-4.0 license warning
docs(readme): add Get Model Weights section with HF mirror tip and
separate RMVPE/mHuBERT download instructions
docs(readme): add Breathiness and Voicing to supported params and features list;
add BrecLoader/VoicLoader subgraphs to algorithm workflow diagram;
fold v0.5.0/v0.6.0 version tips into inline bullet text; remove v0.9.1 tag
examples: add demo scripts for mHuBERT embedder and Breathiness/Voicing expressions
- examples/mhubert_embedder_demo.py: minimal usage of mHuBERT feature extraction
- examples/breath_voicing_control.ipynb: interactive tutorial for expression
parameter tuning with visualization and audio preview
Acknowledgements:
- Special thanks to @ma0shu for foundational work on expressive parameter
extraction: ma0shu@a15858d
License Reminder:
- Model weights, pretrained assets, and derived features are distributed under
CC-BY-NC-SA-4.0. Commercial use requires separate authorization.
Please review LICENSE and README.md for full terms.
Codecov Report❌ Patch coverage is
📢 Thoughts on this report? Let us know! |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
…s, mHuBERT backend
feat(expressions): add BrecLoader and VoicLoader for Breathiness and Voicing
extraction; add extract_wav_embeddings() with mHuBERT/PCA support and caching
feat(embedder): introduce mHuBERT as advanced speech feature extraction backend
for improved cross-language phoneme alignment
refactor(expressions): rename pitd cache dir from "pitd" to "f0"; remove
add_cuda_to_path import in crepe branch; force RMVPE to device="cpu"
docs(readme): restructure ML models section with Pitch Extraction and
mHuBERT Feature Extractor subsections; add CC-BY-NC-SA-4.0 license warning
docs(readme): add Get Model Weights section with HF mirror tip and
separate RMVPE/mHuBERT download instructions
docs(readme): add Breathiness and Voicing to supported params and features list;
add BrecLoader/VoicLoader subgraphs to algorithm workflow diagram;
fold v0.5.0/v0.6.0 version tips into inline bullet text; remove v0.9.1 tag
examples: add demo scripts for mHuBERT embedder and Breathiness/Voicing expressions
Acknowledgements:
License Reminder: