Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix download in LibriCSS recipe #1148

Merged
merged 53 commits into from
Sep 14, 2023
Merged
Changes from all commits
Commits
Show all changes
53 commits
Select commit Hold shift + click to select a range
ce0f5c1
add transform attribute for MixedCut
desh2608 Apr 20, 2023
ab18682
add mix_first option in normalize_loudness
desh2608 Apr 20, 2023
e4bca74
handle the case when mix is called on MixedCut with existing transforms
desh2608 Apr 20, 2023
71a9236
add test for mixing with transformed MixedCut
desh2608 Apr 20, 2023
2e54646
enhancements and bug fixes
desh2608 May 16, 2023
db37a75
small changes in some cutset methods
desh2608 May 16, 2023
7b59ecd
small fix in error message
desh2608 May 16, 2023
a64727a
return word alignments from ami recipe
desh2608 May 17, 2023
850ce2c
add word alignments for ICSI
desh2608 May 18, 2023
4b39c6f
remove unwanted whitespace
desh2608 May 18, 2023
3c16b90
fix IHM preparation
desh2608 May 18, 2023
9921575
remove words with zero or negative duration
desh2608 May 18, 2023
dba413f
ensure word alignments respect segment boundary
desh2608 May 18, 2023
12be424
add save-to-wav option for icsi
desh2608 May 22, 2023
c4b957d
add test for mixing cut with recording
desh2608 May 22, 2023
04ca4aa
Merge branch 'ami_icsi'
desh2608 May 22, 2023
fef3aa3
Merge branch 'cuts'
desh2608 May 22, 2023
0de443e
Merge branch 'mixed_cut_transform'
desh2608 May 22, 2023
80619bb
Merge branch 'master' of https://github.com/lhotse-speech/lhotse
desh2608 Jun 8, 2023
752be69
style fix
desh2608 Jun 8, 2023
5bd483d
Merge branch 'master' of https://github.com/lhotse-speech/lhotse
desh2608 Jun 11, 2023
68f3ffd
Merge branch 'loudness_fix'
desh2608 Jun 11, 2023
2171d7e
add data prep for voxpopuli
desh2608 Jun 12, 2023
df32e5c
Merge branch 'master' of https://github.com/lhotse-speech/lhotse
desh2608 Jun 15, 2023
67c9223
Merge branch 'recipe/voxpopuli'
desh2608 Jun 28, 2023
c5efbd7
small changes in recipes
desh2608 Jul 30, 2023
fa00d7c
merge upstream
desh2608 Jul 30, 2023
826dbc2
changes for max segment duration
desh2608 Aug 3, 2023
69cba45
Merge branch 'master' of https://github.com/lhotse-speech/lhotse into…
desh2608 Aug 3, 2023
e297c13
remove extra code
desh2608 Aug 3, 2023
4711576
add alignment scores from CTM
desh2608 Aug 3, 2023
128374f
minor change
desh2608 Aug 3, 2023
82157d9
merge trunk
desh2608 Aug 7, 2023
5b51cc4
made suggested changes
desh2608 Aug 7, 2023
998c245
Merge branch 'master' of https://github.com/lhotse-speech/lhotse into…
desh2608 Aug 11, 2023
a347447
apply change to multi custom merge func
desh2608 Aug 11, 2023
b9c1704
remove old code
desh2608 Aug 11, 2023
22d18d6
fix failing tests
desh2608 Aug 11, 2023
958a8ec
add tests for trim to alignments with max segment duration
desh2608 Aug 11, 2023
90f2fe0
add tests for merge supervisions
desh2608 Aug 11, 2023
917d4f8
Merge branch 'master' of https://github.com/lhotse-speech/lhotse into…
desh2608 Aug 11, 2023
0e36e3f
fix merge conflicts
desh2608 Aug 11, 2023
399834b
Merge branch 'master' of https://github.com/lhotse-speech/lhotse
desh2608 Aug 23, 2023
5ecc856
fix bug in eval2000
desh2608 Aug 23, 2023
293082f
remove storing unnecessary things
desh2608 Aug 23, 2023
71ed2e1
Merge branch 'master' of https://github.com/lhotse-speech/lhotse into…
desh2608 Aug 23, 2023
c4fe1c9
add fix_manifests for all recipes
desh2608 Aug 23, 2023
910f540
Merge branch 'bug/eval2000'
desh2608 Aug 23, 2023
b8af9ff
Merge branch 'sups/alignment'
desh2608 Aug 23, 2023
f5b643e
Merge branch 'sups/ctm'
desh2608 Aug 23, 2023
b82e2d0
Merge branch 'master' of https://github.com/lhotse-speech/lhotse
desh2608 Sep 7, 2023
03470ee
Merge branch 'master' of https://github.com/lhotse-speech/lhotse
desh2608 Sep 14, 2023
97d4443
fix unzipping
desh2608 Sep 14, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 6 additions & 1 deletion lhotse/recipes/libricss.py
Original file line number Diff line number Diff line change
Expand Up @@ -10,10 +10,13 @@
import json
import logging
import subprocess
import zipfile
from collections import defaultdict
from pathlib import Path
from typing import Dict, Union

from tqdm import tqdm

from lhotse import (
CutSet,
RecordingSet,
Expand Down Expand Up @@ -120,7 +123,9 @@ def download_libricss(target_dir: Pathlike, force_download: bool = False) -> Pat
# Extract the zipped file
if not corpus_dir.exists() or force_download:
logging.info(f"Extracting {corpus_zip} to {target_dir}")
corpus_zip.unzip(target_dir)
with zipfile.ZipFile(corpus_zip, "r") as corpus_zip:
for member in tqdm(corpus_zip.infolist(), desc="Extracting"):
corpus_zip.extract(member, target_dir)

return target_dir

Expand Down
Loading