Skip to content

Commit

Permalink
add chars.txt and clean text scripts
Browse files Browse the repository at this point in the history
  • Loading branch information
zxie committed Jun 18, 2015
1 parent 6abb41e commit 4e5d2ab
Show file tree
Hide file tree
Showing 2 changed files with 44 additions and 0 deletions.
34 changes: 34 additions & 0 deletions ctc_fast/swbd-utils/chars.txt
@@ -0,0 +1,34 @@
[vocalized-noise] 1
[laughter] 2
' 3
& 4
[space] 5
/ 6
[noise] 7
_ 8
a 9
c 10
b 11
e 12
d 13
g 14
f 15
i 16
h 17
k 18
j 19
m 20
l 21
o 22
n 23
q 24
p 25
s 26
r 27
u 28
t 29
w 30
v 31
y 32
x 33
z 34
10 changes: 10 additions & 0 deletions ctc_fast/swbd-utils/clean_text_ctc.sh
@@ -0,0 +1,10 @@
#!/bin/bash

for x in 'train' 'dev' 'eval2000'
do
text=data/$x/text
ctctext=data/$x/text_ctc
cp $text $ctctext
sed -i 's/_1/ /g' $ctctext
done

0 comments on commit 4e5d2ab

Please sign in to comment.