-
Notifications
You must be signed in to change notification settings - Fork 5.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tedlium chain config question #914
Comments
Sounds like the --speed-perturb thing would cause a crash. Let me check it out. Regarding ali_dir=tri3_ali_sp, Vijay actually advised me to to use non-speed-perturbed alignments for tree building. I can't rembmer if $ali_dir was used anywhere else. The repo version was the last version as of the first submission. We got about 1% absolute WER decrease for the camera ready submission with some data cleanup scripts by Vimal Manohar, but that hasn't made it into master. I'd recommend waiting on Dan's changes to tedlium to explore the effect of data cleanup for tedlium 2 |
okay, thanks. Just out of curiosity what was the swbd baseline script you took to start with ? 'cause now there are so many. |
Initially, it was based on swbd/s5c/local/chain/run_tdnn_2y.sh. However, this used the "jesus" nonlinearity, which we later abandoned. Since, then it organically developed, partially with hints from Vijay and Dan. Are you wondering anything in particular? |
Nothing particular, just making sure it was the best known parameters. Actually it has 7 layers right now with the current splices indexes. (run_tdnn_2y.sh has only 5). Anyway on tedlium2 it's giving me the best results right now (with 7.98M parameters vs 6.39 for nnet3) numbers for nnet3 [13.2% 12.1% 12.2% 10.9%] |
Cool. Hopefully the chain models will be even better once we get the On Mon, Jul 25, 2016 at 4:44 AM, vince62s notifications@github.com wrote:
|
I would like to try the left-biphone config with Tedlium. If my understanding is ok it is just a matter of changing the following line |
You need to do more. The key changes are, firstly if you use a shared tree-dir you should change
to On Fri, Aug 5, 2016 at 4:20 AM, vince62s notifications@github.com wrote:
|
ok left-biphone results very close to "regular" chain. %WER 12.5 | 507 17792 | 89.6 7.2 3.2 2.2 12.5 84.8 | -0.077 | exp/chain/tdnn/decode_dev/score_9_0.0/ctm.filt.filt.sys |
OK- it's hard to interpret these without seeing the baseline; but anyway, On Sun, Aug 7, 2016 at 6:47 AM, vince62s notifications@github.com wrote:
|
it was 5 comments above last one, same thread. |
OK thanks. |
@danpovey @vijayaditya , I noticed that the splicing indexes copied over from the AMI s5b recipes are different from what was usually taken. it has 2 final layers 0 0. Is there any improvements using this splicing versus previous ones ? |
IIRC TDNN in AMI recipe has more layers as I wanted to create a comparison --Vijay On Tue, Aug 9, 2016 at 5:43 PM, vince62s notifications@github.com wrote:
|
I just copied it over. @vince62s-- I haven't really done any tuning on Dan On Tue, Aug 9, 2016 at 2:49 PM, Vijayaditya Peddinti <
|
I am running the new baseline now to see if I match the results first. touch $dir/egs/.nodelete # keep egs around when that run dies.otherwise it hangs because before first run this dir does not exist. when done I will try various set ups |
thanks, fixing that bug. On Fri, Aug 12, 2016 at 7:11 AM, vince62s notifications@github.com wrote:
|
hmm weird, results chain no cleanup are different from yours: yours are: |
It looks like, for, you, the LM rescoring makes much less difference. On Fri, Aug 12, 2016 at 3:05 PM, vince62s notifications@github.com wrote:
|
Yes I rebuilt everything |
Actually i have to check the lexicon i took... |
I think you misread, my rescoring is -0.5% and -0.6% when yours is -0.4% and -0.3%. |
I think also that method is different. I guess you trained on cleaned data and decoded first cleaned data and then just decoded non cleaned data on pre trained (on cleaned) since I read this: my run was on non cleaned data all the way through, including training. |
Firstly, ignore the --stage, that shouldn't have been there. I didn't do The runs aren't exactly replicable anyway, but if your LMs and lexicons are Dan On Sat, Aug 13, 2016 at 1:57 AM, vince62s notifications@github.com wrote:
|
ok but I want to replicate your results first then. |
That should take you most of the way there-- however, you should verify The training of the models wouldn't be exactly the same, but it should be Dan On Sat, Aug 13, 2016 at 1:31 PM, vince62s notifications@github.com wrote:
|
very different results so re-running from scratch, I want a clean start. |
my results, from scratch (no cleanup): |
If the num-jobs-nnet are different, we don't expect the WERs to be the same. Dan On Tue, Aug 16, 2016 at 6:17 AM, vince62s notifications@github.com wrote:
|
When running the chain config script for Tedlium, I saw that run_tdnn.sh and run_ivector_common.sh are not in sync since the former call the latter with --speed-perturb true which is not defined the repo version.
Also Dan fixed this one ali_dir=tri3_ali_sp.
Now my question is: is the repo version of the Chain script the last version related to the paper, and subsequently is it the best config according to the various testing done in the SWBD repo ?
thanks.
Vincent
The text was updated successfully, but these errors were encountered: