Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

evaluation of short sentences #96

Closed
pajkossy opened this issue Sep 18, 2015 · 3 comments
Closed

evaluation of short sentences #96

pajkossy opened this issue Sep 18, 2015 · 3 comments
Assignees

Comments

@pajkossy
Copy link
Collaborator

currently while preparing datasets very short sentences are dropped.
even if training is not possible with them the test data could contain them so that test results are reliable

@pajkossy
Copy link
Collaborator Author

I changed line 138 of word_tagger_dataset.py (if len(word) < 3; continue to if len(word)< 1, continue), when tring to train in the resulted dataset I got the error below
(it is possible to train with the < 2 constraint)

Traceback (most recent call last):
File "hunvec/seqtag/trainer.py", line 123, in
main()
File "hunvec/seqtag/trainer.py", line 119, in main
wt.train()
File "/home/pajkossy/Proj/hunvec/hunvec/seqtag/sequence_tagger.py", line 242, in train
self.algorithm.train(dataset=self.dataset['train'])
File "/home/pajkossy/pylearn2/pylearn2/training_algorithms/sgd.py", line 455, in train
self.sgd_update(*batch)
File "/home/pajkossy/hunvec_env/local/lib/python2.7/site-packages/theano/compile/function_module.py", line 606, in call
storage_map=self.fn.storage_map)
File "/home/pajkossy/hunvec_env/local/lib/python2.7/site-packages/theano/compile/function_module.py", line 595, in call
outputs = self.fn()
File "/home/pajkossy/hunvec_env/local/lib/python2.7/site-packages/theano/scan_module/scan_op.py", line 672, in rval
r = p(n, [x[0] for x in i], o)
File "/home/pajkossy/hunvec_env/local/lib/python2.7/site-packages/theano/scan_module/scan_op.py", line 661, in
self, node)
File "scan_perform.pyx", line 207, in theano.scan_module.scan_perform.perform (/home/pajkossy/.theano/compiledir_Linux-3.16--amd64-x86_64-with-debian-8.2--2.7.9-64/scan_perform/mod.cpp:2172)
NotImplementedError: We didn't implemented yet the case where scan do 0 iteration
Apply node that caused the error: forall_inplace,gpu,scan_fn}(Elemwise{Composite{minimum(maximum(((i0 + i1) - i1), (i2 - i1)), i3)}}.0, GpuSubtensor{int64:int64:int64}.0, GpuIncSubtensor{InplaceSet;:int64:}.0, GpuDimShuffle{1,0}.0)
Inputs types: [TensorType(int64, scalar), CudaNdarrayType(float32, matrix), CudaNdarrayType(float32, matrix), CudaNdarrayType(float32, matrix)]
Inputs shapes: [(), (0, 17), (2, 17), (17, 17)]
Inputs strides: [(), (17, 1), (17, 1), (1, 17)]
Inputs values: [array(0), <CudaNdarray object at 0x7f9a661fb270>, 'not shown', 'not shown']

HINT: Re-running with most Theano optimization disabled could give you a back-trace of when this node was created. This can be done with by setting the Theano flag 'optimizer=fast_compile'. If that does not work, Theano optimizations can be disabled with 'optimizer=None'.
HINT: Use the Theano flag 'exception_verbosity=high' for a debugprint and storage map footprint of this apply node.

@zseder zseder self-assigned this Oct 2, 2015
@zseder
Copy link
Owner

zseder commented Oct 2, 2015

I think Theano/Theano#3276 will solve the issue, so only an update of theano is needed, but still testing...

@zseder
Copy link
Owner

zseder commented Oct 2, 2015

solved in #102

@zseder zseder closed this as completed Oct 2, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants