Segmentation fault #24

Closed
cffan opened this issue Sep 26, 2014 · 4 comments

cffan commented Sep 26, 2014

Hi, I got a segmentation fault when running "lmplz -o 3 < text > arpa" on a corpus. The stack trace is pasted below. lmplz runs fine on several other corpora. The only thing special about this corpus is that it contains a lot of duplicated sentences; I don't know whether that could cause the segmentation fault.

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 0x7fffca054700 (LWP 9216)]
0x00000000004856ca in lm::builder::NGram::IsMarked (this=0x7fffca053c20) at ./lm/builder/ngram.hh:77
77 return Value().count >> (sizeof(Value().count) * 8 - 1);
(gdb) bt
#0 0x00000000004856ca in lm::builder::NGram::IsMarked (this=0x7fffca053c20) at ./lm/builder/ngram.hh:77
#1 0x000000000048e12a in lm::builder::NGram::CutoffCount (this=0x7fffca053c20) at ./lm/builder/ngram.hh:93
#2 0x000000000048afa6 in lm::builder::(anonymous namespace)::PruneNGramStream::operator++ (this=0x7fffca053c20) at /home/cfan/tools/kenlm/lm/builder/initial_probabilities.cc:74
#3 0x000000000048bb40 in lm::builder::(anonymous namespace)::MergeRight::Run (this=0x95bc78, primary=...) at /home/cfan/tools/kenlm/lm/builder/initial_probabilities.cc:238

#4 0x000000000048df48 in util::stream::Thread::operator()<util::stream::ChainPosition, lm::builder::{anonymous}::MergeRight>(const util::stream::ChainPosition &, lm::builder::(anonymous namespace)::MergeRight &) (this=0x928170, position=..., worker=...) at ./util/stream/chain.hh:77
#5 0x000000000048ddf1 in boost::_bi::list2<boost::_bi::value<util::stream::ChainPosition>, boost::_bi::value<lm::builder::{anonymous}::MergeRight> >::operator()<boost::reference_wrapper<util::stream::Thread>, boost::_bi::list0>(boost::_bi::type<void>, boost::reference_wrapper<util::stream::Thread> &, boost::_bi::list0 &, int) (this=0x95bc40, f=..., a=...) at /usr/include/boost/bind/bind.hpp:313
#6 0x000000000048dccf in boost::_bi::bind_t<void, boost::reference_wrapper<util::stream::Thread>, boost::_bi::list2<boost::_bi::value<util::stream::ChainPosition>, boost::_bi::value<lm::builder::{anonymous}::MergeRight> > >::operator()(void) (this=0x95bc38) at /usr/include/boost/bind/bind_template.hpp:20
#7 0x000000000048dc34 in boost::detail::thread_data<boost::_bi::bind_t<void, boost::reference_wrapper<util::stream::Thread>, boost::_bi::list2<boost::_bi::value<util::stream::ChainPosition>, boost::_bi::value<lm::builder::{anonymous}::MergeRight> > > >::run(void) (this=0x95bab0) at /usr/include/boost/thread/detail/thread.hpp:61
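
For anyone reading the trace: the expression at ngram.hh:77 tests the top bit of a count field, and frames #0 through #2 all share the same `this`. A shift cannot fault by itself, so the SIGSEGV most likely comes from reading the memory behind `Value()`, not from the bit test. Below is a minimal sketch of that top-bit marker idiom, for illustration only. `Payload`, `Mark`, and `UnmarkedCount` are hypothetical names, and the 64-bit count width is an assumption; the real layout in lm/builder/ngram.hh may differ.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for whatever Value() returns in the trace.
// Assumption: the count is an unsigned 64-bit integer.
struct Payload {
  std::uint64_t count;
};

// Mask selecting only the most significant bit of the count.
constexpr std::uint64_t kMarkBit =
    std::uint64_t{1} << (sizeof(std::uint64_t) * 8 - 1);

// Same shape as the expression on line 77 of the trace: shift the count
// right until only its top bit remains, then test it.
inline bool IsMarked(const Payload &p) {
  return (p.count >> (sizeof(p.count) * 8 - 1)) != 0;
}

// Hypothetical helper: set the marker bit, leaving the low bits untouched.
inline void Mark(Payload &p) {
  assert(!IsMarked(p) && "count already uses the top bit");
  p.count |= kMarkBit;
}

// Hypothetical helper: the count with the marker bit stripped off.
inline std::uint64_t UnmarkedCount(const Payload &p) {
  return p.count & ~kMarkBit;
}
```

With this layout, a payload holding a count of 5 reports unmarked, reports marked after `Mark`, and still yields 5 from `UnmarkedCount`. None of this can crash on valid memory, which is consistent with the read going through a bad pointer.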
kpu self-assigned this Sep 27, 2014

kpu (Owner) commented Sep 27, 2014

Hmmm, I've heard rumors of a problem with duplicated corpora. Is there a chance I can get a copy of the corpus? Right now it looks like a classic buffer overrun, but it's unclear whose fault that is.

cffan (Author) commented Sep 29, 2014

I've emailed you the link to the corpus. Thanks!

kpu closed this as completed in 57a1dd0 Oct 1, 2014

kpu (Owner) commented Oct 1, 2014

Paging @emjotde: this is probably the same segfault that @phikoehn saw.

cffan (Author) commented Oct 11, 2014

Thanks!
