Divide Input option is skipping files #67

Closed
senderle opened this issue Nov 30, 2017 · 5 comments

@senderle
Owner

senderle commented Nov 30, 2017

As reported in #65. @shawngraham writes:

I went and tried it again, armed with my new knowledge of how it works. In the results, when I opened the metadata.csv, a number of my documents were no longer present; that is to say, no results recorded for them. I had n set for 1000, so I thought perhaps the missing ones were smaller and somehow got folded into the previous 1000-chunk, but no, the missing ones should have been split into three or four chunks at least. So I'm not sure what's going on there... I can't seem to see the commonality between the documents that get dropped.

@senderle
Owner Author

@shawngraham, it also occurs to me that odd characters in filenames can sometimes cause problematic behavior. If possible, could you upload just the topic-metadata.csv file here?
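As a rough way to test the filename theory, something like the sketch below could list which input documents never show up in the results and flag names containing unusual characters. It is only an illustration: `missing_documents` and `has_odd_characters` are hypothetical helpers, not part of the tool, and because the column layout of the metadata file is not shown in this thread, the functions take plain lists of names that you would extract yourself.

```python
import unicodedata


def missing_documents(input_names, recorded_names):
    """Return the input filenames that never appear among the recorded names.

    Names are compared after Unicode NFC normalization, so that the same
    accented character stored in composed or decomposed form still matches.
    """
    def norm(name):
        return unicodedata.normalize("NFC", name)

    recorded = {norm(name) for name in recorded_names}
    return [name for name in input_names if norm(name) not in recorded]


def has_odd_characters(name):
    """True if the name has non-ASCII characters or surrounding whitespace."""
    return (not name.isascii()) or (name != name.strip())


# Hypothetical example data; replace with the real input and recorded names.
inputs = ["a.txt", "café.txt", "b.txt"]
recorded = ["a.txt"]

for name in missing_documents(inputs, recorded):
    flag = "odd characters" if has_odd_characters(name) else "plain name"
    print(f"{name}: missing from results ({flag})")
```

If the dropped documents all turn out to be flagged, that would point toward filename handling; if not, the cause is more likely elsewhere.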

@senderle
Owner Author

@shawngraham, are you still seeing this problem? It would be great to be able to reproduce it so I can understand how to fix it. No worries if not, but if you have any thoughts, let me know.

@shawngraham

Hi, I'm sorry - this fell off the radar because I got swamped with other things. I'll try to return to it later this month once the smoke settles!

@senderle
Owner Author

No problem! And thanks!

@senderle
Owner Author

senderle commented Mar 1, 2021

Unfortunately I think I have to close this as unable-to-reproduce. Will reopen if it comes up again.

senderle closed this as completed Mar 1, 2021