-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use geographic binary relation info/context #5
Comments
How many combinations of "X, etc Y" are there?: I'm guessing a dictionary-based approach would be fairly effective. Presumably the phrases follow the usual rank-size distribution. We don't really need a parse since we've got the commas, and we can automatically generate the candidate phrases with a simple regex search (maybe with some simple markup first. But not a full parse). Alternatively, try to generate a conditional random field model or something similar to catch these. But I'd try getting the candidate phrases first and see how many we've got. |
Maryam here at UTD has already tried the CRF approach to get at subnational Spoke with her and Andy about this today. He is making other modifications Further issue is that we need more labelled training data. Talking with C. On Tue, Jun 28, 2016 at 4:39 PM, Philip Schrodt notifications@github.com
Patrick T. Brandt |
Lots of sentences with geographic information are structured like "X, a town 30 km south of Y", or "X, a neighborhood in Y". In both cases, we want to:
Neither MITIE's binary relation detection nor Freebase have this. We could use parse info, but that would be tricky and require lots of labeled examples. Thoughts?
The text was updated successfully, but these errors were encountered: