avoid indexing "email" field type in Solr #759

Closed
pdurbin opened this issue Jul 22, 2014 · 2 comments
Labels
Type: Bug a defect

Comments

@pdurbin
Member

pdurbin commented Jul 22, 2014

The email field type was added in #494 but we want to special case indexing of it.

@pdurbin pdurbin changed the title from "Special case indexing of field type "email"" to "avoid indexing "email" field type in Solr" Jul 22, 2014
@pdurbin pdurbin added this to the Beta 3 - Dataverse 4.0 milestone Jul 22, 2014
pdurbin added a commit that referenced this issue Jul 22, 2014
@pdurbin
Member Author

pdurbin commented Jul 22, 2014

In general our mantra has been "make all dataset fields searchable" but we are deciding against this for the "email" field type since we don't want to risk exposing email addresses.

Now "email" field types such as distributorContact are no longer indexed into Solr. Passing to QA. Please note that you can use scripts/search/query to query Solr directly. Also, in order to remove email addresses for existing datasets from Solr, you'll need to run "index all".
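For anyone following along, here is a rough sketch of the idea, not the actual commit. It assumes each dataset field carries a name, a field type, and a value, and it skips anything typed "email" when collecting what gets sent to Solr. The `EmailFieldFilter` class, its nested `Field` class, and `toIndexableFields` are hypothetical names made up for illustration. They are not Dataverse or SolrJ APIs.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: hypothetical names, not the Dataverse indexing code.
final class EmailFieldFilter {

    // Field type that should never reach the search index.
    static final String EMAIL_TYPE = "email";

    // Hypothetical stand-in for a dataset field (name, type, value).
    static final class Field {
        final String name;
        final String fieldType;
        final String value;

        Field(String name, String fieldType, String value) {
            this.name = name;
            this.fieldType = fieldType;
            this.value = value;
        }
    }

    // Collects name -> value pairs for indexing, skipping every "email" field.
    static Map<String, String> toIndexableFields(List<Field> fields) {
        Map<String, String> doc = new LinkedHashMap<>();
        for (Field f : fields) {
            if (EMAIL_TYPE.equalsIgnoreCase(f.fieldType)) {
                continue; // do not risk exposing email addresses via search
            }
            doc.put(f.name, f.value);
        }
        return doc;
    }

    public static void main(String[] args) {
        List<Field> fields = Arrays.asList(
            new Field("title", "text", "Sample Dataset"),
            new Field("distributorContact", "email", "someone@example.org"));
        // Only the non-email field names remain in the map.
        System.out.println(toIndexableFields(fields).keySet());
    }
}
```

A filter like this only affects documents as they are (re)indexed, which is why existing datasets need an "index all" before their email addresses disappear from Solr.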

I didn't actually remove the email fields from schema.xml because we'll be updating it soon enough for #754. Also, it's not clear from #747 if "the list of potential collaborators should be searchable" means a Solr or database search.

@kcondon
Contributor

kcondon commented Jul 23, 2014

Confirmed, both author contact email and distributor contact are no longer searchable after index all.
