Skip to content
This repository has been archived by the owner on Feb 9, 2022. It is now read-only.

Search not detecting lvl0 selector on some pages #261

Closed
tomgenoni opened this issue Nov 17, 2017 · 3 comments
Closed

Search not detecting lvl0 selector on some pages #261

tomgenoni opened this issue Nov 17, 2017 · 3 comments

Comments

@tomgenoni
Copy link
Contributor

tomgenoni commented Nov 17, 2017

For https://thumbprint.thumbtack.com our lvl0 selector looks like:

"lvl0": {
 "selector": "//*[@data-id='header__links']//a[@data-active='true']",
 "type": "xpath",
  "default_value": "Documentation"
},

https://github.com/algolia/docsearch-configs/blob/master/configs/thumbprint.json

When I search for the page title "Using Thumbprint in Sass" — https://thumbprint.thumbtack.com/guide/creating-pages/ — the search result correctly categorizes it under "Guide".

But if I search for the pages titles of the following pages:

It categorizes them under "Documentation" instead of "Guide".

In this screenshot "https://thumbprint.thumbtack.com/guide/utility-classes/" is among the results, note that it's categorized under "Documentation"

screen shot 2017-11-17 at 10 16 07 am

I've confirmed the lvl0 xpath works on those pages so am not sure what would have caused it to fail. Maybe your crawler searched cached pages that didn't have this selector available?

@s-pace
Copy link
Contributor

s-pace commented Nov 21, 2017

Hi Tom,
Thank you for the feedback.
The crawler isn't using any cache. We are doing a total new crawling each 24h. I have checked it and it looks like this issue is solve.
This issue should be because of the time needed to have the crawling done.
Let us know if you need anything.

@tomgenoni
Copy link
Contributor Author

tomgenoni commented Nov 21, 2017

Yes, the issue appears to have resolved itself. What I meant by cache is that it could have been a problem with our site. Our Cloudfront cache (which we'll eventually disable) takes about 24 hrs to clear so it's possible the Algolia search script was seeing older versions of some of our pages.

Thank you for your help! We have a better understanding of how the search config works now.

@s-pace
Copy link
Contributor

s-pace commented Nov 22, 2017

Thank you for your the amendment. Makes sense.
Glad to help.
Closing it since we are done with it.

@s-pace s-pace closed this as completed Nov 22, 2017
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants