Add duty station search improvements: 167501188 #2760

lynzt · 2019-10-02T13:34:45Z

Description

Make searching for a duty station more flexible... allow "fuzzy" matching.

Use trigram matching when letting a SM or an Office user search for duty stations. Calculate the similarity between the "search query" and the duty stations and return the top 7 results.

Reviewer Notes

Alternate spellings/abbrs we want to catch are in the pivotal story linked as a spreadsheet.

Setup

Add any steps or code to run in this section to help others prepare to run your code:

make server_test
make server_run
make client_run

Log into SM or Office app and do a duty station search using "alternate" duty station names... or try fuzzy searching for things:

ft bragg => fort bragg
joint base => recs for JB *
etc

Code Review Verification Steps

The requirements listed in
Querying the Database Safely
have been satisfied.
Any new migrations/schema changes:
- Follow our guidelines for zero-downtime deploys (see Zero-Downtime Deploys)
Tested in the Experimental environment (for changes to containers, app startup, or connection to data stores)
User facing changes have been reviewed by design.
Request review from a member of a different team.
Have the Pivotal acceptance criteria been met for this change?

References

Pivotal story for this change
this article explains more about the approach used.

Screenshots

codecov · 2019-10-02T18:04:07Z

Codecov Report

Merging #2760 into master will increase coverage by 0.1%.
The diff coverage is 100%.

@@           Coverage Diff            @@
##           master   #2760     +/-   ##
========================================
+ Coverage    57.6%   57.6%   +0.1%     
========================================
  Files         278     276      -2     
  Lines       12567   12550     -17     
========================================
- Hits         7233    7226      -7     
+ Misses       4586    4578      -8     
+ Partials      748     746      -2

Impacted Files	Coverage Δ
pkg/handlers/internalapi/transportation_offices.go	`100% <100%> (ø)`	⬆️
pkg/handlers/internalapi/duty_stations.go	`87% <100%> (-0.5%)`	⬇️
pkg/models/duty_station.go	`50.7% <100%> (+15.7%)`	⬆️
pkg/handlers/adminapi/api.go	`0% <0%> (ø)`	⬆️
pkg/services/office_user/office_user_fetcher.go
pkg/services/office_user/office_user_updater.go
...g/services/office_user/office_user_list_fetcher.go
pkg/handlers/adminapi/admin_users.go
pkg/services/office_user/office_user_creator.go
pkg/services/admin_user/admin_user_list_fetcher.go
... and 5 more

lynzt · 2019-10-02T19:30:18Z

There's another pivotal story that deals with the fact we're doing an Eager("address") when looking up duty stations for the user to choose from...

chrisgilmerproj · 2019-10-02T20:45:25Z

migrations/20190924175530_add_trigram_matching_extension.up.sql

@@ -0,0 +1,3 @@
+CREATE EXTENSION pg_trgm;
+
+CREATE INDEX duty_stations_name_trgm_idx ON duty_stations USING gin(name gin_trgm_ops);


chrisgilmerproj

Awesome work! Can we test this on experimental with the load testing framework? I can show you how.

Ryan-Koch · 2019-10-03T18:26:09Z

pkg/models/duty_station.go

+with names as (
+(select id as duty_station_id, name, similarity(name, $1) as sim
+from duty_stations
+where similarity(name, $1) > 0.03


We might be able to get away with using the % operator with these similarity calls, if we set the pg_trgm.similarity_threshold to the 0.03 value we're using (default is 0.3, so definitely would need to override in that case). Since the operator returns a true/false based on the threshold as opposed to the value, I suspect it might run a bit faster (I wouldn't hold the PR up over this, just something to think about).

I didn't use % in case we want to tweek the similarity threashold and figured it clearer what was happening if things were in the query... and setting the threashold didn't add much/any speed gains since we're not dealing w/ a ton of rows.

However, if/when we add additional duty stations - we should def. keep this in mind for query optimization.

Ryan-Koch

This looks really great! I left a minor comment about the query, but otherwise this seems to do what it's supposed to. I think after the load testing that was mentioned in another comment it's good to go. 🚀

lynzt · 2019-10-04T15:00:49Z

@chrisgilmerproj results from load testing.

chrisgilmerproj

🚀 - Awesome work. Thanks for doing the load testing check on this endpoint because its caused us so much trouble in the past.

Also, don't forget to undo the changes to experimental :)

lynzt · 2019-10-04T16:23:52Z

@chrisgilmerproj do you mean pushing to experimental again to reset the auth variable?
I've already had @rdhariwal add the rate-limiting back in the other repo...

chrisgilmerproj · 2019-10-04T17:18:56Z

@chrisgilmerproj do you mean pushing to experimental again to reset the auth variable?
I've already had @rdhariwal add the rate-limiting back in the other repo...

Yep! You'll want to turn off that variable. And then remove the stuff from this PR.

lynzt added the ttv label Oct 2, 2019

lynzt force-pushed the lt-add-search-duty-station-improvements-167501188 branch from c6302cb to 143a816 Compare October 2, 2019 17:35

lynzt marked this pull request as ready for review October 2, 2019 20:01

lynzt requested review from chrisgilmerproj, reggieriser, jim, Ryan-Koch, kahlouie and blvrd October 2, 2019 20:01

chrisgilmerproj reviewed Oct 2, 2019

View reviewed changes

chrisgilmerproj suggested changes Oct 2, 2019

View reviewed changes

Ryan-Koch reviewed Oct 3, 2019

View reviewed changes

Ryan-Koch approved these changes Oct 3, 2019

View reviewed changes

chrisgilmerproj approved these changes Oct 4, 2019

View reviewed changes

lynzt added 11 commits October 4, 2019 20:06

update tests for fuzzy searching

8611810

add duty_station_names table and data

94b8cdf

modify how to search for duty stations - interm update

123bf78

add skipped file

dfb83bd

add alternate names, and tests for duty staton searching

b27f835

show list of duty stations to user

216a036

dont use empty eager (only need address)

106fddb

optimize query - trgm index and limit

391ef16

fix broken tests, limit duty stations returned to 7

5794aa0

updates to make swagger happy for load testing

f51771b

deploy to experimental - load testing

d2b998e

lynzt added 3 commits October 4, 2019 20:06

revert DEVLOCAL_AUTH in experimental

368a005

revert deploy to experimental

3330b64

add test for 0 rows returned - duty station search

cde02be

lynzt force-pushed the lt-add-search-duty-station-improvements-167501188 branch from a2ac006 to cde02be Compare October 4, 2019 20:49

lynzt merged commit 47442ea into master Oct 4, 2019

lynzt deleted the lt-add-search-duty-station-improvements-167501188 branch December 18, 2019 19:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add duty station search improvements: 167501188 #2760

Add duty station search improvements: 167501188 #2760

lynzt commented Oct 2, 2019 •

edited

codecov bot commented Oct 2, 2019 •

edited

lynzt commented Oct 2, 2019

chrisgilmerproj Oct 2, 2019

chrisgilmerproj left a comment

Ryan-Koch Oct 3, 2019

lynzt Oct 4, 2019

Ryan-Koch left a comment

lynzt commented Oct 4, 2019

chrisgilmerproj left a comment

lynzt commented Oct 4, 2019

chrisgilmerproj commented Oct 4, 2019

		@@ -0,0 +1,3 @@
		CREATE EXTENSION pg_trgm;

		CREATE INDEX duty_stations_name_trgm_idx ON duty_stations USING gin(name gin_trgm_ops);

Add duty station search improvements: 167501188 #2760

Add duty station search improvements: 167501188 #2760

Conversation

lynzt commented Oct 2, 2019 • edited

Description

Reviewer Notes

Setup

Code Review Verification Steps

References

Screenshots

codecov bot commented Oct 2, 2019 • edited

Codecov Report

lynzt commented Oct 2, 2019

chrisgilmerproj Oct 2, 2019

Choose a reason for hiding this comment

chrisgilmerproj left a comment

Choose a reason for hiding this comment

Ryan-Koch Oct 3, 2019

Choose a reason for hiding this comment

lynzt Oct 4, 2019

Choose a reason for hiding this comment

Ryan-Koch left a comment

Choose a reason for hiding this comment

lynzt commented Oct 4, 2019

chrisgilmerproj left a comment

Choose a reason for hiding this comment

lynzt commented Oct 4, 2019

chrisgilmerproj commented Oct 4, 2019

lynzt commented Oct 2, 2019 •

edited

codecov bot commented Oct 2, 2019 •

edited