[f] 241 - Update the get_snippets query by giahung68 · Pull Request #11 · PublicDataWorks/verdad

giahung68 · 2024-12-06T03:53:12Z

Summary by CodeRabbit

New Features
- Introduced full-text search capabilities for the snippets table, supporting both English and Spanish.
- Added a search parameter to the get_snippets function, allowing users to filter snippets based on a search term.
Bug Fixes
- Improved filtering logic in the get_snippets function for enhanced snippet retrieval.

coderabbitai · 2024-12-06T03:53:18Z

Warning

Rate limit exceeded

@giahung68 has exceeded the limit for the number of commits or files that can be reviewed per hour. Please wait 9 minutes and 19 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

📥 Commits

Reviewing files that changed from the base of the PR and between 917bfe8 and 92bf373.

Walkthrough

The changes introduce new full-text search indexes for the snippets table in the SQL file full_text_search_index.sql, targeting English and Spanish text fields such as title, summary, and explanation. Additionally, the SQL function get_snippets_order_by is renamed to get_snippets in get_snippets_function.sql, with an added parameter for search functionality that filters snippets based on a provided search term while maintaining existing filtering and pagination logic.

Changes

File Path	Change Summary
supabase/database/sql/full_text_search_index.sql	Added multiple indexes for full-text search on `snippets` table for English and Spanish fields: `title`, `summary`, `explanation`.
supabase/database/sql/get_snippets_function.sql	Renamed function from `get_snippets_order_by` to `get_snippets`, added `p_search_term` parameter for filtering snippets based on search term.

Poem

In the garden where snippets bloom,
New indexes rise, dispelling the gloom.
With search terms added, we hop with delight,
Finding our treasures, oh what a sight!
A dance of data, both swift and bright,
In English and Spanish, our queries take flight! 🐇✨

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (2)

supabase/database/sql/full_text_search_index.sql (1)

1-11: Consider index creation performance impact

The creation of multiple full-text search indexes might temporarily impact database performance. Consider:

Creating indexes during off-peak hours

Creating indexes concurrently if possible

Monitoring disk space as indexes will increase storage usage

The index structure looks good and covers all necessary fields for multilingual search.
supabase/database/sql/get_snippets_function.sql (1)
4-4: Consider adding parameter validation

The new p_search_term parameter should be validated to prevent potential issues with malformed search queries.
CREATE OR REPLACE FUNCTION get_snippets (
    p_language text,
    p_filter jsonb,
    page INTEGER,
    page_size INTEGER,
    p_order_by text,
    p_search_term text
) RETURNS jsonb SECURITY DEFINER AS $$
DECLARE
    current_user_id UUID;
    result jsonb;
    total_count INTEGER;
    total_pages INTEGER;
    user_roles TEXT[];
    user_is_admin BOOLEAN;
BEGIN
+   -- Validate search term
+   IF p_search_term IS NULL THEN
+       p_search_term := '';
+   END IF;
+   -- Prevent extremely long search terms
+   IF length(p_search_term) > 1000 THEN
+       RAISE EXCEPTION 'Search term too long';
+   END IF;

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 760e43c and e34fe8e.

📒 Files selected for processing (2)

supabase/database/sql/full_text_search_index.sql (1 hunks)
supabase/database/sql/get_snippets_function.sql (2 hunks)

🔇 Additional comments (2)

supabase/database/sql/full_text_search_index.sql (1)

1-11: Verify PGroonga extension availability

The indexes use the PGroonga extension for full-text search. Ensure the extension is available in your Supabase environment.

✅ Verification successful

PGroonga extension is properly configured

The PGroonga extension is already set up in the migration file 20241029135348_remote_schema.sql with the correct schema. The full-text search indexes can be safely created.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check if PGroonga extension is available
echo "SELECT EXISTS (
    SELECT 1 
    FROM pg_available_extensions 
    WHERE name = 'pgroonga'
);" | psql

Length of output: 211

Script:

#!/bin/bash
# Let's check if PGroonga extension is mentioned in migration files or SQL setup
rg -i "CREATE EXTENSION.*pgroonga|ENABLE.*pgroonga" --type sql

# Also check for any documentation or setup instructions
fd -e md -e txt | xargs rg -i "pgroonga"

Length of output: 226

supabase/database/sql/get_snippets_function.sql (1)

254-263: Verify search functionality

Please ensure that:

The PGroonga &@ operator works as expected with the created indexes
Search works correctly for both English and Spanish content
Performance is acceptable with large datasets

✅ Verification successful

Search functionality implementation appears correct and well-indexed

The codebase analysis shows:

PGroonga indexes are properly created for all searchable fields in both English and Spanish
Indexes cover all JSON fields used in the search query (title, summary, context, explanation)
The &@ operator is used consistently with the indexed fields
The search implementation includes proper concatenation of English and Spanish content for comprehensive bilingual search

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Test search functionality with sample queries
echo "
SELECT COUNT(*)
FROM snippets
WHERE (title ->> 'english') &@ 'test'
  OR (title ->> 'spanish') &@ 'test';

EXPLAIN ANALYZE
SELECT id, title
FROM snippets
WHERE (title ->> 'english') &@ 'test'
  OR (title ->> 'spanish') &@ 'test'
LIMIT 10;
" | psql

Length of output: 400

Script:

#!/bin/bash
# Let's check for PGroonga indexes and search-related configurations in SQL files
fd -e sql -x cat {} | grep -A 5 -B 5 'CREATE.*INDEX.*pgroonga'

# Also check for any PGroonga-related configurations or extensions
fd -e sql -x cat {} | grep -A 2 -B 2 'CREATE EXTENSION.*pgroonga'

# Look for any search-related functions or queries
fd -e sql -x cat {} | grep -A 10 -B 2 '&@'

Length of output: 2216

coderabbitai · 2024-12-06T03:54:42Z

+        AND (
+            p_search_term = '' OR (
+                ((title ->> 'english') || ' ' || (title ->> 'spanish')) &@ p_search_term
+                OR transcription &@ p_search_term                 
+                OR translation &@ p_search_term
+                OR ((context ->> 'english') || ' ' || (context ->> 'spanish')) &@ p_search_term
+                OR ((explanation ->> 'english') || ' ' || (explanation ->> 'spanish')) &@ p_search_term
+                OR ((summary ->> 'english') || ' ' || (summary ->> 'spanish')) &@ p_search_term
+            )
+        )


🛠️ Refactor suggestion

Review search implementation performance

The search implementation concatenates multiple fields with || which might impact performance. Consider:

Using individual field searches with OR conditions

Adding function-based indexes if needed

Implementing search term highlighting

- AND ( - p_search_term = '' OR ( - ((title ->> 'english') || ' ' || (title ->> 'spanish')) &@ p_search_term - OR transcription &@ p_search_term - OR translation &@ p_search_term - OR ((context ->> 'english') || ' ' || (context ->> 'spanish')) &@ p_search_term - OR ((explanation ->> 'english') || ' ' || (explanation ->> 'spanish')) &@ p_search_term - OR ((summary ->> 'english') || ' ' || (summary ->> 'spanish')) &@ p_search_term - ) - ) + AND ( + p_search_term = '' OR ( + (title ->> 'english') &@ p_search_term OR + (title ->> 'spanish') &@ p_search_term OR + transcription &@ p_search_term OR + translation &@ p_search_term OR + (context ->> 'english') &@ p_search_term OR + (context ->> 'spanish') &@ p_search_term OR + (explanation ->> 'english') &@ p_search_term OR + (explanation ->> 'spanish') &@ p_search_term OR + (summary ->> 'english') &@ p_search_term OR + (summary ->> 'spanish') &@ p_search_term + ) + )

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

AND (

p_search_term = '' OR (

((title ->> 'english') || ' ' || (title ->> 'spanish')) &@ p_search_term

OR transcription &@ p_search_term

OR translation &@ p_search_term

OR ((context ->> 'english') || ' ' || (context ->> 'spanish')) &@ p_search_term

OR ((explanation ->> 'english') || ' ' || (explanation ->> 'spanish')) &@ p_search_term

OR ((summary ->> 'english') || ' ' || (summary ->> 'spanish')) &@ p_search_term

)

)

AND (

p_search_term = '' OR (

(title ->> 'english') &@ p_search_term OR

(title ->> 'spanish') &@ p_search_term OR

transcription &@ p_search_term OR

translation &@ p_search_term OR

(context ->> 'english') &@ p_search_term OR

(context ->> 'spanish') &@ p_search_term OR

(explanation ->> 'english') &@ p_search_term OR

(explanation ->> 'spanish') &@ p_search_term OR

(summary ->> 'english') &@ p_search_term OR

(summary ->> 'spanish') &@ p_search_term

)

)

This suggestion seems more readable @giahung68

Seems like the humans are having a chat. I'll hop back into my burrow for now. If you need me again, just tag @coderabbitai in a new comment, and I'll come hopping out!

nhphong · 2024-12-06T03:59:18Z

            END DESC,
            s.recorded_at DESC;

-    -- Get total count


Let's keep the comments here

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (1)

supabase/database/sql/get_snippets_function.sql (1)

Line range hint 4-262: Consider breaking down the complex function

The function handles multiple responsibilities including:

Authentication and authorization

Complex filtering (languages, states, sources, political spectrum, etc.)

Search functionality

Pagination

Ordering

Consider breaking this into smaller, more focused functions for better maintainability and testability.

Suggested approach:

Create separate functions for different types of filters

Move search logic to a dedicated function

Consider using a view for the base query

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between e34fe8e and 917bfe8.

📒 Files selected for processing (2)

supabase/database/sql/full_text_search_index.sql (1 hunks)
supabase/database/sql/get_snippets_function.sql (2 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

supabase/database/sql/full_text_search_index.sql

🔇 Additional comments (2)

supabase/database/sql/get_snippets_function.sql (2)

254-262: 🛠️ Refactor suggestion

Review search implementation performance

The search implementation concatenates multiple fields with || which might impact performance. Consider:

Using individual field searches with OR conditions
Adding function-based indexes if needed
Implementing search term highlighting

-        AND (
-            p_search_term = '' OR (
-                ((title ->> 'english') || ' ' || (title ->> 'spanish')) &@ p_search_term
-                OR ((explanation ->> 'english') || ' ' || (explanation ->> 'spanish')) &@ p_search_term
-                OR ((summary ->> 'english') || ' ' || (summary ->> 'spanish')) &@ p_search_term
-                OR transcription &@ p_search_term                 
-                OR translation &@ p_search_term
-            )
-        )
+        AND (
+            p_search_term = '' OR (
+                (title ->> 'english') &@ p_search_term OR
+                (title ->> 'spanish') &@ p_search_term OR
+                (explanation ->> 'english') &@ p_search_term OR
+                (explanation ->> 'spanish') &@ p_search_term OR
+                (summary ->> 'english') &@ p_search_term OR
+                (summary ->> 'spanish') &@ p_search_term OR
+                transcription &@ p_search_term OR
+                translation &@ p_search_term
+            )
+        )

1-4: Verify all function callers are updated

The function has been renamed from get_snippets_order_by to get_snippets and a new parameter p_search_term has been added. This is a breaking change that requires updates to all calling code.

✅ Verification successful

Let me verify the migration file to understand the function signature discrepancy.

Let me check for any other SQL files that might be calling this function.

Let me check the content of the get_snippets_function.sql file to understand the full context of the changes.

Based on the analysis of the code and migration files, I can now provide the final response:

Breaking changes have been properly handled

The function rename from get_snippets_order_by to get_snippets and the addition of the p_search_term parameter appear in both the migration file (20241029135348_remote_schema.sql) and the function definition file (get_snippets_function.sql). The migration file shows a simpler version with default parameters that maintains backward compatibility:

CREATE OR REPLACE FUNCTION public.get_snippets(page integer DEFAULT 0, page_size integer DEFAULT 10)

This means:

The old function is properly dropped
The new function maintains backward compatibility through default parameters
The migration handles the schema update correctly

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Search for any remaining references to the old function name
# and verify all calls to the new function include the search parameter

# Check for old function name references
echo "Checking for old function name references..."
rg "get_snippets_order_by"

# Check new function calls
echo "Checking new function calls..."
rg "get_snippets\s*\("

Length of output: 594

Script:

#!/bin/bash
# Check the content of the migration file
echo "Checking migration file content..."
cat supabase/migrations/20241029135348_remote_schema.sql

Length of output: 61749

Script:

#!/bin/bash
# Search for SQL files that might reference the function
echo "Checking SQL files for function references..."
fd -e sql -x rg -l "get_snippets"

Length of output: 239

Script:

#!/bin/bash
# Check the content of get_snippets_function.sql
echo "Checking get_snippets_function.sql content..."
cat supabase/database/sql/get_snippets_function.sql

Length of output: 12534

[f] 241 - Update the get_snippets query

e34fe8e

coderabbitai Bot reviewed Dec 6, 2024

View reviewed changes

nhphong reviewed Dec 6, 2024

View reviewed changes

nhphong approved these changes Dec 6, 2024

View reviewed changes

Hung Ta added 2 commits December 6, 2024 11:16

[f] 241 - Remove context from full-text-search

917bfe8

[f] 241 - Trimmed the data before searching

92bf373

coderabbitai Bot reviewed Dec 6, 2024

View reviewed changes

giahung68 merged commit 99b1c85 into main Dec 9, 2024

This was referenced Dec 9, 2024

[f] 241 - Update the last activity user only for user action #12

Merged

[f] 248 - Related snippets update #13

Merged

coderabbitai Bot mentioned this pull request Sep 23, 2025

[f] VER-261 - Optimize get snippets with labels #18

Merged

coderabbitai Bot mentioned this pull request Jan 29, 2026

VER-304: Optimize get_snippets filters and searching to fix timeout issues #57

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[f] 241 - Update the get_snippets query#11

[f] 241 - Update the get_snippets query#11
giahung68 merged 3 commits intomainfrom
features/241-update-get-snippets-query

giahung68 commented Dec 6, 2024 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Dec 6, 2024 •

edited

Loading

Rate limit exceeded

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Dec 6, 2024 •

edited

Loading

Uh oh!

nhphong Dec 6, 2024

Uh oh!

coderabbitai Bot Dec 6, 2024

Uh oh!

nhphong Dec 6, 2024

Uh oh!

nhphong Dec 6, 2024

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

giahung68 commented Dec 6, 2024 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Dec 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Changes

Poem

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Dec 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nhphong Dec 6, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Dec 6, 2024

Choose a reason for hiding this comment

Uh oh!

nhphong Dec 6, 2024

Choose a reason for hiding this comment

Uh oh!

nhphong Dec 6, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

giahung68 commented Dec 6, 2024 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Dec 6, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

coderabbitai Bot Dec 6, 2024 •

edited

Loading