fix: paginate reading log query when no shelf is selected by AhmedxSaid · Pull Request #12499 · internetarchive/openlibrary

AhmedxSaid · 2026-04-30T17:11:21Z

Problem

When a user views their reading log without selecting a specific shelf (the "All" view), get_sorted_reading_log_books() in bookshelves.py overwrites the query with one that has no LIMIT or OFFSET:

if not bookshelf_id:
    query = "SELECT * from bookshelves_books WHERE username=$username"
    # XXX Removing limit, offset, etc from data looks like a bug
    # unrelated / not fixing in this PR.
    query_params = {"username": username}

This fetches every book the user has ever logged into memory at once, then passes all those keys to a Solr get_many call. For power users with thousands of books this causes significant memory pressure and slow/failed page loads — and gets worse as their library grows.

The total_results calculation also had a related bug: shelf_totals.get(bookshelf_id, 0) returns 0 when bookshelf_id is falsy, so pagination controls showed incorrect totals.

Fix

Add LIMIT $limit OFFSET $offset to the unbounded query, preserving the existing limit/offset from query_params
Fix total_results to sum across all shelves when no specific shelf is selected

Test plan

View reading log "All" tab as a user with many books — verify only one page loads
Paginate through the reading log — verify correct pages are returned
View a specific shelf — verify existing behavior unchanged
Verify total book count shown in pagination is correct for the "All" view

🤖 Generated with Claude Code

When bookshelf_id is falsy (all-books view), the query fetched every book for the user with no LIMIT or OFFSET, causing unbounded memory and slow Solr get_many calls for users with large reading logs. Also fixes total_results returning 0 in this case — it now sums across all shelves instead of looking up a None key. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

mekarpeles · 2026-04-30T17:13:45Z

Copilot

Pull request overview

Fixes performance and pagination correctness for the “All” (no-shelf-selected) reading log view by ensuring the DB query remains paginated and totals reflect all shelves.

Changes:

Add ORDER BY created … LIMIT … OFFSET … to the all-shelves bookshelves_books query to prevent unbounded reads.
Update total_results logic to compute totals across shelves when no specific shelf is selected.

Copilot · 2026-04-30T17:15:46Z

                cls.add_storage_items_for_deletes(reading_log_keys, solr_docs)

-            total_results = shelf_totals.get(bookshelf_id, 0)
+            total_results = shelf_totals.get(bookshelf_id) or sum(shelf_totals.values())


total_results = shelf_totals.get(bookshelf_id) or sum(...) will return the sum of all shelves whenever the selected shelf count is 0 or missing from shelf_totals (e.g., a user has no books on that shelf). That makes pagination totals incorrect for empty shelves. Consider branching on bookshelf_id (all-shelves vs specific shelf) and using a default of 0 for the specific-shelf case instead of relying on truthiness.

Suggested change

total_results = shelf_totals.get(bookshelf_id) or sum(shelf_totals.values())

if not bookshelf_id:

total_results = sum(shelf_totals.values())

else:

total_results = shelf_totals.get(bookshelf_id, 0)

Copilot · 2026-04-30T17:15:47Z

-                # unrelated / not fixing in this PR.
-                query_params = {"username": username}
+                query = (
+                    "SELECT * from bookshelves_books WHERE username=$username "


In the all-shelves branch this query uses SELECT *, but downstream only relies on work_id, created, and edition_id. Selecting only the needed columns would reduce DB I/O and memory use, and keeps this query consistent with the shelf-specific query above.

Suggested change

"SELECT * from bookshelves_books WHERE username=$username "

"SELECT work_id, created, edition_id from bookshelves_books WHERE username=$username "

mekarpeles requested a review from Copilot April 30, 2026 17:11

Copilot started reviewing on behalf of mekarpeles April 30, 2026 17:12 View session

mekarpeles added the Needs: Submitter Input Waiting on input from the creator of the issue/pr [managed] label Apr 30, 2026

Copilot AI reviewed Apr 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: paginate reading log query when no shelf is selected#12499

fix: paginate reading log query when no shelf is selected#12499
AhmedxSaid wants to merge 1 commit intointernetarchive:masterfrom
AhmedxSaid:fix/reading-log-unbounded-query

AhmedxSaid commented Apr 30, 2026

Uh oh!

mekarpeles commented Apr 30, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 30, 2026

Uh oh!

Copilot AI Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	"SELECT * from bookshelves_books WHERE username=$username "
	"SELECT work_id, created, edition_id from bookshelves_books WHERE username=$username "

Uh oh!

Conversation

AhmedxSaid commented Apr 30, 2026

Problem

Fix

Test plan

Uh oh!

mekarpeles commented Apr 30, 2026

Possible improvements for this PR

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants