Fixes and improvements to getbugs() and related methods #160

Elkasitu · 2022-04-17T00:28:07Z

See individual commit messages for context

This allows users of python-bugzilla to override the default limit of 20 which can help with network overhead when pulling large amounts of data from bugzilla instances.

The new method `iterbugs()` is analogous to `getbugs()` in that it takes the same parameters and will fetch all bugs requested by the caller, however has two advantages over `getbugs()`: 1. Pagination is done automatically, meaning that `iterbugs()` returns a generator of chunks / pages so that users can simply iterate through the result instead of having to deal with pagination logic themselves. 2. Bug objects are created on demand when iterating through a chunk / page, helping save some memory which can be useful when dealing with large chunks. This commit also performs a mini-refactoring of the logic that converts bug data into a Bug object or None for reusability and clarity.

In `_getbugs()`, the bit of code that ensures that the output order of the bugs is the same as the input order of ids incorrectly checks whether idint / aliaslist are falsy instead of explicitly checking for None. If idlist / aliaslist contain falsy non-None values such as 0 or "", they will skip the check and be potentially incorrectly added to the return list. An example of where this fails would be an idlist = range(0, 20) From tests against the Red Hat bugzilla instance, 0 is not a valid bug_id thus the API will return 1-20, but since idval = 0 will skip the two checks, the end result will be of length 21 (unexpected) and the first bug will be in the return list twice.

crazyscientist · 2023-08-10T11:18:24Z

bugzilla/base.py

+        for i in range(0, len(idlist), limit):
+            yield (
+                self._to_bug(bug)
+                for bug in self._getbugs(
+                    idlist[i:i + limit],
+                    include_fields=include_fields,
+                    exclude_fields=exclude_fields,
+                    extra_fields=extra_fields,
+                    permissive=permissive,
+                    limit=limit,


I like the idea of having a generator like this in the library a lot.

If you create the chunks of idlist yourself, the use of limit becomes redundant.

You could use limit in combination with offset, but for really long lists we would pass a lot of unneeded data. May I suggest to undo the changes related to the limit argument and just go for chunked islists?

And if you don't mind, a unittest for this new method would be much appreciated. 🙂

crazyscientist · 2023-08-10T11:19:49Z

bugzilla/base.py

@@ -1131,20 +1134,52 @@ def getbug(self, objid,
            extra_fields=extra_fields)
        return Bug(self, dict=data, autorefresh=self.bug_autorefresh)


Let's use your new private method here, too.

Suggested change

return Bug(self, dict=data, autorefresh=self.bug_autorefresh)

return self._to_bug(bug_data=data, autorefresh=self.bug_autorefresh)

Yeah, this _to_bug cleanup is nice. If it was a separate patch I'd apply it

crazyscientist · 2023-08-10T11:23:42Z

bugzilla/base.py

-                           autorefresh=self.bug_autorefresh)) or None
-                for b in data]
+            permissive=permissive, limit=limit)
+        return [self._to_bug(b) for b in data]


When I look at the git history, I see that somebody added the construct (b and Bug(...)) or None deliberately. But from personal usage experience, I do not recall ever seeing a None instead of a Bug instance.

So, I would say, this simplifications makes a lot of sense 👍

@crobinso Would you agree?

I believe old bugzilla instances would return None if 'permissive=True. Maybe we don't need to maintain it anymore. Safest thing to do is keep it intact unless someone wants to dig through old bugzilla docs.

crobinso

@Elkasitu sorry for very late response. Are you still interested in this PR?

crobinso · 2024-02-14T16:27:55Z

bugzilla/base.py

@@ -1061,7 +1061,8 @@ def _supports_getbug_extra_fields(self):


    def _getbugs(self, idlist, permissive,
-            include_fields=None, exclude_fields=None, extra_fields=None):
+            include_fields=None, exclude_fields=None, extra_fields=None,
+            limit=None):


It seems like with bugzilla.redhat.com, getbugs does not have a limit anymore (query() still does that). So i think this was a transient change?

crobinso · 2024-02-14T16:29:49Z

bugzilla/base.py

-                           autorefresh=self.bug_autorefresh)) or None
-                for b in data]
+            permissive=permissive, limit=limit)
+        return [self._to_bug(b) for b in data]


I believe old bugzilla instances would return None if 'permissive=True. Maybe we don't need to maintain it anymore. Safest thing to do is keep it intact unless someone wants to dig through old bugzilla docs.

crobinso · 2024-02-14T16:31:36Z

bugzilla/base.py

@@ -1131,20 +1134,52 @@ def getbug(self, objid,
            extra_fields=extra_fields)
        return Bug(self, dict=data, autorefresh=self.bug_autorefresh)


Yeah, this _to_bug cleanup is nice. If it was a separate patch I'd apply it

Elkasitu · 2024-02-15T10:34:31Z

@Elkasitu sorry for very late response. Are you still interested in this PR?

Yes, I'll take another look in the coming days and address the review feedback

crobinso · 2024-09-23T17:35:36Z

This PR has been stalled for a while, so I'm closing it. @Elkasitu if you are still interested, please feel free to resubmit after addressing the review comments. Thanks!

Elkasitu added 3 commits April 17, 2022 00:51

base: Allow setting limit for getbugs()

2beb93f

This allows users of python-bugzilla to override the default limit of 20 which can help with network overhead when pulling large amounts of data from bugzilla instances.

crazyscientist requested changes Aug 10, 2023

View reviewed changes

crobinso requested changes Feb 14, 2024

View reviewed changes

crobinso closed this Sep 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fixes and improvements to getbugs() and related methods #160

Fixes and improvements to getbugs() and related methods #160

Uh oh!

Elkasitu commented Apr 17, 2022

Uh oh!

crazyscientist Aug 10, 2023

Uh oh!

crazyscientist Aug 10, 2023

Uh oh!

crobinso Feb 14, 2024

Uh oh!

crazyscientist Aug 10, 2023

Uh oh!

crobinso Feb 14, 2024

Uh oh!

crobinso left a comment

Uh oh!

crobinso Feb 14, 2024

Uh oh!

crobinso Feb 14, 2024

Uh oh!

crobinso Feb 14, 2024

Uh oh!

Elkasitu commented Feb 15, 2024

Uh oh!

crobinso commented Sep 23, 2024

Uh oh!

Uh oh!

		@@ -1131,20 +1134,52 @@ def getbug(self, objid,
		extra_fields=extra_fields)
		return Bug(self, dict=data, autorefresh=self.bug_autorefresh)

	return Bug(self, dict=data, autorefresh=self.bug_autorefresh)
	return self._to_bug(bug_data=data, autorefresh=self.bug_autorefresh)

Fixes and improvements to getbugs() and related methods #160

Fixes and improvements to getbugs() and related methods #160

Uh oh!

Conversation

Elkasitu commented Apr 17, 2022

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

crobinso left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Elkasitu commented Feb 15, 2024

Uh oh!

crobinso commented Sep 23, 2024

Uh oh!

Uh oh!