
🐛 Source Linkedin Ads: fix changing next_page_token stopping criteria #34166

Conversation

FVidalCarneiro (Contributor) commented Jan 11, 2024

What

This pull request changes the criterion used by the Airbyte LinkedIn Ads source connector to decide when to stop requesting further pages from the LinkedIn API. Currently, the criterion is: stop requesting a new page when the page size is smaller than the maximum allowed records per page. We propose to change this criterion to: stop requesting a new page when the page size is equal to zero.

Fixed #34164

How

This is done by modifying the next_page_token method in both the LinkedinAdsStream class and the LinkedInAdsAnalyticsStream class.
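For illustration, here is a minimal sketch of the proposed criterion. The field and attribute names (elements, paging, self.records_limit) match those quoted later in this thread; the method signature, the stand-in class, and the returned token shape are simplified assumptions rather than a copy of streams.py.

```python
from typing import Any, Mapping, Optional

import requests


class LinkedinAdsStream:  # simplified stand-in for the real stream class in streams.py
    records_limit = 100  # illustrative page size, not the connector's actual value

    def next_page_token(self, response: requests.Response) -> Optional[Mapping[str, Any]]:
        parsed_response = response.json()
        # Old criterion: stop when a page holds fewer records than records_limit.
        # A broken creative missing from an otherwise full page ends pagination too early.
        # Proposed criterion: stop only when the returned page is empty.
        if len(parsed_response.get("elements", [])) == 0:
            return None
        return {"start": parsed_response["paging"]["start"] + self.records_limit}
```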

Recommended reading order

  1. streams.py

🚨 User Impact 🚨

The source connector will behave as previously expected; there are no breaking changes. Because one extra API call is made per account, this might slightly affect source connector performance, but the difference is negligible. The version change should therefore only be a patch, bringing the connector version to 0.6.5.

Pre-merge Actions

Expand the relevant checklist and delete the others.

Updating a connector

Community member or Airbyter

  • Grant edit access to maintainers (instructions)
  • Unit & integration tests added

Airbyter

If this is a community PR, the Airbyte engineer reviewing this PR is responsible for the below items.

  • Create a non-forked branch based on this PR and test the below items on it
  • Build is successful
  • If new credentials are required for use in CI, add them to GSM. Instructions.

octavia-squidington-iii added the area/connectors (Connector related issues) label on Jan 11, 2024


github-actions bot commented Jan 11, 2024

Before Merging a Connector Pull Request

Wow! What a great pull request you have here! 🎉

To merge this PR, ensure the following has been done/considered for each connector added or updated:

  • PR name follows PR naming conventions
  • Breaking changes are considered. If a Breaking Change is being introduced, ensure an Airbyte engineer has created a Breaking Change Plan.
  • Connector version has been incremented in the Dockerfile and metadata.yaml according to our Semantic Versioning for Connectors guidelines
  • You've updated the connector's metadata.yaml file with any other relevant changes, including a breakingChanges entry for major version bumps. See metadata.yaml docs
  • Secrets in the connector's spec are annotated with airbyte_secret
  • All documentation files are up to date. (README.md, bootstrap.md, docs.md, etc...)
  • Changelog updated in docs/integrations/<source or destination>/<name>.md with an entry for the new version. See changelog example
  • Migration guide updated in docs/integrations/<source or destination>/<name>-migrations.md with an entry for the new version, if the version is a breaking change. See migration guide example
  • If set, you've ensured the icon is present in the platform-internal repo. (Docs)

If the checklist is complete, but the CI check is failing,

  1. Check for hidden checklists in your PR description

  2. Toggle the github label checklist-action-run on/off to re-run the checklist CI.


CLAassistant commented Jan 11, 2024

CLA assistant check
All committers have signed the CLA.

FVidalCarneiro (Contributor, Author)

This PR solves related issue #34164

marcosmarxm (Member) left a comment


Thanks @FVidalCarneiro, the issue you opened made the change you're proposing much clearer. With this change the connector will, in most cases, always request the N+1 page (a page that doesn't exist). What is the API response in that case? Can you confirm it won't return a 404 error?

Meanwhile, I've asked the connector team to take a look at your change.

FVidalCarneiro (Contributor, Author) commented Jan 19, 2024

Thank you @marcosmarxm for your review.

Indeed, with the suggested implementation, we will always obtain the N+1 page, where N is the last page of the object. As you can see in the screenshot below, the API responds with an empty page and status code 200:

[Screenshot "last-page-ok": the request for the page after the last one returns an empty page with HTTP 200]

This means the current suggested implementation works.

If we wanted to avoid this "useless" call (which is nonetheless required when there is a broken creative; please see the initial description of this PR), another solution would be to add the condition parsed_response.get("paging")["start"] + self.records_limit > parsed_response.get("paging")["total"] to the pagination-ending criteria, making the full condition:
if len(parsed_response.get("elements")) < self.records_limit and (parsed_response.get("paging")["start"] + self.records_limit > parsed_response.get("paging")["total"]):. In this case we would keep the original condition but refine it with a check confirming that N is indeed the last page, since the next page would request more elements than exist in total. We could likewise apply only the condition parsed_response.get("paging")["start"] + self.records_limit > parsed_response.get("paging")["total"], which would keep the current connector behavior while still avoiding the issue we identified.

Even though the currently suggested implementation works, this last suggestion would also protect the connector against a potential future LinkedIn API change in which a non-existing page returns a 404 instead of a 200 HTTP status code.

If you (and/or the connector team) give me your approval, I can add this last suggested implementation (bearing in mind that the downside is that it requires more parsing of the response elements). Do you have a preference between these two solutions?
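A minimal sketch of this second option, combining the original element-count check with the paging-total check quoted above. As in the earlier sketch, the method signature, stand-in class, and token shape are assumptions for illustration, not the connector's exact code.

```python
from typing import Any, Mapping, Optional

import requests


class LinkedinAdsStream:  # simplified stand-in for the real stream class in streams.py
    records_limit = 100  # illustrative page size, not the connector's actual value

    def next_page_token(self, response: requests.Response) -> Optional[Mapping[str, Any]]:
        parsed_response = response.json()
        paging = parsed_response.get("paging", {})
        # A page can hold fewer than records_limit records when a creative is broken
        # (see issue #34164), so the element-count check alone is not a reliable
        # last-page signal; the paging total confirms nothing is left to fetch.
        is_last_page = (
            len(parsed_response.get("elements", [])) < self.records_limit
            and paging["start"] + self.records_limit > paging["total"]
        )
        if is_last_page:
            return None
        return {"start": paging["start"] + self.records_limit}
```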

lazebnyi (Collaborator) left a comment


Hey @FVidalCarneiro,
Thanks for your PR. Let's keep len(parsed_response.get("elements")) < self.records_limit and (parsed_response.get("paging")["start"] + self.records_limit > parsed_response.get("paging")["total"]) as the condition for the last page, and add a comment with a link to the issue explaining why we need the additional check.

FVidalCarneiro (Contributor, Author)

Following the engineering team's feedback (available here), I have complemented the pagination-ending condition with a check that uses the total record count field to confirm there are no further pages to request. This will both:

  1. Allow the connector to handle broken creatives that are missing from paginated responses (a page returning 99 records instead of 100).
  2. Avoid sending a request for an empty page (the last request will be for the last page).

Ready for review @lazebnyi, many thanks!

octavia-squidington-iii added the area/documentation (Improvements or additions to documentation) label on Feb 9, 2024
FVidalCarneiro (Contributor, Author)

Hi @lazebnyi, I tried to follow the provided documentation to grant edit access to maintainers, but I don't think this is possible given that my PR comes from an organization-owned GitHub project. I found this issue, which describes the problem.

What is the recommended course of action? Thanks!

lazebnyi (Collaborator) commented Feb 9, 2024

@FVidalCarneiro Can you pull changes from #35046 to your branch?

FVidalCarneiro (Contributor, Author)

I merged your changes into this branch @lazebnyi, thank you. Let me know if it looks good on your end.

FVidalCarneiro (Contributor, Author)

Hi @lazebnyi, I recently noticed a small merge conflict to resolve in docs/integrations/sources/linkedin-ads.md; it should be good now. Let me know if there are any further actions on my side. Thank you for the support!

FVidalCarneiro requested review from a team as code owners on February 27, 2024 10:12
marcosmarxm (Member)

It was merged in #37421

FVidalCarneiro mentioned this pull request on Jul 8, 2024