feat(source-linkedin-ads): Remove custom cursors and retrievers from analytics streams to enable concurrent processing #58114

brianjlai · 2025-04-17T05:07:42Z

https://github.com/airbytehq/airbyte-internal-issues/issues/12144

What

We have 11 analytics streams that used to implement CustomRetriever and CustomIncrementalSync components. Because they use a custom cursor, we forced this to use the synchronous engine because custom cursors aren't compatible with the concurrent framework.

With the introduction of the property chunking feature introduced in the CDK in version 6.45.0, the custom component functionality now exists in the low-code framework without the need for custom cursors.

How

All of the 11 analytics streams follow roughly the same pattern so each change should apply to the others

Update the dateRange query parameter to read from the current slice to get year/month/day. The custom components has custom behavior to inject start.year into slices. Slices actually already have the correct information using jinja datetime functions like .year, .month
Move the transformers from being defined in $parameters and instead the idiomatic place under DeclarativeStream.transformations. We originaly did this because we did not have a good way to inject the transformation into our custom retriever. However, now that we don't have a custom retriever, we can define them as we do for other low-code connectors
Replace the custom component definitions with default implementations
Define QueryProperties component and use it in all analytics streams

User Impact

None

Can this PR be safely reverted and rolled back?

YES 💚
NO ❌

vercel · 2025-04-17T05:07:47Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
airbyte-docs	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Apr 18, 2025 8:29pm

maxi297

LGTM! Just one question on something that I don't think is a problem and I would like to learn about

Also, can you compare speed before and after this change? It can be just one stream as a sample but it would be interesting to have a high level estimate of how much it helped that we can share to support

maxi297 · 2025-04-17T20:36:16Z

airbyte-integrations/connectors/source-linkedin-ads/source_linkedin_ads/manifest.yaml

+    property_chunking:
+      type: PropertyChunking
+      property_limit_type: property_count
+      property_limit: 18


Is the property_limit 18 because of dateRange and pivotValue which are not included in the count to 20?

yes that's correct. the actual max is 20, but we must include those two values in every request in order to merge records together correctly

It feels like it would have been a better UX to have the property chunking consider the values that are always present so that the limit aligns with the API restrictions. For property_count, it is probably fine but for number of bytes, it becomes a bit more annoying to calculate. Overall, it is probably a nit but still wanted to raise my concern here

…ht level

brianjlai · 2025-04-18T05:08:56Z

Passing regression test run:
https://github.com/airbytehq/airbyte/actions/runs/14526014907

brianjlai · 2025-04-18T21:00:24Z

regression test notes:

There are some differing values on one of the other test runs, but after running and inspecting some results, it seems like the mismatches might be drift between when control vs. candidate runs are kicked off. The exact counts of records are the same. I also looked at some local syncs of our test data to verify that fields exist in the final emitted record
And on the attached run above, there were exact correct record counts or all data matched. I am inclined to release this w/ a progressive rollout

remove custom cursors and retrievers from manifest

23dc2b8

brianjlai requested review from maxi297, tolik0 and darynaishchenko April 17, 2025 05:07

octavia-squidington-iii added the connectors/source/linkedin-ads label Apr 17, 2025

fix unit tests and remove no longer relevant code

14f20a5

vercel bot deployed to Preview April 17, 2025 19:36 View deployment

maxi297 approved these changes Apr 17, 2025

View reviewed changes

fix ad_creatives_analytics transformation was not indented to the rig…

c79837d

…ht level

vercel bot deployed to Preview April 17, 2025 21:53 View deployment

bump cdk version to the latest release

f128589

vercel bot deployed to Preview April 18, 2025 20:29 View deployment

brianjlai merged commit c723a2e into master Apr 18, 2025
27 checks passed

brianjlai deleted the brian/linkedin_ads_remove_custom_retriever_and_cursor branch April 18, 2025 21:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(source-linkedin-ads): Remove custom cursors and retrievers from analytics streams to enable concurrent processing #58114

feat(source-linkedin-ads): Remove custom cursors and retrievers from analytics streams to enable concurrent processing #58114

Uh oh!

brianjlai commented Apr 17, 2025

Uh oh!

vercel bot commented Apr 17, 2025 •

edited

Loading

Uh oh!

maxi297 left a comment

Uh oh!

maxi297 Apr 17, 2025

Uh oh!

brianjlai Apr 17, 2025

Uh oh!

maxi297 Apr 17, 2025

Uh oh!

brianjlai commented Apr 18, 2025

Uh oh!

brianjlai commented Apr 18, 2025

Uh oh!

Uh oh!

Uh oh!

feat(source-linkedin-ads): Remove custom cursors and retrievers from analytics streams to enable concurrent processing #58114

feat(source-linkedin-ads): Remove custom cursors and retrievers from analytics streams to enable concurrent processing #58114

Uh oh!

Conversation

brianjlai commented Apr 17, 2025

What

How

User Impact

Can this PR be safely reverted and rolled back?

Uh oh!

vercel bot commented Apr 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maxi297 left a comment

Choose a reason for hiding this comment

Uh oh!

maxi297 Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

brianjlai Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

maxi297 Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

brianjlai commented Apr 18, 2025

Uh oh!

brianjlai commented Apr 18, 2025

Uh oh!

Uh oh!

Uh oh!

vercel bot commented Apr 17, 2025 •

edited

Loading