-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add company ID to ald_demo #355
Comments
Thanks! @jdhoffa what do you think? |
I think this is a good call, but have a hunch it may introduce some necessary changes to In any case, I think we should start with a PR here, introducing the new column, and leave this PR open while we test the effects on downstream packages. It should be a concerted effort, so let's flag this in the next PACTA dev prioritization session, and figure out when we'll have time to do it. |
Can you explain a bit what the company ID are and where they come from? Can you show an example?
@jdhoffa, would it be safer to move backwards? is there a way to map the matched output back to the company IDs so that we can get the benefits of the IDs in r2dii.analysis without changing upstream code? |
@maurolepore sorry about the slow response. It has been a busy month, already. The company IDs are generated in Asset Resolution database and are unique to each entity. If needed, @tposey28 can provide additional details. Here are some examples: |
Thanks @vintented I'm I right in understanding that the devtools::load_all()
#> Loading r2dii.match
ald_wish <- fake_ald(
id_ar = c(478460, 931),
name_company = tolower(c("Interoil Argentina As", "Boeing Co/The"))
)
ald_wish
#> # A tibble: 2 x 4
#> name_company sector alias_ald id_ar
#> <chr> <chr> <chr> <dbl>
#> 1 interoil argentina as power alpineknitsindiapvt ltd 478460
#> 2 boeing co/the power alpineknitsindiapvt ltd 931 Created on 2021-02-02 by the reprex package (v0.3.0) (Here |
Are they unique to each entity + sector combination? ie. |
@jdhoffa they are unique to company regardless of the sector, or in other words, consistent across sectors. |
@jdhoffa, am I right in thinking that the benefit that @vintented wants is at the level of r2dii.analysis -- not further upstream? To change things upstream we can, but do we have to? Or can we first get the benefits in a safer way then roll the solution deeper into the dependency tree? |
Hey jumping in here quickly! The IDs are kept consistent by comparing the Bloomberg IDs and LEIs if there is financial data, if not then Global Data ID if available, if not then unique simplified name and country combination. We match run this logic against every new quarter of data. Of course an old company may slip in with a new ID due to it failing at all of these steps, but then Vincent and I will often go through and reconcile these with the old IDs (the important noticeable ones at least) by looking for old IDs that lost production and new IDs that gained production. The main benefit for this is if Global Data or Bloomberg changes a name, but not their ID, we will update the name without changing the AR ID. This is useful for clients who may otherwise argue that a company disappeared. |
@maurolepore I think all of this is fine:
I will open the first point as a draft PR today, and let's see if anything breaks and go from there. |
@jdhoffa and @maurolepore super exciting! You would be surprised how excited people are about company IDs. Let me know if I can help in anyway :) |
@jdhoffa @maurolepore @tposey28 @vintented @daisy-pacheco @Lauramirez-2ii This is needed for some open engagement with emerging market banks. Thanks, |
Reconsidering the above comment and noting the conversation form PACTA - AR call This solution of adding Unique IDs to each AR data release for Banks and the subsequent code changes that this will require is no longer a top priority. This would be good to have for March 2021 and will help a bank to preserve there matches from a previous matching exercise, year on year. A short term solution for a bank wanting to match now using the old (q4 2019) data ALD is to proceed with matching and then use an excel bridging file to manually carry over the old matches to the new data set q4 2020. thanks all! |
I moved this issue to |
Closing in favour of #375 |
It would be great to add company IDs (unique numeric sequence) to ald_demo so Asset Resolution can start integrating them into the PACTA for Bank datasets. This addition would ensure that production is always aggregated to the correct entity and help with QA issues and tracking.
Let me know if I can provide any additional details.
The text was updated successfully, but these errors were encountered: