-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update mexico_plataforma_digital_nacional spider #976
Comments
How does PDN disambiguate the two publications? Is its CompraNet data identical to https://datos.gob.mx/busca/dataset/concentrado-de-contrataciones-abiertas-de-la-apf ? |
In the front end, they have a select where you can select the publisher (or, as they call it, "data provider") In the API, they have the "supplier_id" query param that accepts SCHP and SESEA_AGS as values.
I don't know, but I can check. Edit, in their FAQ section they say:
So they say they don't generate information, only create interoperability between the different entities, and the platform is only a "query" tool. So I guess we can assume their data from Compranet is identical to what Compranet publishes. |
Okay, so let's only add Aguascalientes to Collect and the data registry, since we don't have the upstream source (unless we can find out where it is). We can add their SHCP data to Collect, but I'm not sure who would want it in favor of the CompraNet data. |
I'd be curious if the ocid is the same, and if so, what ocid is used for Aguascalientes. |
For Compranet, the prefix is the one registered for SHCP ocds-07smqs, and for Aguascalientes, the one registered for Aguascalientes ocds-ty10ed (we registered that one in the past, so maybe they are sending their data directly to the platform and not publishing it themselves? Aha! from CRM-8104 they said, "we developed the open contracting system for the Plataforma Digital Nacional" so they are indeed only publishing their data through PDN, so new publisher alert? But who? Aguascalientes, I guess?) |
Yes, I'd say Aguascalientes. We haven't had a scenario where the publishing platform is independent of the data's author/steward, but I think we want to be tracking the author (despite the field being |
We can maybe add a note to the docs for the CompraNet spider, to indicate that it's also published via PDN. Not sure how useful that fact is for the data registry, but we can also add it to the end of the description. |
Ok, so I assume we want one spider for each. And if they add more publishers in the future will add new spiders as we did with Digiwhist, right? |
I meant the existing CompraNet spider (APF) can just have a note added. We don't need a CompraNet spider for PDN.
Yup! Assuming we don't have access to the original source. |
https://www.plataformadigitalnacional.org/contrataciones used to have a link to a Google Drive JSON file. Now they point directly to https://datos.gob.mx/busca/dataset/concentrado-de-contrataciones-abiertas-de-la-apf. However, they are also publishing data from other data source now "Secretaría Ejecutiva del Sistema Estatal Anticorrupción de Aguascalientes".
And now, they have an underlying API, for example:
curl 'https://api.plataformadigitalnacional.org/s6/api/v1/search?supplier_id=SHCP'
-X 'POST'
To get the list of releases by publisher, so we could update the spider to use that API endpoint instead of the Google Drive file.
The only thing to decide is if we want two spiders, one per publisher (e.g., one for SHCP and another one for Aguascalientes) or just one for Plataforma Digital Nacional as of now.
The text was updated successfully, but these errors were encountered: