Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IDL scraper is currently pulling in non-authoritative IDL #34

Open
mdittmer opened this issue Jan 19, 2017 · 5 comments
Open

IDL scraper is currently pulling in non-authoritative IDL #34

mdittmer opened this issue Jan 19, 2017 · 5 comments
Assignees
Labels

Comments

@mdittmer
Copy link
Owner

At least some of this is coming from <pre> without class="idl"; e.g., from the WebIDL spec's description of IDL syntax.

@mdittmer mdittmer added the bug label Jan 19, 2017
@mdittmer mdittmer self-assigned this Jan 19, 2017
@mdittmer
Copy link
Owner Author

@LoonyBean FYI. This might to be related to some false positives you've been seeing.

@mdittmer
Copy link
Owner Author

mdittmer commented Jan 19, 2017

Switching to matching on pre.idl instead of just pre results in losing data for the following items that appear in Blink's IDL (first two columns: count-of-things-in-current-data, count-of-things-in-pre.idl-data):

1       0       HTMLKeygenElement       https://html.spec.whatwg.org/
1       0       PermissionName          https://w3c.github.io/permissions/
1       0       MutationEvent           https://w3c.github.io/uievents/
1       0       KeyFormat               https://w3c.github.io/webcrypto/Overview.html
1       0       KeyUsage                https://w3c.github.io/webcrypto/Overview.html
1       0       SubtleCrypto            https://w3c.github.io/webcrypto/Overview.html
1       0       AlgorithmIdentifier     https://w3c.github.io/webcrypto/Overview.html
1       0       Crypto                  https://w3c.github.io/webcrypto/Overview.html
1       0       CryptoKey               https://w3c.github.io/webcrypto/Overview.html
1       0       PaymentAppRequestData   https://w3c.github.io/webpayments-payment-apps-api/
1       0       MediaImageInit          https://wicg.github.io/mediasession/

mdittmer added a commit that referenced this issue Jan 19, 2017
…be-lost that is defined in Blink to data/idl/blink/linked/manual.json. Towards #34
@mdittmer
Copy link
Owner Author

Also suspicious, but not in Blink:

1       0       ResponseBodyInit                        https://fetch.spec.whatwg.org/
1       0       PresentationConnectionClosedReason      https://w3c.github.io/presentation-api/
1       0       ProximitySensor                         https://w3c.github.io/sensors/
1       0       ProximitySensorDirection                https://w3c.github.io/sensors/
1       0       ProximitySensorOptions                  https://w3c.github.io/sensors/
1       0       ProximitySensorPosition                 https://w3c.github.io/sensors/
1       0       ProximitySensorReading                  https://w3c.github.io/sensors/
1       0       Keyframe                                https://w3c.github.io/web-animations/
1       0       KeyframeArgument                        https://w3c.github.io/web-animations/
1       0       PropertyIndexedKeyframes                https://w3c.github.io/web-animations/
1       0       ComputedKeyframe                        https://w3c.github.io/web-animations/
1       0       AesCbcParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       JsonWebKey                              https://w3c.github.io/webcrypto/Overview.html
1       0       KeyAlgorithm                            https://w3c.github.io/webcrypto/Overview.html
1       0       KeyType                                 https://w3c.github.io/webcrypto/Overview.html
1       0       NamedCurve                              https://w3c.github.io/webcrypto/Overview.html
1       0       Pbkdf2Params                            https://w3c.github.io/webcrypto/Overview.html
1       0       RsaHashedImportParams                   https://w3c.github.io/webcrypto/Overview.html
1       0       RsaHashedKeyAlgorithm                   https://w3c.github.io/webcrypto/Overview.html
1       0       RsaHashedKeyGenParams                   https://w3c.github.io/webcrypto/Overview.html
1       0       RsaKeyAlgorithm                         https://w3c.github.io/webcrypto/Overview.html
1       0       RsaKeyGenParams                         https://w3c.github.io/webcrypto/Overview.html
1       0       RsaOaepParams                           https://w3c.github.io/webcrypto/Overview.html
1       0       RsaOtherPrimesInfo                      https://w3c.github.io/webcrypto/Overview.html
1       0       RsaPssParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       AesCtrParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       AesDerivedKeyParams                     https://w3c.github.io/webcrypto/Overview.html
1       0       AesGcmParams                            https://w3c.github.io/webcrypto/Overview.html
1       0       AesKeyAlgorithm                         https://w3c.github.io/webcrypto/Overview.html
1       0       AesKeyGenParams                         https://w3c.github.io/webcrypto/Overview.html
1       0       Algorithm                               https://w3c.github.io/webcrypto/Overview.html
1       0       BigInteger                              https://w3c.github.io/webcrypto/Overview.html
1       0       CryptoKeyPair                           https://w3c.github.io/webcrypto/Overview.html
1       0       EcKeyAlgorithm                          https://w3c.github.io/webcrypto/Overview.html
1       0       EcKeyGenParams                          https://w3c.github.io/webcrypto/Overview.html
1       0       EcKeyImportParams                       https://w3c.github.io/webcrypto/Overview.html
1       0       EcdhKeyDeriveParams                     https://w3c.github.io/webcrypto/Overview.html
1       0       EcdsaParams                             https://w3c.github.io/webcrypto/Overview.html
1       0       GlobalCrypto                            https://w3c.github.io/webcrypto/Overview.html
1       0       HashAlgorithmIdentifier                 https://w3c.github.io/webcrypto/Overview.html
1       0       HkdfParams                              https://w3c.github.io/webcrypto/Overview.html
1       0       HmacImportParams                        https://w3c.github.io/webcrypto/Overview.html
1       0       HmacKeyAlgorithm                        https://w3c.github.io/webcrypto/Overview.html
1       0       HmacKeyGenParams                        https://w3c.github.io/webcrypto/Overview.html
1       0       RTCInboundRTPStreamStats                https://w3c.github.io/webrtc-pc/
1       0       RTCOutboundRTPStreamStats               https://w3c.github.io/webrtc-pc/
1       0       RTCRTPStreamStats                       https://w3c.github.io/webrtc-pc/
1       0       RTCStatsType                            https://w3c.github.io/webrtc-pc/
1       0       DetectedObject                          https://wicg.github.io/shape-detection-api/
1       0       VertexAttribFVSource                    https://www.khronos.org/registry/webgl/specs/latest/1.0/

@mdittmer
Copy link
Owner Author

@foolip PTAL. WDYT of these lists? Note that mentioned commit stores manual snapshot of first list, but not second.

This was referenced Jan 19, 2017
@foolip
Copy link
Collaborator

foolip commented Jan 19, 2017

Hmm, sure looks like some IDL that we do want in there. By requiring class=idl, what bogus things do we get rid of? Maybe those can be skipped in some other way?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants