Can I extract attribute values using OpenScraping? #6
Comments
It should be possible. I will give it a try.
Sorry, I mistakenly closed this item. This is still a problem.
OpenScraping depends on Html Agility Pack, which does not support attribute selection, as explained here. What I can do is modify the code to support simple attribute selection, so your particular example would work. Let me know if you still need this and I will make the change.
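For readers wondering what "simple attribute selection" could look like, here is a minimal sketch in Python rather than in OpenScraping's own code. It assumes a selector engine that can only return elements (Python's standard-library ElementTree stands in for one here), strips a trailing /@attr step from the XPath, selects the elements, and then reads the attribute from each. The helper name select_values is hypothetical and is not part of OpenScraping or Html Agility Pack. ElementTree requires relative paths, so the example uses .//a where the issue uses //a.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical helper for illustration only; not OpenScraping's implementation.
# Matches a trailing attribute step such as "/@href".
_ATTR_SUFFIX = re.compile(r"/@([A-Za-z_][\w.\-]*)$")


def select_values(root, xpath):
    """Return element text, or attribute values if the path ends in /@attr."""
    match = _ATTR_SUFFIX.search(xpath)
    if match is None:
        # No attribute step: return the text content of each matched element.
        return ["".join(el.itertext()) for el in root.findall(xpath)]

    element_path = xpath[:match.start()]  # e.g. ".//a"
    attr = match.group(1)                 # e.g. "href"
    # Select the elements first, then read the attribute from each one,
    # skipping elements that do not carry it.
    return [
        el.get(attr)
        for el in root.findall(element_path)
        if el.get(attr) is not None
    ]


root = ET.fromstring('<div><a href="test">dfsdfsdf</a></div>')
print(select_values(root, ".//a/@href"))  # ['test']
print(select_values(root, ".//a"))        # ['dfsdfsdf']
```

This only handles an attribute step at the very end of the path, which is enough for the example in this issue but not a general XPath implementation.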
I could use this. I'm happy to do the work if need be. @zmarty
I'm pretty happy using the library, but I really need to extract links using /a/@href or some other method in the lib. Any chance this will be available soon?
@zmarty The library is working great so far, but I need to extract the href attribute of a link as well. A workaround would be great.
@cbracht Acknowledged, will work on it.
Thank you, your work is appreciated!
I made it work like this:
And to use:
Thanks for the workaround! I am working on a permanent solution in another branch. Will merge to master as soon as it's ready.
@zmarty Do you have a fix for this? Thanks for the good work.
Not yet, sorry. I did not have time to finish it.
No worries. Thanks for the prompt reply. The hack posted by @marcel-silva is working fine for now. When you release the next version, please also update the dependencies; the current ones are interfering with other stuff. Thank you.
Just wanted to chime in that I ran into this as well. Using @marcel-silva's workaround for the time being. Thanks again for making this! It's very handy.
I also really need this fix. Is it still being worked on?
I have fixed this issue in pull request #14
Thank you @shawnshaddock, this is now in master.
@shawnshaddock Reopening since checkin #14 breaks transformations such as CastToIntegerTransformation. I am working on a fix, since I need this for a project.
Fixed (hopefully permanently) through #21
<a href="test">dfsdfsdf</a>
I have tried syntax like this: //a/@href
It just returns the contents of the anchor tag, but I'm looking for "test" in the href attribute.
Is this possible?