Refactored the ContentBuilder out of the ContentExtractor #33
I'm unsure about the message log test; I'll need to dig a bit deeper to fix it.
37483e4 to 536001c
Conflicts: src/Graby.php
The tests pass locally, so I'm a bit confused now. Could somebody test?
And the tests are passing.
I'll have a look later this week 👌
The HostFingerPrint checks the given HTML and determines the website platform. In the default config, you have:

    'fingerprints' => array(
        '/\<meta\s*content=\"blogger\"\s*name=\"generator\"/i' => 'fingerprint.blogspot.com',
        '/\<meta\s*name=\"generator\"\s*content=\"Blogger\"/i' => 'fingerprint.blogspot.com',
        '/\<meta\s*name=\"generator\"\s*content=\"WordPress/i' => 'fingerprint.wordpress.com',
    ),

This means we look for that HTML content (the generator meta tag) and, if we find it, we use the corresponding value as the host and then apply the config file for that host (for example). Hope that's clear :)
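To make the lookup described above concrete, here is a minimal sketch of how a fingerprint table like this could be matched against a page. The function name `findHostByFingerprint` is hypothetical and is not Graby's actual API; the sketch also assumes PHP 7.1+ for the nullable return type. Only the three patterns come from the default config quoted above.

```php
<?php
// Hypothetical helper for illustration only; not Graby's implementation.
// Returns the fingerprint host for the first pattern that matches the HTML,
// or null when no pattern matches.
function findHostByFingerprint(string $html, array $fingerprints): ?string
{
    foreach ($fingerprints as $pattern => $host) {
        // preg_match() returns 1 on a match, 0 on no match, false on error.
        if (preg_match($pattern, $html) === 1) {
            return $host;
        }
    }

    return null;
}

// Patterns copied from the default config quoted in the comment above.
$fingerprints = array(
    '/\<meta\s*content=\"blogger\"\s*name=\"generator\"/i' => 'fingerprint.blogspot.com',
    '/\<meta\s*name=\"generator\"\s*content=\"Blogger\"/i' => 'fingerprint.blogspot.com',
    '/\<meta\s*name=\"generator\"\s*content=\"WordPress/i' => 'fingerprint.wordpress.com',
);

// Sample page (made up for this example).
$html = '<html><head><meta name="generator" content="WordPress 4.3" /></head></html>';

echo findHostByFingerprint($html, $fingerprints), "\n";
```

The returned value (here `fingerprint.wordpress.com`) would then be used as the host name when looking up a site config, so a self-hosted WordPress blog could pick up the generic WordPress rules even though its real domain has no config of its own.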
Looks good to me, I'll tweak a few things after merge. Can't wait for the future steps in Wallabag :)
Refactored the ContentBuilder out of the ContentExtractor
Moves the logic used in ContentExtractor::buildSiteConfig() to a new method in the ContentBuilder. I have left the HostFingerPrint part in the ContentExtractor, as I'm not 100% sure what it does (feedback welcome).

This change makes it possible to build site config objects from anywhere using the host name, as done by http://github.com/wallabag/wallabag in the site login feature PR.
TODO