Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/link extraction #1

Merged
merged 3 commits into from Apr 18, 2019
Merged

Feature/link extraction #1

merged 3 commits into from Apr 18, 2019

Conversation

maciekmm
Copy link
Collaborator

No description provided.


object HeuristicExtractor {

val unlikelyClassCandidates: Array[String] = Array("-ad-", "ai2html", "banner", "breadcrumbs", "combx", "comment", "community", "cover-wrap", "disqus", "extra", "foot", "gdpr", "header", "legends", "menu", "related", "remark", "replies", "rss", "shoutbox", "sidebar", "skyscraper", "social", "sponsor", "supplemental", "ad-break", "agegate", "pagination", "pager", "popup", "yom-remote")
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you've made these two collections fields of a companion object, they can't be edited anyway, am I right? So maybe they should be immutable sequences?
Here is an entry on scala sequences: https://docs.scala-lang.org/overviews/collections/overview.html

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You are correct, I have addressed the issue. New patch no longer calls toArray after asScala as well.

@zemiret
Copy link
Owner

zemiret commented Apr 17, 2019

Overall, there's not much to review in here. Looks good.
As a sidenote, I think you have a problem with being too clever in your code, thus making it less explicit than it should be. 😄 🗡️

@zemiret zemiret assigned zemiret and unassigned zemiret Apr 17, 2019
@zemiret zemiret merged commit 5c7eccd into master Apr 18, 2019
@zemiret zemiret deleted the feature/link-extraction branch April 18, 2019 11:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants