Tom Götz tgoetz

Organizations

@Veritando
Support for v3 of diffbot API?
tgoetz pushed to master at tgoetz/core
tgoetz opened pull request wicketstuff/core#402
Revert non-ajax support
2 commits with 80 additions and 560 deletions
tgoetz pushed to master at tgoetz/core
tgoetz pushed to master at tgoetz/core
  • @tgoetz d46d150
    Revert non-ajax support from commit 1f7aaeb
tgoetz pushed to master at tgoetz/core
tgoetz commented on pull request wicketstuff/core#368

IMHO this implementation breaks existing implementations; see the current discussion here: http://markmail.org/thread/u2ccale6aubcmm5p

tgoetz created branch dev at tgoetz/wicket-select2
run-app fails with BeanDefinitionStoreException
  • @yasserg 70fe6f1
    Merge pull request #36 from Veritando/master
tgoetz opened pull request yasserg/crawler4j#36
Provide factory method for creating the HttpUriRequest (default: HttpGet...
1 commit with 19 additions and 6 deletions
  • @tgoetz e01b206
    Provide factory method for creating the HttpUriRequest (default: Http…
  • 7e225a7
    Provide factory method for generating the HttpUriRequest (HttpGet) fo…
tgoetz commented on issue yasserg/crawler4j#35

I will provide a pull request, so you can have a look at my proposal.

tgoetz commented on issue yasserg/crawler4j#35

An easy solution could be: move the creation of the HttpGet into a new factory method in PageFetcher, which is easily overridable, e.g.: protected …
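
The factory-method idea in this comment can be sketched roughly as follows. This is only an illustration under stated assumptions, not crawler4j's actual code and not the truncated signature from the comment: `SketchRequest`, `SketchFetcher`, `GermanFetcher`, `newRequest`, and `prepare` are hypothetical names, and a plain map stands in for real HTTP headers so the example runs without any HTTP library.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for an HTTP request object (not a crawler4j or HttpClient class).
class SketchRequest {
    final String url;
    final Map<String, String> headers = new LinkedHashMap<>();

    SketchRequest(String url) {
        this.url = url;
    }
}

// Hypothetical fetcher: request creation is isolated in one overridable method.
class SketchFetcher {
    // Factory method: subclasses override this to customize the request.
    protected SketchRequest newRequest(String url) {
        return new SketchRequest(url);
    }

    // The fetch logic only ever obtains requests through the factory method.
    public SketchRequest prepare(String url) {
        SketchRequest request = newRequest(url);
        request.headers.putIfAbsent("User-Agent", "sketch-crawler");
        return request;
    }
}

// Subclass that adds a custom header without touching the fetch logic.
class GermanFetcher extends SketchFetcher {
    @Override
    protected SketchRequest newRequest(String url) {
        SketchRequest request = super.newRequest(url);
        request.headers.put("Accept-Language", "de-DE");
        return request;
    }
}

class FactoryMethodDemo {
    public static void main(String[] args) {
        SketchRequest request = new GermanFetcher().prepare("http://example.com/");
        System.out.println(request.url + " " + request.headers);
    }
}
```

The point of the design is that a caller who needs a header such as "Accept-Language" only overrides the one small method, while the rest of the fetcher stays unchanged.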

tgoetz commented on issue yasserg/crawler4j#35

Ok, I will try ;-) In a current crawling project, we need to supply a specific HTTP header (e.g. "Accept-Language") when fetching certain URLs in o…

Enable adding custom http headers when fetching urls
tgoetz commented on issue yasserg/crawler4j#34

Well, in my experience with website crawling, there are many situations where you need to decide whether to follow a link or not and you can't make t…

tgoetz commented on issue yasserg/crawler4j#34

i.e. you propose to ignore the url in shouldVisit(Page page, WebURL url), parse the page manually and add links to follow via myController.addSeed(…

Add link attributes information to WebURL
tgoetz created repository Veritando/test