Permalink
Browse files

Change warc importer to use defaultsurrogate-crawl profile, as reported

by LA_FORGE http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5990 and
analysed by @luccioman (see comment 510f11d)
it creates conflict using a other crawlprofile without setting originator.
  • Loading branch information...
reger24 committed May 21, 2017
1 parent 3b1d640 commit 039162fbf0eca808afd350d360c3bcfe62dc4195
Showing with 3 additions and 3 deletions.
  1. +3 −3 source/net/yacy/document/importer/WarcImporter.java
@@ -150,15 +150,15 @@ public void indexWarcRecords(InputStream f) throws IOException {
requestHeader.referer() == null ? null : requestHeader.referer().hash(),
"warc",
responseHeader.lastModified(),
Switchboard.getSwitchboard().crawler.defaultRemoteProfile.handle(), // use remote profile (to index text & media, without writing to cache
Switchboard.getSwitchboard().crawler.defaultSurrogateProfile.handle(),
0,
Switchboard.getSwitchboard().crawler.defaultRemoteProfile.timezoneOffset());
Switchboard.getSwitchboard().crawler.defaultSurrogateProfile.timezoneOffset());
final Response response = new Response(
request,
requestHeader,
responseHeader,
Switchboard.getSwitchboard().crawler.defaultRemoteProfile,
Switchboard.getSwitchboard().crawler.defaultSurrogateProfile,
false,
content
);

0 comments on commit 039162f

Please sign in to comment.