怎样使nutch从一个失败的process开始-xpjjy-ChinaUnix博客

Chinaunix首页 | 论坛 | 博客

首页　| 　博文目录　| 　关于我

博客访问： 575865
博文数量： 136
博客积分： 4010
博客等级：上校
技术积分： 1343
用户组：普通用户
注册时间： 2008-08-19 23:18

文章分类

全部博文（136）

unix（2）
nutch（7）
SSH笔记（1）
我的文章（126）

MyEclipse（2）

EXTJS（4）

struts2（4）

Spring（2）

Oracle（2）

GIS_Java（1）

swt（1）

jms（1）

异常大杂烩（4）

异常锦集（0）

tomcat（0）

heritrix（4）

算法（4）

Servlet（5）

J2EE（3）

struts（17）

Hibernate（21）

SQL（6）

xml（0）

javascript（4）

jsp（8）

java（32）
未分配的博文（0）

文章存档

2011年（28）

2009年（60）

2008年（48）

我的朋友

最近访客

推荐博文

相关博文

怎样使nutch从一个失败的process开始

分类： Java

2009-01-09 14:52:15

Well, you can not. However, you have two choices to proceed:

1) Recover the pages already fetched and than restart the fetcher.
- You'll need to create a file fetcher.done in the segment directory an than: , and . Assuming your index is at /index
```
% touch /index/segments/2005somesegment/fetcher.done 

% bin/nutch updatedb /index/db/ /index/segments/2005somesegment/

% bin/nutch generate /index/db/ /index/segments/2005somesegment/

% bin/nutch fetch /index/segments/2005somesegment
```
  All the pages that were not crawled will be re-generated for fetch. If you fetched lots of pages, and don't want to have to re-fetch them again, this is the best way.
2) Discard the aborted output.
- Delete all folders from the segment folder except the fetchlist folder and restart the fetcher.

阅读(593) | 评论(0) | 转发(0) |

0

上一篇：如何增加Nutch中Summary的长度

下一篇：050529 011245 fetch okay, but can't parse myfile,

给主人留下些什么吧！~~

关于我们 | 关于IT168 | 联系方式 | 广告合作 | 法律声明 | 免费注册

Copyright 2001-2010 ChinaUnix.net All Rights Reserved 北京皓辰网域网络信息技术有限公司. 版权所有

感谢所有关心和支持过ChinaUnix的朋友们