[root@localhost local]# bin/nutch crawl urls -depth 3 -threads 100
InjectorJob: Using class org.apache.gora.cassandra.store.CassandraStore as the Gora storage class.
InjectorJob: total number of urls rejected by filters: 1
InjectorJob: total number of urls injected after normalization and filtering: 0
FetcherJob: threads: 100
FetcherJob: parsing: false
FetcherJob: resuming: false
FetcherJob : timelimit set for : -1
Using queue mode : byHost
Fetcher: threads: 100
QueueFeeder finished: total 0 records. Hit by time limit :0
-finishing thread FetcherThread0, activeThreads=0
...
-finishing thread FetcherThread94, activeThreads=0
Fetcher: throughput threshold: -1
0/0 spinwaiting/active, 0 pages, 0 errors, 0.0 0 pages/s, 0 0 kb/s, 0 URLs in 0 queues
-activeThreads=0
ParserJob: resuming: false
ParserJob: forced reparse: false
ParserJob: parsing all
FetcherJob: threads: 100
FetcherJob: parsing: false
FetcherJob: resuming: false
FetcherJob : timelimit set for : -1
Using queue mode : byHost
Fetcher: threads: 100
...
-activeThreads=0
ParserJob: resuming: false
ParserJob: forced reparse: false
ParserJob: parsing all
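Reply from a uj5u.com user:
I ran into this problem too, and I can only fetch a single page. Did you manage to solve it?
Reply from a uj5u.com user:
Some pages just cannot be fetched. If you crawl more, something will always come through, so I simply skip the ones that fail...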
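A note on the log above: the InjectorJob lines report one URL rejected by filters and zero URLs injected, which is why every later FetcherJob and ParserJob round has 0 records to work on. One possible cause, and it is only an assumption since the thread does not show the seed list or the filter configuration, is that the seed URL did not pass the URL filter rules. The sketch below imitates in Python the "first matching rule wins, no match means rejected" behaviour of a Nutch-style regex URL filter file. It is not Nutch code; the rules, function names, and URLs are hypothetical, and Python's regex syntax only approximates Java's.

```python
import re

# Hypothetical rules written in the style of a Nutch regex URL filter file:
# each rule is '+' (accept) or '-' (reject) followed by a regular expression.
RULES = """
# skip file:, ftp: and mailto: URLs
-^(file|ftp|mailto):
# skip URLs containing characters that usually indicate a query
-[?*!@=]
# accept anything else
+.
"""


def parse_rules(text):
    """Turn rule text into a list of (accept, compiled_pattern) pairs."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # ignore blank lines and comments
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError("rule must start with '+' or '-': " + line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(url, rules):
    """Return the verdict of the first rule whose pattern is found in url.

    A URL that matches no rule at all is rejected.
    """
    for accept, pattern in rules:
        if pattern.search(url):
            return accept
    return False


if __name__ == "__main__":
    rules = parse_rules(RULES)
    # Hypothetical seed URLs, used only to show which ones would be dropped.
    for seed in ("http://example.com/",
                 "http://example.com/list?page=1",
                 "ftp://example.com/file.txt"):
        print(seed, "accepted" if accepts(seed, rules) else "rejected")
```

Running a seed URL through a check like this against the rules actually in use can show whether the "rejected by filters: 1" line is explained by the filter configuration rather than by the fetcher.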
