
Counting the words in the files under /wordcount:
hadoop jar hadoop-mapreduce-examples-2.4.1.jar wordcount /wordcount2 /output2
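The job printed the following (this is only part of it). How can I tell whether the word counts are correct, and which part is the actual result?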
Map-Reduce Framework
Map input records=1
Map output records=5
Map output bytes=57
Map output materialized bytes=61
Input split bytes=101
Combine input records=5
Combine output records=4
Reduce input groups=4
Reduce shuffle bytes=61
Reduce input records=4
Reduce output records=4
Spilled Records=8
Shuffled Maps =1
Failed Shuffles=0
Merged Map outputs=1
GC time elapsed (ms)=707
CPU time spent (ms)=5290
Physical memory (bytes) snapshot=221282304
Virtual memory (bytes) snapshot=631783424
Total committed heap usage (bytes)=137498624
Shuffle Errors
Reply from a uj5u.com user:
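Figured it out: the counters above are only job statistics, and the result itself is written to the output directory. You can run hadoop fs -cat /output/part-r-00000 to see the count for each word (for the job above, whose output directory is /output2, that would be /output2/part-r-00000).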
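As a rough cross-check against the counters: in the wordcount example, "Map output records" (5 here) is the number of word occurrences emitted, and "Reduce output records" (4 here) is the number of distinct words written. The sketch below totals a result file so the two numbers can be compared. The function name summarize_counts and the sample words are made up for illustration; it assumes the default tab-separated "word<TAB>count" output lines.

```shell
#!/bin/sh
# summarize_counts is a hypothetical helper, not part of Hadoop.
# It reads "word<TAB>count" lines on stdin and prints the number of
# distinct words and the sum of all counts.
summarize_counts() {
    awk -F'\t' '{ n++; total += $2 } END { printf "distinct=%d total=%d\n", n, total }'
}

# Against the real job output (assumes the /output2 directory from the question):
#   hadoop fs -cat /output2/part-r-* | summarize_counts

# Local demo with made-up words shaped like the job above
# (4 distinct words, 5 occurrences in total):
printf 'hello\t2\nworld\t1\nfoo\t1\nbar\t1\n' | summarize_counts
```

If "distinct" matches Reduce output records and "total" matches Map output records, the output file is consistent with the counters. Whether each individual count is right still has to be checked against the input text.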
