Hadoop Sort Benchmark Using hadoop-examples
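With the hadoop-examples jar you can generate random data, sort that data, and measure the total elapsed time to get a rough estimate of the cluster's performance.
Generate the random data with the following command: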
$ bin/hadoop jar hadoop-examples-1.2.1.jar randomwriter -D test.randomwrite.bytes_per_map=100 -D test.randomwriter.maps_per_host=10 data/unsorted-data
hadoop@hadoop-VirtualBox:/usr/local/hadoop-1.2.1$ bin/hadoop jar hadoop-examples-1.2.1.jar randomwriter -D test.randomwrite.bytes_per_map=100 -D test.randomwriter.maps_per_host=10 data/unsorted-data
Running 10 maps.
Job started: Wed Oct 23 11:30:18 KST 2013
13/10/23 11:30:18 INFO mapred.JobClient: Running job: job_201310231129_0001
13/10/23 11:30:19 INFO mapred.JobClient: map 0% reduce 0%
13/10/23 11:30:26 INFO mapred.JobClient: map 20% reduce 0%
13/10/23 11:30:29 INFO mapred.JobClient: map 30% reduce 0%
13/10/23 11:30:30 INFO mapred.JobClient: map 40% reduce 0%
13/10/23 11:30:33 INFO mapred.JobClient: map 60% reduce 0%
13/10/23 11:30:37 INFO mapred.JobClient: map 80% reduce 0%
13/10/23 11:30:41 INFO mapred.JobClient: map 90% reduce 0%
13/10/23 11:30:42 INFO mapred.JobClient: map 100% reduce 0%
13/10/23 11:30:42 INFO mapred.JobClient: Job complete: job_201310231129_0001
13/10/23 11:30:42 INFO mapred.JobClient: Counters: 21
13/10/23 11:30:42 INFO mapred.JobClient: Job Counters
13/10/23 11:30:42 INFO mapred.JobClient: SLOTS_MILLIS_MAPS=39263
13/10/23 11:30:42 INFO mapred.JobClient: Total time spent by all reduces waiting after reserving slots (ms)=0
13/10/23 11:30:42 INFO mapred.JobClient: Total time spent by all maps waiting after reserving slots (ms)=0
13/10/23 11:30:42 INFO mapred.JobClient: Launched map tasks=10
13/10/23 11:30:42 INFO mapred.JobClient: SLOTS_MILLIS_REDUCES=0
13/10/23 11:30:42 INFO mapred.JobClient: File Input Format Counters
13/10/23 11:30:42 INFO mapred.JobClient: Bytes Read=0
13/10/23 11:30:42 INFO mapred.JobClient: File Output Format Counters
13/10/23 11:30:42 INFO mapred.JobClient: Bytes Written=49284
13/10/23 11:30:42 INFO mapred.JobClient: org.apache.hadoop.examples.RandomWriter$Counters
13/10/23 11:30:42 INFO mapred.JobClient: BYTES_WRITTEN=48164
13/10/23 11:30:42 INFO mapred.JobClient: RECORDS_WRITTEN=10
13/10/23 11:30:42 INFO mapred.JobClient: FileSystemCounters
13/10/23 11:30:42 INFO mapred.JobClient: HDFS_BYTES_READ=1190
13/10/23 11:30:42 INFO mapred.JobClient: FILE_BYTES_WRITTEN=570920
13/10/23 11:30:42 INFO mapred.JobClient: HDFS_BYTES_WRITTEN=49284
13/10/23 11:30:42 INFO mapred.JobClient: Map-Reduce Framework
13/10/23 11:30:42 INFO mapred.JobClient: Map input records=10
13/10/23 11:30:42 INFO mapred.JobClient: Physical memory (bytes) snapshot=597352448
13/10/23 11:30:42 INFO mapred.JobClient: Spilled Records=0
13/10/23 11:30:42 INFO mapred.JobClient: CPU time spent (ms)=3930
13/10/23 11:30:42 INFO mapred.JobClient: Total committed heap usage (bytes)=349700096
13/10/23 11:30:42 INFO mapred.JobClient: Virtual memory (bytes) snapshot=3946369024
13/10/23 11:30:42 INFO mapred.JobClient: Map input bytes=0
13/10/23 11:30:42 INFO mapred.JobClient: Map output records=10
13/10/23 11:30:42 INFO mapred.JobClient: SPLIT_RAW_BYTES=1190
Job ended: Wed Oct 23 11:30:42 KST 2013
The job took 23 seconds.
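
Since the benchmark boils down to comparing elapsed times between runs, it can help to pull the key numbers out of a saved console log instead of reading them by eye. The following is a minimal sketch, not part of Hadoop: the function name summarize_job_log and the log file name are hypothetical, and it assumes the log contains a "The job took N seconds." line and an HDFS_BYTES_WRITTEN counter shaped like the output above.

```shell
#!/bin/sh
# Hypothetical helper (not part of Hadoop): summarize a saved job console log.
# Assumes the log has the same line shapes as the randomwriter output above.
summarize_job_log() {
    log_file="$1"

    # "The job took 23 seconds." -> 23
    seconds=$(sed -n 's/^The job took \([0-9][0-9]*\) seconds\..*$/\1/p' "$log_file" | head -n 1)

    # "... mapred.JobClient: HDFS_BYTES_WRITTEN=49284" -> 49284
    bytes=$(sed -n 's/^.*[: ]HDFS_BYTES_WRITTEN=\([0-9][0-9]*\).*$/\1/p' "$log_file" | head -n 1)

    # Bail out if either value is missing, or if dividing would fail.
    if [ -z "$seconds" ] || [ -z "$bytes" ] || [ "$seconds" -eq 0 ]; then
        echo "could not summarize $log_file" >&2
        return 1
    fi

    # Integer division; good enough for a rough comparison between runs.
    echo "seconds=$seconds bytes=$bytes bytes_per_sec=$((bytes / seconds))"
}
```

To use it, save the console output of a run to a file (redirecting both stdout and stderr, for example with `> randomwriter.log 2>&1`, is the safe choice) and call `summarize_job_log randomwriter.log`. For the run shown above this would print `seconds=23 bytes=49284 bytes_per_sec=2142`. With a data set this small, the figure is likely dominated by job startup overhead rather than actual write throughput.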