티스토리 뷰
hadoop-example을 이용하여 랜덤 데이터를 생성한 후 이 데이터를 다시 정렬해 총 걸리는 시간을 측정하여 성능을 유추해 낼 수 있다.
다음의 명령을 사용해 random 데이터를 생성한다
$bin/hadoop jar hadoop-example-1.2.1.jar randomwriter -D test.randomwrite.bytes_per_map=100 -D test.randomwriter.maps_per_host = 10 data/unsorted-data
hadoop@hadoop-VirtualBox:/usr/local/hadoop-1.2.1$ bin/hadoop jar hadoop-examples-1.2.1.jar randomwriter -D test.randomwrite.bytes_per_map=100 -D test.randomwriter.maps_per_host=10 data/unsorted-data
Running 10 maps.
Job started: Wed Oct 23 11:30:18 KST 2013
13/10/23 11:30:18 INFO mapred.JobClient: Running job: job_201310231129_0001
13/10/23 11:30:19 INFO mapred.JobClient: map 0% reduce 0%
13/10/23 11:30:26 INFO mapred.JobClient: map 20% reduce 0%
13/10/23 11:30:29 INFO mapred.JobClient: map 30% reduce 0%
13/10/23 11:30:30 INFO mapred.JobClient: map 40% reduce 0%
13/10/23 11:30:33 INFO mapred.JobClient: map 60% reduce 0%
13/10/23 11:30:37 INFO mapred.JobClient: map 80% reduce 0%
13/10/23 11:30:41 INFO mapred.JobClient: map 90% reduce 0%
13/10/23 11:30:42 INFO mapred.JobClient: map 100% reduce 0%
13/10/23 11:30:42 INFO mapred.JobClient: Job complete: job_201310231129_0001
13/10/23 11:30:42 INFO mapred.JobClient: Counters: 21
13/10/23 11:30:42 INFO mapred.JobClient: Job Counters
13/10/23 11:30:42 INFO mapred.JobClient: SLOTS_MILLIS_MAPS=39263
13/10/23 11:30:42 INFO mapred.JobClient: Total time spent by all reduces waiting after reserving slots (ms)=0
13/10/23 11:30:42 INFO mapred.JobClient: Total time spent by all maps waiting after reserving slots (ms)=0
13/10/23 11:30:42 INFO mapred.JobClient: Launched map tasks=10
13/10/23 11:30:42 INFO mapred.JobClient: SLOTS_MILLIS_REDUCES=0
13/10/23 11:30:42 INFO mapred.JobClient: File Input Format Counters
13/10/23 11:30:42 INFO mapred.JobClient: Bytes Read=0
13/10/23 11:30:42 INFO mapred.JobClient: File Output Format Counters
13/10/23 11:30:42 INFO mapred.JobClient: Bytes Written=49284
13/10/23 11:30:42 INFO mapred.JobClient: org.apache.hadoop.examples.RandomWriter$Counters
13/10/23 11:30:42 INFO mapred.JobClient: BYTES_WRITTEN=48164
13/10/23 11:30:42 INFO mapred.JobClient: RECORDS_WRITTEN=10
13/10/23 11:30:42 INFO mapred.JobClient: FileSystemCounters
13/10/23 11:30:42 INFO mapred.JobClient: HDFS_BYTES_READ=1190
13/10/23 11:30:42 INFO mapred.JobClient: FILE_BYTES_WRITTEN=570920
13/10/23 11:30:42 INFO mapred.JobClient: HDFS_BYTES_WRITTEN=49284
13/10/23 11:30:42 INFO mapred.JobClient: Map-Reduce Framework
13/10/23 11:30:42 INFO mapred.JobClient: Map input records=10
13/10/23 11:30:42 INFO mapred.JobClient: Physical memory (bytes) snapshot=597352448
13/10/23 11:30:42 INFO mapred.JobClient: Spilled Records=0
13/10/23 11:30:42 INFO mapred.JobClient: CPU time spent (ms)=3930
13/10/23 11:30:42 INFO mapred.JobClient: Total committed heap usage (bytes)=349700096
13/10/23 11:30:42 INFO mapred.JobClient: Virtual memory (bytes) snapshot=3946369024
13/10/23 11:30:42 INFO mapred.JobClient: Map input bytes=0
13/10/23 11:30:42 INFO mapred.JobClient: Map output records=10
13/10/23 11:30:42 INFO mapred.JobClient: SPLIT_RAW_BYTES=1190
Job ended: Wed Oct 23 11:30:42 KST 2013
The job took 23 seconds.
'Hadoop & Mapreduce' 카테고리의 다른 글
HDFS append 활성화 시키기 (0) | 2014.01.22 |
---|---|
$HADOOP_HOME is deprecated 경고문 대처법 (0) | 2013.12.31 |
단일 tasktracker가 실행할 map과 reduce 최대 갯수 설정하기 (0) | 2013.10.23 |
hadoop 로그저장 폴더 변경하기 (0) | 2013.10.23 |
기타 HDFS 설정 (0) | 2013.10.21 |
- Total
- Today
- Yesterday
- L470
- t470p
- 소켓
- 레노버
- Python
- L570
- x270
- socket
- t450s
- 키보드
- x1
- 파이썬
- t470
- c++
- t460s
- x1 carbon 5th
- t470s
- x1 carbon
- thinkpad 13
- x1 카본 5세대
- x1 carbon 2017
- x260
- x1c
- lenovo
- Yoga 370
- x1 카본 2017
- t570
- 더헌트맨
- x250
- Thinkpad
일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | ||
6 | 7 | 8 | 9 | 10 | 11 | 12 |
13 | 14 | 15 | 16 | 17 | 18 | 19 |
20 | 21 | 22 | 23 | 24 | 25 | 26 |
27 | 28 | 29 | 30 |