资源简介
求所有文件中数值的最大三个值( Top 3 ),并按降序输出。MR编程mapper,reducer,combiner类的源代码
代码片段和文件信息
package com.top3;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Collections;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
public class TOP3 {
//@author YQ
public static class TopNMap extends Mapperject Text IntWritable NullWritable>{
private static IntWritable data = new IntWritable();
@Override
protected void map(object key Text valueContext context)
throws IOException InterruptedException {
//把字符串转为数值
data.set(Integer.parseInt(value.toString().trim()));
context.write(data NullWritable.get());
// System.out.println(data);
}
}
public static class TopNReducer extends Reducer{
private ArrayList keyList = new ArrayList();
private IntWritable num = new IntWritable();
@Override
public void reduce(IntWritable key Iterable values Context context) throws IOException InterruptedException {
//int kk = key.get();
keyList.add(key.get());
//System.out.println(kk);
}
@Override
protected void cleanup(Context context) throws IOException InterruptedException{
Collections.sort(keyList);//从
相关资源
- 大数据hadoop,spark,hive等等面试汇总
- 《Hadoop with Python》 pdf
- 绝对可用hadoop-eclipse-plugin-1.1.2.jar
- hadoop-common-2.7.1-bin
- hadoop2.5.0 snappy编译jar包
- Hadoop 2.7.4 Windows 64位 编译bin包含win
- hadoop-hdfs-2.7.7.jar
- 修复版 hadoop-0.21.0-eclipse-plugin.jar
- hadoop-eclipse-plugin-1.2.1.jar
- Native Hadoop3.2.1 Library 64位编译
- 华农Linux系统及程序设计实验全部代码
- hadoop-common-2.7.3.jar
- avro-1.8.1
- Hadoop2.7.7配合使用的winutil
- hadoop 32位lib包
- hadoop电商网站分析系统毕业设计项目
- hadoop-eclipse-plugin-2.7.5.jar275778
- Hadoop-Eclipse插件2.7.6
- JavaWeb操作hadoop2.6 HDFS从页面上传,,
- hadoop-eclipse-plugin-2.7.3.jar 插件
- hadoop-eclipse-plugin-2.7.2完美兼容版
- hadoop-eclipse-plugin-2.6.4.jar
- MapReduce分析年气象数据用源代码
- hadoop-eclipse-plugin-2.8.5.jar
- hadoop-eclipse-plugin-3.1.2.jar
- Hadoop的jar包
- hadoop-eclipse-plugin-2.9.2.jar 插件
- hadoop-eclipse-plugin-2.7.7.jar 插件
- hadoop-3.1.3.tar.gz
- hadoop-eclipse-plugin-2.7.3.jar
评论
共有 条评论