hadoop使用简介

2023-10-29 14:35:48 浏览数 (1)

git clone

hadoop源码地址:https://gitee.com/CHNnoodle/hadoop.git git clone错误:

Filename too long错误,使用git config --global core.longpaths true git clone https://gitee.com/CHNnoodle/hadoop.git -b rel/release-3.2.2 拉取指定tag版本

hadoop安装包地址:https://mirrors.cloud.tencent.com/apache/hadoop/common/

windows平台下载,https://github.com/cdarlint/winutils替换hadoop对应版本的bin文件

hadoop

  • core-site.xml,hadoop中间文件的输出路径
代码语言:javascript复制
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
    <property>
      <name>hadoop.tmp.dir</name>
      <value>/D:/ruanjian/hadoop/hadoop-3.3.5/data</value>    
    </property>
    <property>
      <name>fs.defaultFS</name>
      <value>hdfs://localhost:9000</value>
    </property>
</configuration>
  • hdfs-site.xml
代码语言:javascript复制
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
    <property>
        <name>dfs.replication</name>
        <value>1</value>
    </property>
    <property>
        <name>dfs.namenode.name.dir</name>
        <value>/D:/ruanjian/hadoop/hadoop-3.3.5/data/namenode</value>
    </property>
    <property>
        <name>dfs.datanode.data.dir</name>
        <value>/D:/ruanjian/hadoop/hadoop-3.3.5/data/datanode</value>
    </property>
</configuration>
  • mapred-site.xml
代码语言:javascript复制
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
     <property>
            <name>mapreduce.framework.name</name>
             <value>yarn</value>
    </property>
</configuration>
  • yarn-site.xml
代码语言:javascript复制
<?xml version="1.0"?>
<configuration>
     <property>
        <name>yarn.nodemanager.aux-services</name>
        <value>mapreduce_shuffle</value>
    </property>
    <property>
        <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
        <value>org.apache.hahoop.mapred.ShuffleHandler</value>
    </property>
</configuration>

启动hadoop

hdfs namenode -format //格式化节点 sbin/start-all.cmd //启动hadoop

浏览器访问

  • 访问集群节点:http://localhost:8088/
  • 访问HDFS:http://localhost:9870/

0 人点赞