
Environment configuration for connecting Kettle to Hive

Connecting Kettle to Hive

1) First, copy the JAR files from the hive/lib directory into the Kettle directory D:\software\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh510\lib (adjust the path to match your own installation).
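The copy step above can also be scripted. The following is a minimal sketch, not part of the original procedure: the function name copy_hive_jars is hypothetical, and it assumes the Hive JARs are reachable as a local folder (for example, downloaded from the cluster first).

```python
import shutil
from pathlib import Path


def copy_hive_jars(hive_lib, shim_lib):
    """Copy every .jar from hive_lib into shim_lib.

    Files that already exist in shim_lib are left untouched.
    Returns the names of the JARs that were actually copied.
    """
    hive_lib = Path(hive_lib)
    shim_lib = Path(shim_lib)
    shim_lib.mkdir(parents=True, exist_ok=True)
    copied = []
    for jar in sorted(hive_lib.glob("*.jar")):
        target = shim_lib / jar.name
        if not target.exists():
            shutil.copy2(jar, target)
            copied.append(jar.name)
    return copied


# Example call (both paths are assumptions; adjust them to your machine):
# copy_hive_jars(
#     r"D:\downloads\hive\lib",
#     r"D:\software\data-integration\plugins\pentaho-big-data-plugin"
#     r"\hadoop-configurations\cdh510\lib",
# )
```

Skipping files that already exist keeps the JARs shipped with the shim from being overwritten; whether that is the right choice depends on which versions your cluster needs.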

2) Go to the D:\software\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh510 directory.

In core-site.xml, add:

<configuration>

<property>

    <name>fs.defaultFS</name>

    <value>hdfs://bigData01:9000</value>

</property>

<property>

    <name>hadoop.tmp.dir</name>

    <value>/opt/modules/hadoop-2.6.0-cdh5.13.0/data</value>

</property>

</configuration>

In hive-site.xml, add:

<property>

    <name>hive.metastore.local</name>

    <value>false</value>

</property>

<property>

    <name>hive.metastore.uris</name>

    <value>thrift://ip-address:9083</value>

</property>

<property>

    <name>hive.exec.reducers.bytes.per.reducer</name>

    <value>1073741824</value>

</property>

<property>

    <name>hive.support.concurrency</name>

    <value>true</value>

</property>

<property>

    <name>hive.server2.thrift.port</name>
    
    <value>10000</value>

</property>

<property>

    <name>hive.server2.thrift.min.worker.threads</name>

    <value>5</value>

</property>

<property>

    <name>hive.server2.thrift.max.worker.threads</name>

    <value>500</value>

</property>

<property>

    <name>hive.jdbc_passwd.auth.zjl</name>

    <value>123456</value>

    <description/>

</property>
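A typo in one of these entries is easy to miss, so it can help to read the file back and inspect the values. The sketch below is only an illustration: read_hadoop_properties is a hypothetical helper, the sample text is a shortened stand-in for the real hive-site.xml, and 192.0.2.10 is a placeholder address.

```python
import xml.etree.ElementTree as ET


def read_hadoop_properties(xml_text):
    """Return {name: value} for every <property> element in a Hadoop-style config."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


# Shortened sample; the address is a placeholder, not a value from the article.
SAMPLE = """
<configuration>
  <property>
    <name>hive.metastore.uris</name>
    <value>thrift://192.0.2.10:9083</value>
  </property>
  <property>
    <name>hive.server2.thrift.port</name>
    <value>10000</value>
  </property>
</configuration>
"""

props = read_hadoop_properties(SAMPLE)
print(props["hive.metastore.uris"])
print(props["hive.server2.thrift.port"])
```

To check the real file, read its contents with open(...).read() and pass the text to the same function.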

In mapred-site.xml, add the following:

<property>

    <name>mapreduce.framework.name</name>

    <value>yarn</value>

</property>

In yarn-site.xml, add:

<property>

    <name>yarn.nodemanager.aux-services</name>

    <value>mapreduce_shuffle</value>

</property>

3) Locate the D:\software\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh510 directory.

Register cdh510 as the active configuration in the plugin.properties file under D:\software\data-integration\plugins\pentaho-big-data-plugin.

Add: active.hadoop.configuration=cdh510
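If plugin.properties already contains an active.hadoop.configuration line, that line should be changed rather than duplicated. A minimal sketch of that edit follows; set_active_shim is a hypothetical helper that works on the file's text, and the value hdp25 in the usage example is invented for illustration.

```python
KEY = "active.hadoop.configuration"


def set_active_shim(properties_text, shim_name):
    """Return properties_text with KEY set to shim_name.

    An existing, uncommented KEY entry is replaced in place;
    if none exists, a new entry is appended at the end.
    """
    lines = properties_text.splitlines()
    new_line = f"{KEY}={shim_name}"
    replaced = False
    for i, line in enumerate(lines):
        stripped = line.strip()
        if stripped.startswith("#"):
            continue  # leave comment lines alone
        if stripped.split("=", 1)[0].strip() == KEY:
            lines[i] = new_line
            replaced = True
    if not replaced:
        lines.append(new_line)
    return "\n".join(lines) + "\n"


# Usage with an invented starting value:
before = "# Pentaho big data plugin\nactive.hadoop.configuration=hdp25\n"
after = set_active_shim(before, "cdh510")
print(after)
```

The name passed in must match the folder name under hadoop-configurations (cdh510 here).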