Environment configuration for connecting Kettle to Hive
阿新 • Published: 2019-01-23
Connecting Kettle to Hive
1) First, copy the JAR files from Hive's hive/lib directory into the Kettle directory D:\software\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh510\lib (substitute the path of your own installation).
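The copy in step 1 can also be scripted. Below is a minimal sketch, not part of the original post. The function name copy_hive_jars is made up for illustration, and skipping JARs that already exist in the target folder is my own cautious choice so that files shipped with the cdh510 folder are not overwritten. Both directories are passed on the command line, since the post does not say where Hive itself is installed.

```python
import shutil
import sys
from pathlib import Path
from typing import List


def copy_hive_jars(hive_lib: Path, kettle_lib: Path) -> List[str]:
    """Copy every *.jar from hive_lib into kettle_lib.

    Returns the names of the files that were actually copied.
    JARs already present in kettle_lib are left untouched (assumption:
    we do not want to overwrite files that are already there).
    """
    kettle_lib.mkdir(parents=True, exist_ok=True)
    copied = []
    for jar in sorted(hive_lib.glob("*.jar")):
        target = kettle_lib / jar.name
        if not target.exists():
            shutil.copy2(jar, target)
            copied.append(jar.name)
    return copied


# Usage: python copy_jars.py <hive lib dir> <kettle cdh510 lib dir>
if __name__ == "__main__" and len(sys.argv) == 3:
    for name in copy_hive_jars(Path(sys.argv[1]), Path(sys.argv[2])):
        print("copied", name)
```

Restart Kettle (Spoon) after copying so the new JARs are picked up.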
2) Go to the directory D:\software\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh510 and edit the following files.
In core-site.xml, add:
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://bigData01:9000</value>
  </property>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/opt/modules/hadoop-2.6.0-cdh5.13.0/data</value>
  </property>
</configuration>
In hive-site.xml, add (replace IP_ADDRESS with the address of your Hive metastore host):
<property>
  <name>hive.metastore.local</name>
  <value>false</value>
</property>
<property>
  <name>hive.metastore.uris</name>
  <value>thrift://IP_ADDRESS:9083</value>
</property>
<property>
  <name>hive.exec.reducers.bytes.per.reducer</name>
  <value>1073741824</value>
</property>
<property>
  <name>hive.support.concurrency</name>
  <value>true</value>
</property>
<property>
  <name>hive.server2.thrift.port</name>
  <value>10000</value>
</property>
<property>
  <name>hive.server2.thrift.min.worker.threads</name>
  <value>5</value>
</property>
<property>
  <name>hive.server2.thrift.max.worker.threads</name>
  <value>500</value>
</property>
<property>
  <name>hive.jdbc_passwd.auth.zjl</name>
  <value>123456</value>
  <description/>
</property>
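Before testing the connection in Kettle, it can help to check that the two ports named in hive-site.xml (9083 for the metastore, 10000 for HiveServer2) are reachable from the machine running Kettle. The following is a minimal sketch using only the Python standard library; the helper name port_is_open is hypothetical, and an open port only shows that something is listening, not that Hive itself is healthy.

```python
import socket
import sys


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution errors.
        return False


# Usage: python check_ports.py <hive host>
if __name__ == "__main__" and len(sys.argv) == 2:
    host = sys.argv[1]
    # Ports taken from the hive-site.xml settings above.
    for label, port in (("metastore", 9083), ("HiveServer2", 10000)):
        state = "open" if port_is_open(host, port) else "NOT reachable"
        print(f"{label} {host}:{port} is {state}")
```

If either port is reported as not reachable, check the firewall and whether the metastore and HiveServer2 services have been started on the server.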
In mapred-site.xml, add:
<property>
  <name>mapreduce.framework.name</name>
  <value>yarn</value>
</property>
In yarn-site.xml, add:
<property>
  <name>yarn.nodemanager.aux-services</name>
  <value>mapreduce_shuffle</value>
</property>
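A typo in one of these XML files is easy to miss, so a quick sanity check is to parse each edited file and list the properties it defines. This is a rough sketch with the standard xml.etree.ElementTree module; the function name read_site_properties is made up here, and it assumes each file has a single <configuration> root element wrapping the <property> entries, as core-site.xml does above.

```python
import sys
import xml.etree.ElementTree as ET
from pathlib import Path
from typing import Dict


def read_site_properties(xml_text: str) -> Dict[str, str]:
    """Parse the text of a Hadoop-style *-site.xml file into {name: value}.

    Raises xml.etree.ElementTree.ParseError if the XML is not well-formed.
    """
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            # A missing <value> is stored as an empty string.
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


# Usage: python check_site.py core-site.xml hive-site.xml ...
if __name__ == "__main__" and len(sys.argv) > 1:
    for path in sys.argv[1:]:
        print(path)
        for name, value in read_site_properties(Path(path).read_text()).items():
            print(f"  {name} = {value}")
```

A parse error here means the file is not well-formed XML (for example, a <property> block pasted outside the <configuration> element), which should be fixed before starting Kettle.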
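3) Activate the cdh510 configuration. The name of the folder D:\software\data-integration\plugins\pentaho-big-data-plugin\hadoop-configurations\cdh510 is the value to use.
Open the plugin.properties file in the D:\software\data-integration\plugins\pentaho-big-data-plugin directory
and add (or update) the line: active.hadoop.configuration=cdh510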
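If you switch configurations often, step 3 can be scripted as well. The sketch below is an illustration rather than anything Kettle provides: set_property is a hypothetical helper that rewrites a key in the text of a properties file, or appends it when it is missing. It is a simplification that handles only plain key=value lines, not every form the Java properties format allows.

```python
def set_property(text: str, key: str, value: str) -> str:
    """Return text with `key=value` set exactly once.

    Comment lines (starting with # or !) are kept as they are. If the key
    appears more than once, the first occurrence is rewritten and the
    others are dropped. If it does not appear, it is appended at the end.
    """
    out = []
    found = False
    for line in text.splitlines():
        stripped = line.strip()
        is_comment = stripped.startswith(("#", "!"))
        if not is_comment and "=" in stripped:
            if stripped.split("=", 1)[0].strip() == key:
                if not found:
                    out.append(f"{key}={value}")
                    found = True
                continue
        out.append(line)
    if not found:
        out.append(f"{key}={value}")
    return "\n".join(out) + "\n"


# Example with an in-memory string; read and write plugin.properties
# yourself, and keep a backup of the original file.
if __name__ == "__main__":
    print(set_property("# big data plugin\n", "active.hadoop.configuration", "cdh510"), end="")
```

Working on a string rather than on the file directly keeps the helper easy to test and leaves the decision about backups and file encoding to the caller.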