java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

阿新 • • 發佈：2020-11-23

https://stackoverflow.com/questions/35652665/java-io-ioexception-could-not-locate-executable-null-bin-winutils-exe-in-the-ha

I'm not able to run a simplesparkjob inScala IDE(Maven spark project) installed onWindows 7

Spark core dependency has been added.

val conf = new SparkConf().setAppName("DemoDF").setMaster("local")
val sc = new SparkContext(conf)
val logData = sc.textFile("File.txt")
logData.count()

Error:

16/02/26 18:29:33 INFO SparkContext: Created broadcast 0 from textFile at FrameDemo.scala:13
16/02/26 18:29:34 ERROR Shell: Failed to locate the winutils binary in the hadoop binary path
java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.
    at org.apache.hadoop.util.Shell.getQualifiedBinPath(Shell.java:278)
    at org.apache.hadoop.util.Shell.getWinUtilsPath(Shell.java:300)
    at org.apache.hadoop.util.Shell.<clinit>(Shell.java:293)
    at org.apache.hadoop.util.StringUtils.<clinit>(StringUtils.java:76)
    at org.apache.hadoop.mapred.FileInputFormat.setInputPaths(FileInputFormat.java:362)
    at <br>org.apache.spark.SparkContext$$anonfun$hadoopFile$1$$anonfun$33.apply(SparkContext.scala:1015)
    at org.apache.spark.SparkContext$$anonfun$hadoopFile$1$$anonfun$33.apply(SparkContext.scala:1015)
    at <br>org.apache.spark.rdd.HadoopRDD$$anonfun$getJobConf$6.apply(HadoopRDD.scala:176)
    at <br>org.apache.spark.rdd.HadoopRDD$$anonfun$getJobConf$6.apply(HadoopRDD.scala:176)<br>
    at scala.Option.map(Option.scala:145)<br>
    at org.apache.spark.rdd.HadoopRDD.getJobConf(HadoopRDD.scala:176)<br>
    at org.apache.spark.rdd.HadoopRDD.getPartitions(HadoopRDD.scala:195)<br>
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:239)<br>
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:237)<br>
    at scala.Option.getOrElse(Option.scala:120)<br>
    at org.apache.spark.rdd.RDD.partitions(RDD.scala:237)<br>
    at org.apache.spark.rdd.MapPartitionsRDD.getPartitions(MapPartitionsRDD.scala:35)<br>
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:239)<br>
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:237)<br>
    at scala.Option.getOrElse(Option.scala:120)<br>
    at org.apache.spark.rdd.RDD.partitions(RDD.scala:237)<br>
    at org.apache.spark.SparkContext.runJob(SparkContext.scala:1929)<br>
    at org.apache.spark.rdd.RDD.count(RDD.scala:1143)<br>
    at com.org.SparkDF.FrameDemo$.main(FrameDemo.scala:14)<br>
    at com.org.SparkDF.FrameDemo.main(FrameDemo.scala)<br>

eclipse scala apache-spark improve this question editedJan 30 '17 at 20:56 Glenn Slayden 13.8k33 gold badges8383 silver badges9595 bronze badges askedFeb 26 '16 at 13:12 Elvish_Blade 97011 gold badge99 silver badges1212 bronze badges add a comment

12 Answers

Active Oldest Votes 142

Hereis a good explanation of your problem with the solution.

Download winutils.exe fromhttp://public-repo-1.hortonworks.com/hdp-win-alpha/winutils.exe.
SetUp your HADOOP_HOME environment variable on the OS level or programmatically:

System.setProperty("hadoop.home.dir", "full path to the folder with winutils");
Enjoy

improve this answer editedApr 9 '19 at 15:51 answeredFeb 26 '16 at 13:21 Taky 4,96711 gold badge1616 silver badges2828 bronze badges

14 I have to set HADOOP_HOME to hadoop folder instead of the bin folder.–StanleyAug 29 '16 at 7:44
4 Also, be sure to download the correct winutils.exe based on the version of hadoop that spark is compiled for (so, not necessarily the link above). Otherwise, pain awaits :)–NP3Jun 30 '17 at 12:14
System.setProperty("hadoop.home.dir", "C:\\hadoop-2.7.1\\")–Shyam GuptaOct 14 '17 at 19:00
1 yes exactly as @Stanley says. worked with setting up the HADOOP_HOME to hadoop folder instead of the bin folder.–JazzApr 9 '19 at 13:09
@NP3 and how do you know that version? I am using latest pyspark. Thanks,–JDPeckhamNov 10 '19 at 19:12

show1more comment 67

Download winutils.exe
Create folder, sayC:\winutils\bin
Copywinutils.exeinsideC:\winutils\bin
Set environment variableHADOOP_HOMEtoC:\winutils

improve this answer editedAug 21 '17 at 7:09 Kenny John Jacob 43522 silver badges1313 bronze badges answeredSep 16 '16 at 7:26 Deokant Gupta 67155 silver badges22 bronze badges

also, if you have a cmd line open, restart it for the variables to take affect.–eychAug 21 '19 at 16:51

add a comment 26

Follow this:

Create abinfolder in any directory(to be used in step 3).
Downloadwinutils.exeand place it in the bin directory.
Now addSystem.setProperty("hadoop.home.dir", "PATH/TO/THE/DIR");in your code.

improve this answer editedJan 11 '17 at 6:28 answeredJan 11 '17 at 6:22 Ani Menon 21.5k1313 gold badges7575 silver badges100100 bronze badges

2 Thanks a lot, just what i was looking for–user373201Feb 27 '17 at 2:59
3 It is to be noted that the path to be pointed should not include the 'bin' directory. Ex: If the path where winutils.exe is "D://Hadoop//bin//winutils.exe" , then the path for hadoop.home.dir should be "D://Hadoop"–Keshav Pradeep RamanathMay 31 '18 at 10:30

add a comment 4

if we see below issue

ERROR Shell: Failed to locate the winutils binary in the hadoop binary path

java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

then do following steps

download winutils.exe fromhttp://public-repo-1.hortonworks.com/hdp-win-alpha/winutils.exe.
and keep this under bin folder of any folder you created for.e.g. C:\Hadoop\bin
and in program add following line before creating SparkContext or SparkConf System.setProperty("hadoop.home.dir", "C:\Hadoop");

improve this answer editedJun 20 at 9:12 Community♦ 111 silver badge answeredSep 4 '17 at 14:16 Prem S 22522 silver badges88 bronze badges add a comment 4

On Windows 10 - you should add two different arguments.

(1) Add the new variable and value as - HADOOP_HOME and path (i.e. c:\Hadoop) under System Variables.

(2) Add/append new entry to the "Path" variable as "C:\Hadoop\bin".

The above worked for me.

improve this answer answeredJun 27 '18 at 10:42 user1023627 17711 gold badge22 silver badges1111 bronze badges add a comment 4

1) Download winutils.exe from https://github.com/steveloughran/winutils 
2) Create a directory In windows "C:\winutils\bin
3) Copy the winutils.exe inside the above bib folder .
4) Set the environmental property in the code 
  System.setProperty("hadoop.home.dir", "file:///C:/winutils/");
5) Create a folder "file:///C:/temp" and give 777 permissions.
6) Add config property in spark Session ".config("spark.sql.warehouse.dir", "file:///C:/temp")"

improve this answer answeredSep 30 '18 at 7:39 Sampat Kumar 30611 gold badge22 silver badges1212 bronze badges add a comment 2

I got the same problem while running unit tests. I found this workaround solution:

The following workaround allows to get rid of this message:

    File workaround = new File(".");
    System.getProperties().put("hadoop.home.dir", workaround.getAbsolutePath());
    new File("./bin").mkdirs();
    new File("./bin/winutils.exe").createNewFile();

from:https://issues.cloudera.org/browse/DISTRO-544

improve this answer answeredApr 3 '18 at 18:28 Joabe Lucena 62155 silver badges1616 bronze badges add a comment 2

You can alternatively downloadwinutils.exefrom GITHub:

https://github.com/steveloughran/winutils/tree/master/hadoop-2.7.1/bin

replacehadoop-2.7.1with the version you want and place the file inD:\hadoop\bin

If you do not have access rights to the environment variable settings on your machine, simply add the below line to your code:

System.setProperty("hadoop.home.dir", "D:\\hadoop");

java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

https://stackoverflow.com/questions/35652665/java-io-ioexception-could-not-locate-executable-null-bin-winutils-exe-in-the-ha

Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

今天用idea連線HBase是報Could not locate executable null\\bin\\winutils.exe in the Hadoop binaries.錯誤：

java.io.IOException: Could not find resource com/xxx/xxxMapper.xml

java.io.IOException: Could not find resource com/xxx/xxxMapper.xml 報錯內容: org.apache.ibatis.exceptions.PersistenceException:

Mybatis 報錯 java.io.IOException: Could not find resource mybatis-config.xml

技術標籤：mybatis 問題描述在使用mybatis過程中,程式需要讀取mybatis-config.xml配置檔案，IDEA預設將這個資原始檔放在resource目錄下，啟動專案報錯。內容如下：

解決org.apache.ibatis.builder.BuilderException: Error parsing SQL Mapper Configuration. Cause: java.io.IOException: Could not find resource //dao/**Mapper.xml問題

1. 問題分析　　出現此問題的原因是資源過濾的問題，編寫在DAO包中的XML檔案沒有被打包。

Caused by: java.lang.IllegalStateException: Could not resolve element type of Iterable type @。。。。。web.bind.annotation.RequestParam java.util.List<?>. Not declared?

java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

12 Answers

java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

java.io.IOException: Could not find resource com/xxx/xxxMapper.xml

Mybatis 報錯 java.io.IOException: Could not find resource mybatis-config.xml

解決org.apache.ibatis.builder.BuilderException: Error parsing SQL Mapper Configuration. Cause: java.io.IOException: Could not find resource //dao/**Mapper.xml問題

Caused by: java.lang.IllegalStateException: Could not resolve element type of Iterable type @。。。。。web.bind.annotation.RequestParam java.util.List<?>. Not declared?

IDEA完成shiro認證報錯:org.apache.shiro.config.ConfigurationException: java.io.IOException: Resource

org.springframework.amqp.AmqpIOException: java.io.IOException

Jenkins從節點上構建自動化測試專案時報錯：java.io.IOException: Unexpected termination of the channel

什麼場景會拋該異常【java.io.IOException: 你的主機中的軟體中止了一個已建立的連線】

上傳圖片報錯java.io.IOException: Stream closed

解決org.springframework.amqp.AmqpIOException: java.io.IOException錯誤

啟動MapReduce丟擲異常java.io.IOException: Filesystem closed at org.apache.hadoop.hdfs.DFSClient.checkOpen

Spring Cloud報錯java.lang.IllegalArgumentException: Could not find class

springcloud org.apache.catalina.connector.ClientAbortException: java.io.IOException

Java程式使用Alpine Linux報錯java.lang.NoClassDefFoundError: Could not initialize class org.xerial.snappy.S...

Caused by: java.io.IOException: ZIP entry size is too large

java.io.IOException: 你的主機中的軟體中止了一個已建立的連線

docker執行postgresql出現could not locate a valid checkpoint record的產生原因及如何解決

VUE前端實現PDF預覽時出現org.apache.catalina.connector.ClientAbortException:java.io.IOException: 您的主機中的軟體中止了一個已建立的連線

java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

12 Answers

相關推薦