Spark RangeDependency: Range Dependency
阿新 • Published: 2018-12-05
- Represents a one-to-one dependency between ranges of partitions in the parent and child RDDs.
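The mapping this describes can be sketched in plain Scala, without any Spark classes. The name SimpleRangeDependency is hypothetical and used only for illustration. The parameters mirror the idea of a start index in the parent (inStart), a start index in the child (outStart), and a range length. The concrete values in main are assumptions, not taken from a real job.

```scala
// Standalone model of a range dependency; no Spark classes are used.
// SimpleRangeDependency is a hypothetical name chosen for this sketch.
final case class SimpleRangeDependency(inStart: Int, outStart: Int, length: Int) {
  // Parent partition(s) that the given child partition reads from.
  // A child partition outside the range has no parent in this dependency.
  def getParents(partitionId: Int): List[Int] =
    if (partitionId >= outStart && partitionId < outStart + length)
      List(partitionId - outStart + inStart)
    else
      Nil
}

object RangeDependencySketch {
  def main(args: Array[String]): Unit = {
    // Assumption: this parent has 2 partitions, and an earlier parent
    // already occupies child partitions 0 and 1.
    val dep = SimpleRangeDependency(inStart = 0, outStart = 2, length = 2)
    (0 until 4).foreach { child =>
      println(s"child partition $child -> parent partitions ${dep.getParents(child)}")
    }
  }
}
```

Under these assumed values, child partitions 2 and 3 map to parent partitions 0 and 1 respectively, while child partitions 0 and 1 fall outside the range and map to no partition of this parent.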
More resources
- github: https://github.com/opensourceteams/spark-scala-maven
- csdn (video collection, watch online): https://blog.csdn.net/thinktothings/article/details/84726769
Video demo
- https://www.bilibili.com/video/av37442139/?p=2 (bilibili video)
Input data
c.txt
a bc
a
a.txt
a b
c a
Processing program (Scala)
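package com.opensource.bigdata.spark.local.rdd.operation.dependency.narrow.n_02_RangeDependency

import com.opensource.bigdata.spark.local.rdd.operation.base.BaseScalaSparkContext

object Run3 extends BaseScalaSparkContext {

  def main(args: Array[String]): Unit = {
    // pre() comes from BaseScalaSparkContext and supplies the SparkContext
    val sc = pre()

    // The second argument to textFile is the minimum number of partitions
    val rdd1 = sc.textFile("/opt/data/2/c.txt", 2)
    val rdd2 = sc.textFile("/opt/data/2/a.txt", 2)

    // union keeps each parent's partitions as a contiguous range in rdd3,
    // so rdd3 depends on rdd1 and rdd2 through range dependencies
    val rdd3 = rdd1.union(rdd2)

    println(rdd3.collect().mkString("\n"))

    sc.stop()
  }
}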
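
How the partitions of rdd3 relate to those of rdd1 and rdd2 can be modelled in plain Scala. This is a sketch under stated assumptions, not Spark's implementation: ParentRange, layout, and resolve are hypothetical names, and it assumes rdd1 and rdd2 each have exactly 2 partitions (the real count returned by textFile can be higher, since the second argument is only a minimum). Each parent is given a contiguous block of child partitions, in the order the parents are unioned.

```scala
// Hypothetical model of how a union lays out its parents' partitions.
// None of these names come from Spark; they exist only for this sketch.
final case class ParentRange(parent: String, inStart: Int, outStart: Int, length: Int) {
  // True if the child partition index falls inside this parent's block.
  def contains(child: Int): Boolean = child >= outStart && child < outStart + length
  // Index of the parent partition that feeds the given child partition.
  def parentPartition(child: Int): Int = child - outStart + inStart
}

object UnionLayoutSketch {
  // Give each parent a contiguous block of child partitions, in order.
  def layout(parents: Seq[(String, Int)]): Seq[ParentRange] = {
    val offsets = parents.scanLeft(0)(_ + _._2) // running start offsets
    parents.zip(offsets).map { case ((name, n), start) => ParentRange(name, 0, start, n) }
  }

  // Resolve a child partition to (parent name, parent partition index).
  def resolve(ranges: Seq[ParentRange], child: Int): Option[(String, Int)] =
    ranges.find(_.contains(child)).map(r => (r.parent, r.parentPartition(child)))

  def main(args: Array[String]): Unit = {
    // Assumption: rdd1 and rdd2 each have exactly 2 partitions.
    val ranges = layout(Seq("rdd1" -> 2, "rdd2" -> 2))
    (0 until 4).foreach(c => println(s"rdd3 partition $c <- ${resolve(ranges, c)}"))
  }
}
```

With the assumed partition counts, rdd3 partitions 0 and 1 come from rdd1 and partitions 2 and 3 come from rdd2. Because each child partition reads exactly one parent partition, no shuffle is needed, which is why this counts as a narrow dependency.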