A Source-Level Deep Dive into Spark Streaming's Data Cleanup Mechanism Under Continuous Operation - Spark in Production Series

This blog series distills and shares cases drawn from real production environments, together with Spark source code walkthroughs and practical guidance. Please keep following the series. Copyright notice: this Spark source code analysis and production practice series belongs to the author (Qin Kaixin). Reproduction is prohibited; you are welcome to learn from it.

1 He who will not sweep a single room cannot sweep the world

  • A Spark Streaming application runs continuously. If data only ever flows in and nothing is ever released, then no matter how capable Spark's memory management is, the application will eventually fall over. Cleaning up in-memory data and on-disk data in a timely fashion is therefore critically important.
  • So how are the objects, data, and metadata of a Spark Streaming application actually reclaimed? Let's hold that question and trace the code.

2 When does the sweeping happen?

2.1 JobScheduler: from job submission to job completion

  • JobGenerator triggers generateJobs on each batch interval
  • JobGenerator -> jobScheduler.submitJobSet(JobSet(time, jobs, streamIdToInputInfos))
  • submitJobSet -> jobSet.jobs.foreach(job => jobExecutor.execute(new JobHandler(job)))
  • JobHandler (inside JobScheduler) -> _eventLoop.post(JobCompleted(job, clock.getTimeMillis()))
  • jobScheduler -> handleJobCompletion(job, completedTime)
  • jobScheduler -> jobGenerator.onBatchCompletion(jobSet.time)
  • jobGenerator -> eventLoop.post(ClearMetadata(time)) (the last two hops are sketched below)
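The last two hops are worth seeing in code. Roughly, the relevant handlers look like this in the Spark 2.x source (abridged here: listener notifications, logging and failure handling are omitted):

    // JobScheduler: once every job of a batch's JobSet has finished,
    // notify the JobGenerator that the whole batch is complete.
    private def handleJobCompletion(job: Job, completedTime: Long) {
      val jobSet = jobSets.get(job.time)
      jobSet.handleJobCompletion(job)
      job.setEndTime(completedTime)
      if (jobSet.hasCompleted) {
        jobSets.remove(jobSet.time)
        jobGenerator.onBatchCompletion(jobSet.time)
      }
    }

    // JobGenerator: batch completion is what finally posts the cleanup event.
    def onBatchCompletion(time: Time) {
      eventLoop.post(ClearMetadata(time))
    }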

2.2 clearMetadata: the dragon that shows its head but hides its tail

  • The main cached metadata, held in InputInfoTracker and ReceivedBlockTracker respectively (how these maps get populated is sketched at the end of this subsection):

    private val batchTimeToInputInfos =
      new mutable.HashMap[Time, mutable.HashMap[Int, StreamInputInfo]]

    private val timeToAllocatedBlocks = new mutable.HashMap[Time, AllocatedBlocks]
  • ssc.graph.clearMetadata: walks the outputStreams to clear old generated RDDs, and removes their cached block data through the BlockManager.

  • jobScheduler.receiverTracker.cleanupOldBlocksAndBatches: clears the cached timeToAllocatedBlocks entries through the ReceivedBlockTracker.

  • jobScheduler.inputInfoTracker.cleanup: clears the batchTimeToInputInfos metadata held by the InputInfoTracker.

    private def clearMetadata(time: Time) {

      ssc.graph.clearMetadata(time)               // <- key step

      // If checkpointing is enabled, then checkpoint,
      // else mark batch to be fully processed
      if (shouldCheckpoint) {
        eventLoop.post(DoCheckpoint(time, clearCheckpointDataLater = true))  // <- key step

      } else {
        // If checkpointing is not enabled, then delete metadata information about
        // received blocks (block data not saved in any case). Otherwise, wait for
        // checkpointing of this batch to complete.
        val maxRememberDuration = graph.getMaxInputStreamRememberDuration()

        jobScheduler.receiverTracker.cleanupOldBlocksAndBatches(time - maxRememberDuration)  // <- key step
        jobScheduler.inputInfoTracker.cleanup(time - maxRememberDuration)    // <- key step
        markBatchFullyProcessed(time)
      }
    }
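For context on why timeToAllocatedBlocks needs this cleanup at all: every batch adds an entry to it when the JobGenerator asks the ReceivedBlockTracker to allocate all not-yet-allocated receiver blocks to the new batch time. Roughly (abridged from ReceivedBlockTracker; WAL failure and recovery handling omitted):

    // Called once per batch (from JobGenerator.generateJobs via the ReceiverTracker).
    // Drains the per-stream queues of received blocks and records the allocation,
    // so timeToAllocatedBlocks grows by one entry per batch until cleanupOldBatches runs.
    def allocateBlocksToBatch(batchTime: Time): Unit = synchronized {
      if (lastAllocatedBatchTime == null || batchTime > lastAllocatedBatchTime) {
        val streamIdToBlocks = streamIds.map { streamId =>
          (streamId, getReceivedBlockQueue(streamId).dequeueAll(x => true))
        }.toMap
        val allocatedBlocks = AllocatedBlocks(streamIdToBlocks)
        if (writeToLog(BatchAllocationEvent(batchTime, allocatedBlocks))) {
          timeToAllocatedBlocks.put(batchTime, allocatedBlocks)
          lastAllocatedBatchTime = batchTime
        }
      }
    }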

2.3 ssc.graph.clearMetadata: goodbye, old RDDs

  /**
   * Clear metadata that are older than `rememberDuration` of this DStream.
   * This is an internal method that should not be called directly. This default
   * implementation clears the old generated RDDs. Subclasses of DStream may override
   * this to clear their own metadata along with the generated RDDs.
   */
  private[streaming] def clearMetadata(time: Time) {
    val unpersistData = ssc.conf.getBoolean("spark.streaming.unpersist", true)
    val oldRDDs = generatedRDDs.filter(_._1 <= (time - rememberDuration))
    logDebug("Clearing references to old RDDs: [" +
      oldRDDs.map(x => s"${x._1} -> ${x._2.id}").mkString(", ") + "]")

    generatedRDDs --= oldRDDs.keys        // <- key step: drop the references

    if (unpersistData) {
      logDebug(s"Unpersisting old RDDs: ${oldRDDs.values.map(_.id).mkString(", ")}")
      oldRDDs.values.foreach { rdd =>

        rdd.unpersist(false)              // <- key step: release the cached data

        // Explicitly remove blocks of BlockRDD
        rdd match {
          case b: BlockRDD[_] =>
            logInfo(s"Removing blocks of RDD $b of time $time")
            b.removeBlocks()
          case _ =>
        }
      }
    }
  }
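The b.removeBlocks() call above is what actually frees the received data: it asks the BlockManagerMaster to drop every block backing the BlockRDD on the executors and then invalidates the RDD. A simplified view of BlockRDD (abridged):

    // BlockRDD: remove the backing blocks cluster-wide through the
    // BlockManagerMaster, then mark this RDD as no longer usable.
    private[spark] def removeBlocks() {
      blockIds.foreach { blockId =>
        sparkContext.env.blockManager.master.removeBlock(blockId)
      }
      _isValid = false
    }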
RDD removal detail (SparkContext.unpersistRDD, reached from rdd.unpersist above):

  private[spark] def unpersistRDD(rddId: Int, blocking: Boolean = true) {
    env.blockManager.master.removeRdd(rddId, blocking)
    persistentRdds.remove(rddId)
    listenerBus.post(SparkListenerUnpersistRDD(rddId))
  }
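From the application side, the knobs that feed this path are the spark.streaming.unpersist flag read above and the graph's remember duration. A minimal user-side sketch (the socket source, durations and app name below are illustrative assumptions, not taken from the original post):

    import org.apache.spark.SparkConf
    import org.apache.spark.streaming.{Minutes, Seconds, StreamingContext}

    val conf = new SparkConf()
      .setMaster("local[2]")
      .setAppName("streaming-cleanup-demo")
      // default is already true; setting it to false would keep old RDDs cached
      // and skip the unpersist/removeBlocks path shown above
      .set("spark.streaming.unpersist", "true")

    val ssc = new StreamingContext(conf, Seconds(10))
    // keep generated RDDs around for at least 2 minutes, which delays
    // clearMetadata for batches younger than that (useful for ad-hoc inspection)
    ssc.remember(Minutes(2))

    val lines = ssc.socketTextStream("localhost", 9999)
    lines.count().print()

    ssc.start()
    ssc.awaitTermination()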

2.4 cleanupOldBlocksAndBatches: goodbye, old batch block data

  /**
   * Clean up block information of old batches. If waitForCompletion is true, this method
   * returns only after the files are cleaned up.
   */
  def cleanupOldBatches(cleanupThreshTime: Time, waitForCompletion: Boolean): Unit = synchronized {
    require(cleanupThreshTime.milliseconds < clock.getTimeMillis())

    val timesToCleanup = timeToAllocatedBlocks.keys.filter { _ < cleanupThreshTime }.toSeq
    logInfo(s"Deleting batches: ${timesToCleanup.mkString(" ")}")

    if (writeToLog(BatchCleanupEvent(timesToCleanup))) {
      timeToAllocatedBlocks --= timesToCleanup
      writeAheadLogOption.foreach(_.clean(cleanupThreshTime.milliseconds, waitForCompletion))
    } else {
      logWarning("Failed to acknowledge batch clean up in the Write Ahead Log.")
    }
  }
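The method above is reached through ReceiverTracker.cleanupOldBlocksAndBatches, which first delegates to the ReceivedBlockTracker and then, if a receiver-side write-ahead log is enabled, tells the running receivers to delete their old block data as well. Roughly (abridged; the exact shape varies slightly across Spark versions):

    def cleanupOldBlocksAndBatches(cleanupThreshTime: Time) {
      // Clean up the tracked block and batch metadata
      receivedBlockTracker.cleanupOldBatches(cleanupThreshTime, waitForCompletion = false)
      // Signal the receivers to delete old block data from their write-ahead logs
      if (WriteAheadLogUtils.enableReceiverLog(ssc.conf)) {
        synchronized {
          if (isTrackerStarted) {
            endpoint.send(CleanupOldBlocks(cleanupThreshTime))
          }
        }
      }
    }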

2.5 inputInfoTracker.cleanup: goodbye, old batch input-info metadata

  def cleanup(batchThreshTime: Time): Unit = synchronized {
    val timesToCleanup = batchTimeToInputInfos.keys.filter(_ < batchThreshTime)
    logInfo(s"remove old batch metadata: ${timesToCleanup.mkString(" ")}")
    batchTimeToInputInfos --= timesToCleanup
  }
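For symmetry with the cleanup above, the entries being removed are created once per input stream per batch through InputInfoTracker.reportInfo, called while the jobs for a batch are being generated. Roughly (abridged):

    // Record how many records each input stream contributed to a batch;
    // one batchTimeToInputInfos entry per batch time until cleanup() removes it.
    def reportInfo(batchTime: Time, inputInfo: StreamInputInfo): Unit = synchronized {
      val inputInfos = batchTimeToInputInfos.getOrElseUpdate(batchTime,
        new mutable.HashMap[Int, StreamInputInfo]())
      if (inputInfos.contains(inputInfo.inputStreamId)) {
        throw new IllegalStateException(s"Input stream ${inputInfo.inputStreamId} for batch " +
          s"$batchTime is already added into InputInfoTracker, this is an illegal state")
      }
      inputInfos += ((inputInfo.inputStreamId, inputInfo))
    }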

3 Summary

He who will not sweep a single room cannot sweep the world: the final chapter.

Qin Kaixin, Shenzhen, 1:13 a.m., 2018
