基於Kubernetes的機器學習系統

時間 2019-11-06

標籤基於 kubernetes 機器學習系統简体版

原文原文鏈接

基於Kubernetes的機器學習系統

基於Kubernetes搭建機器學習系統，將能夠利用大型集羣管理和微服務架構實現大規模、生產級的訓練、研究和應用機器學習系統，Nvidia的 HGX超級機器學習平臺也基於該架構，具備強大的運算和IO能力。前端

一、基於Kubernetes的機器學習流程

機器學習系統包括訓練框架、機器學習計算引擎、樣本數據庫、計算模型庫、超級參數庫、應用框架和移動應用（包括引擎、框架和參數庫調優等）等部分。爲了生產級的應用，還須要具備有配置參數及其版本化的管理，樣本數據庫、超級參數和環境配置參數每每存儲在虛擬存儲系統上（如NFS/Rook），以實現多個節點的並行處理訪問所需。docker

二、基於Kubernetes的機器學習資源

本文將包含基於Kubernetes的機器學習系統相關參考資源，包括基於Ubuntu的Nvidia圖形驅動驅動、cuDNN深度學習引擎、Nvidia Docker二、Nvidia Plugin for Kubernetes、Kubeflow、Spark ML等安裝和配置方法。數據庫

ML引擎與基礎環境
- Ubuntu快速玩起色器學習，http://www.javashuo.com/article/p-ejsumwgy-ge.html
- Kubernetes安裝GPU支持插件，http://www.javashuo.com/article/p-tcfjgauk-gd.html
- pytorch-operator on Kubernetes，http://www.javashuo.com/article/p-afvqxpqf-dm.html
- 快速運行TensorFlow的6種方式，http://www.javashuo.com/article/p-alzssfpr-do.html
- 分佈式TensorFlow，http://www.javashuo.com/article/p-rrneniir-gd.html
Spark上的機器學習
- Spark上的深度學習流水線，http://www.javashuo.com/article/p-wafhvhjg-cq.html
- Spark機器學習工具鏈-MLflow，http://www.javashuo.com/article/p-winervew-nt.html
- MLflow 使用教程，http://www.javashuo.com/article/p-sbrldptq-nt.html
- Introducing Deep Learning Pipelines for Apache Spark，
  - http://www.javashuo.com/article/p-zvxwlfiy-dt.html
- A Vision for Making Deep Learning Simple，
  - http://www.javashuo.com/article/p-nnrxnajo-eg.html
KubeFlow的ML技術棧
- ksonnet 使用教程，http://www.javashuo.com/article/p-vbthnzft-nt.html
- Kubeflow 使用指南，http://www.javashuo.com/article/p-nppedcay-hp.html
- Kubeflow 快速入門，http://www.javashuo.com/article/p-qbhmhuai-hd.html
- Kubeflow-機器學習工做流框架，https://my.oschina.net/u/2306127/blog/1807785
- Kubernetes集成TensorFlow服務，http://www.javashuo.com/article/p-mxgqswxy-mp.html
工做流程與應用
- 什麼是地理空間智能（Geospatial AI），http://www.javashuo.com/article/p-zhymzkbc-ek.html
- Land Cover Classification Using the Geo AI，http://www.javashuo.com/article/p-yutgshbi-ev.html
- Deep Learning Models on Kubernetes with GPUs，http://www.javashuo.com/article/p-gsqaalml-ex.html
- 使用TensorFlow和Kubernetes進行基於GPU的深度學習，http://www.javashuo.com/article/p-hrpmbtep-ey.html
- Pachyderm 快速入門，http://www.javashuo.com/article/p-xlqhpxte-ga.html

三、Kubenrnetes集羣安裝參考

kubeadm安裝Kubernetes實踐記錄，http://www.javashuo.com/article/p-rqehlpup-bz.html
Kubernetes加入新節點，經驗總結，http://www.javashuo.com/article/p-rgeikibn-ek.html
Kubernetes v1.10.x HA 全手動安裝教程，http://www.javashuo.com/article/p-wfpihfpm-dt.html
使用kubeadm建立Kubernetes集羣，http://www.javashuo.com/article/p-youxcash-dm.html
Kubernetes部署與應用解決方案大全，http://www.javashuo.com/article/p-zbkqmxgg-cz.html

四、Kubenrnetes網絡部署參考

calico在docker上的部署及驗證，http://www.javashuo.com/article/p-zzkpntwt-dc.html
容器SDN技術與微服務架構實踐，http://www.javashuo.com/article/p-faauvliw-ct.html
基於Kubeadm的Flannel分析，http://www.javashuo.com/article/p-wdpwzohm-ey.html
...

五、機器學習前端應用方向

在機器學習平臺中，後端主要用於訓練和大型分析、在線服務，而前端經過低功耗的專用芯片實現模型的應用，如自動駕駛、圖像分類、視頻理解、人臉識別、語音合成、視覺定位等等，有基於ASIC、FPGA、DSP、ARM的專用芯片和多種方案，體積和功耗都是比較低的，能夠在包括手機上的移動設備上運行。後端

本文地址 http://www.javashuo.com/article/p-bpgpkqza-dt.html.......

相關文章

相關標籤/搜索

機器學習基礎

機器學習基石

機器學習系列

圖機器學習

java機器學習

Python機器學習

python 機器學習

opencv、機器學習

瀏覽器信息

網站主機教程

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。

最新文章

本站公眾號

歡迎關注本站公眾號,獲取更多信息

相關文章

>>更多相關文章<<