Autoscaling - IT文库_程序员IT互联网编程电子书和文档免费下载，助您码力十足！

首页文库资料文章资讯上传文档发布文章登录账户

A Day in the Life of a Data Scientist Conquer Machine Learning Lifecycle on Kubernetes

GPU or CPU nodes • Massive Scale • OpenAI dedicates up to 10k cores for a single experiment • Autoscaling capabilities: Pay for what you use, scale down when idle • Parallel training instead of sequential: Spin up pods for each variation of hyperparameters • One centralized TensorBoard instance • Autoscaling will create / remove VMs as needed to save cost Demo: Create End to End ML Pipelines with Argo Distributed File Systems • NFS • HDFS • … Classic DevOps solutions: • Containers • CI/CD • Autoscaling • A/B testing and canary release of Models • Comparing Production accuracy vs expected accuracy

0 码力 | 21 页 | 68.69 MB | 1 年前
3

共 1 条前往

页

KubeCon China ML Lifecycle