Deploy and Scale AI Models with Cloud Run

Google Cloud

Learn to deploy AI models on Cloud Run, optimize performance and costs, and integrate AI services with cloud databases effectively.

Unknown, 2 weeks, English

Free

About this Course

AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers who want to quickly deploy AI inference services on Cloud Run. It is useful for those who are familiar with cloud-based serverless application deployment but may not have experience running AI inference on Google Cloud serverless products. The course includes examples that deploy a model for AI inference with GPUs and integrate gen AI apps with data storage services.

What You'll Learn

Deploy AI models using Cloud Run GPUs
Deploy lightweight language models on Cloud Run
Optimize model deployment for performance and cost
Integrate AI inference services with Google Cloud databases

Prerequisites

Basic computer and internet skills
Ability to read instructions in English

Instructors

Google Cloud Training

Topics

Software Development

Computer Science

Cloud Computing

Information Technology

Containerization

Model Deployment

Performance Tuning

Scalability

Generative AI

Cloud Deployment

Course Info

Platform: Coursera

Level: Unknown

Pacing: Unknown

Price: Free

Skills

Software Development

Computer Science

Cloud Computing

Information Technology

Application Containers

Model Deployment

Performance Optimization

Scalability

Generative AI

Cloud Deployment
