Deploy and Scale AI Models with Cloud Run

Google Cloud

Learn to deploy AI models on Cloud Run, optimize performance and costs, and integrate AI services with cloud databases effectively.

Unknown · 2 weeks · English

About this Course

AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers who want to quickly deploy AI inference services on Cloud Run. It is useful for those who are familiar with cloud-based serverless application deployment but may not have experience running AI inference with Google Cloud serverless products. The course includes examples that deploy a model for AI inference with GPUs and integrate gen AI apps with data storage services.

What You'll Learn

  • Deploy AI models using Cloud Run GPUs
  • Deploy lightweight language models on Cloud Run
  • Optimize model deployment for performance and cost
  • Integrate AI inference services with Google Cloud databases

Prerequisites

  • Basic computer and internet skills
  • Ability to read instructions in English

Instructors

Google Cloud Training

Topics

Software Development
Computer Science
Cloud Computing
Information Technology
Containerization
Model Deployment
Performance Tuning
Scalability
Generative AI
Cloud Deployment

Course Info

Platform: Coursera
Level: Unknown
Pacing: Unknown
Price: Free

Skills

Software Development
Computer Science
Cloud Computing
Information Technology
Containerization
Model Deployment
Performance Tuning
Scalability
Generative AI
Cloud Deployment
