All Courses
Mastering Generative AI: LLM Architecture & Data Preparation
edX
Course
Intermediate
Free to Audit
Certificate

Mastering Generative AI: LLM Architecture & Data Preparation

IBM

Build in-demand, job-ready generative AI architecture and data science skills in two weeks. A basic knowledge of Python and PyTorch and an awareness of machine learning and neural networks would be an advantage, though not strictly required.

2 hrs/week2 weeksEnglish1,743 enrolled
Free to Audit

About this Course

The demand for gen AI is forecast to grow over 46% annually by 2030 (Source: Statista). AI engineers and developers, data scientists, machine learning engineers, and other AI professionals with gen AI skills are highly sought-after. This course builds in-demand skills in large language model (LLM) architecture and data preparation employers are looking for. During the course, you’ll learn about real-world applications using generative AI. You’ll gain insights into gen AI architectures and models, such as recurrent neural networks (RNNs), transformers, generative adversarial networks (GANs), variational autoencoders (VAEs), and diffusion models. You’ll use different training approaches for each model. Plus, you’ll explore LLMs such as generative pre-trained transformers (GPT) and bidirectional encoder representations from transformers (BERT). Additionally, you’ll gain a detailed understanding of the tokenization process, tokenization methods, and the use of tokenizers for word-based, character-based, and subword-based tokenization. You’ll get hands-on experience using data loaders for training generative AI models, using PyTorch libraries, and generative AI libraries in Hugging Face. Plus, you’ll implement tokenization and create an NLP data loader. If you’re looking to master gen AI LLM architecture and data preparation, ENROLL TODAY and get ready to power up your resume with skills employers need! Prerequisites: To enroll for this course, a basic knowledge of Python and PyTorch and an awareness of machine learning and neural networks would be an advantage, though not strictly required. 3b:T67d,

What You'll Learn

  • Job-ready generative AI architecture and data science skills in two weeks, plus practical experience and an industry-recognized credential employers value.
  • The difference between generative AI architectures and models, such as RNNs, transformers, VAEs, GANs, and diffusion models.
  • How LLMs such as GPT, BERT, BART, and T5 are used in language processing.
  • How to implement tokenization to preprocess raw textual data using NLP libraries such as NLTK, spaCy, BertTokenizer, and XLNetTokenizer.
  • How to create an NLP data loader using PyTorch to perform tokenization, numericalization, and padding of text data.

Prerequisites

  • For this course, a basic knowledge of Python and PyTorch and an awareness of machine learning and neural networks would be an advantage, though not strictly required.

Instructors

J

Joseph Santarcangelo

PhD., Data Scientist

Course Info

PlatformedX
LevelIntermediate
PacingUnknown
CertificateAvailable
PriceFree to Audit

Start Learning Now