Global Admissions Logo

$ USD

Search Programs

Introducing Multimodal Llama 3.2

California, United States

The Non-degree in Introducing Multimodal Llama 3.2 at DeepLearning.AI is a program for international students taught in English.

📖 Introduction

DeepLearning.AI is an online education platform founded in 2017 by Andrew Ng, a leading AI expert and co-founder of Coursera. As a private organization, DeepLearning.AI specializes in AI and machine learning education, offering high-quality courses, specializations, and professional certifications in collaboration with top institutions and industry leaders. The platform is known for its practical, hands-on approach to teaching AI concepts and its focus on making cutting-edge AI knowledge accessible to learners worldwide.

📚 About the Program

Join our new short course, Introducing Multimodal Llama 3.2, and learn from Amit Sangani, Senior Director of AI Partner Engineering at Meta, to learn all about the latest additions to the Llama models 3.1 and 3.2, from custom tool calling to multimodality and the new Llama stack.Open models are a key building block of AI and a key enabler of AI research. With Meta’s family of open models, anyone can download, customize, fine-tune, or build new applications on top of them, allowing AI innovation. The Llama model family now ranges from 1B model parameters to its 405B foundation model, allowing for diverse use cases and applications.In this course, you’ll learn about the new vision capabilities that Llama 3.2 brings to the Llama family. You’ll learn how to leverage this along with tool-calling, and Llama Stack, which is an open-source orchestration layer for building on top of the Llama family of models.In detail, you’ll: 1. Learn about the new models, how they were trained, their features, and how they fit into the Llama family.2. Understand how to do multimodal prompting with Llama and work on advanced image reasoning use cases such as understanding errors on a car dashboard, adding up the total of three restaurant receipts, grading written math homework, and many more.3. Learn different roles—system, user, assistant, ipython—in the Llama 3.1 and 3.2 family and the prompt format that identifies those roles.4. Understand how Llama uses the tiktoken tokenizer, and how it has expanded to a 128k vocabulary size that improves encoding efficiency and enables support for seven non-English languages.5. Learn how to prompt Llama to call both built-in and custom tools with examples for web search and solving math equations.6. Learn about ‘Llama Stack API’, which is a standardized interface for canonical toolchain components like fine-tuning or synthetic data generation to customize Llama models and build agentic applications.Start building exciting applications on Llama!

Join our new short course, Introducing Multimodal Llama 3.2, and learn from Amit Sangani, Senior Director of AI Partner Engineering at Meta, to learn all about the latest additions to the Llama models 3.1 and 3.2, from custom tool calling to multimodality and the new Llama stack.Open models are a key building block of AI and a key enabler of AI research. With Meta’s family of open models, anyone can download, customize, fine-tune, or build new applications on top of them, allowing AI innovation. The Llama model family now ranges from 1B model parameters to its 405B foundation model, allowing for diverse use cases and applications.In this course, you’ll learn about the new vision capabilities that Llama 3.2 brings to the Llama family. You’ll learn how to leverage this along with tool-calling, and Llama Stack, which is an open-source orchestration layer for building on top of the Llama family of models.In detail, you’ll: 1. Learn about the new models, how they were trained, their features, and how they fit into the Llama family.2. Understand how to do multimodal prompting with Llama and work on advanced image reasoning use cases such as understanding errors on a car dashboard, adding up the total of three restaurant receipts, grading written math homework, and many more.3. Learn different roles—system, user, assistant, ipython—in the Llama 3.1 and 3.2 family and the prompt format that identifies those roles.4. Understand how Llama uses the tiktoken tokenizer, and how it has expanded to a 128k vocabulary size that improves encoding efficiency and enables support for seven non-English languages.5. Learn how to prompt Llama to call both built-in and custom tools with examples for web search and solving math equations.6. Learn about ‘Llama Stack API’, which is a standardized interface for canonical toolchain components like fine-tuning or synthetic data generation to customize Llama models and build agentic applications.Start building exciting applications on Llama!

🏫 About the University

DeepLearning.AI is dedicated to advancing artificial intelligence education and empowering individuals to build careers in AI and machine learning. The platform offers a range of courses, including the renowned "Deep Learning Specialization" and "AI for Everyone," designed to cater to beginners, professionals, and researchers. By collaborating with leading experts and institutions, DeepLearning.AI provides industry-relevant content that bridges the gap between theoretical knowledge and real-world applications. Through its online courses, research initiatives, and community-driven projects, DeepLearning.AI plays a crucial role in shaping the future of AI education and innovation.

💰 Fees

Application Fee

$0 USD

$0 USD

Tuition Fee

$120 USD

$120 USD

per year

✅ Entry Requirements

The minimum age is 18 and the maximum age is 50.

English Fluent is required.

Minimum education level Bachelor's degree

All students from all countries are eligible to apply to this program.

📬 Admissions Process


1

Step 1

Choose programs

2

Step 2

Apply online

3

Step 3

Enroll

📝 Reviews

Write a review

Application Fee

$0 USD

Service Fee

$0 USD

Tuition

$120 USD

Why Apply on Global Admissions?

Similar Programs to Non-degree

Similar Programs to Non-degree

Blog

We use cookies to improve your experience and analyze site usage. Read our Privacy Policy to learn more about our data practices.