Machine Learning with Snowflake: Snowpark & Cortex
- Created By shambhvi
- Last Updated February 18th, 2025
- Overview
- Prerequisites
- Audience
- Curriculum
Description:
The rapid evolution of artificial intelligence (AI) and machine learning (ML) is transforming industries worldwide. From personalized customer experiences to real-time fraud detection, organizations are leveraging ML to drive innovation and gain a competitive edge. Snowflake, with its powerful data platform, has emerged as a leader in enabling end-to-end ML pipelines, empowering data professionals to harness the full potential of their data.
This course equips students with the skills to navigate this transformative landscape, combining the scalability of Snowflake with advanced ML capabilities through Snowpark and Cortex. Over three days, students will learn how to build ML workflows that are seamless, scalable, and optimized for modern data engineering and analytics. With Snowflake's recent advancements in Large Language Model (LLM) integration and in-database ML, participants will gain firsthand experience with cutting-edge technologies shaping the future of AI.
Whether you're a data scientist, ML engineer, or analytics professional, this course will help you stay ahead of industry trends and enable you to deploy powerful, real-time ML models in Snowflake's unified platform.
Duration: 3 Day
Course Code: BDT396
Learning Objectives:
After this course, students will be able to:
- Understand Snowflake’s architecture for machine learning pipelines and integration with external datasets
- Develop ML pipelines using Snowpark for scalable data preparation and modeling
- Utilize Cortex for advanced ML pipeline development, including model training and deployment
- Implement and execute ML functions and LLM functions in Cortex for real-world use cases
- Optimize end-to-end ML workflows using Snowflake’s capabilities
- Familiarity with programming language – especially Python
- Basics of using Snowflake and SQL
- Prior knowledge of Machine Learning will be useful but not required
- This course is designed for Software Developers, Data Scientists, Software Architects, Quality Assurance Engineers, Data Analysts to build and implement generative AI models. Familiarity with machine learning concepts (like neural networks) is helpful but not required.
Course Outline:
- Snowflake Overview
- Quick review of Snowflake Architecture
- Understanding data ingestion from external sources
- Demos and Labs
- Machine Learning Overview
- Understand the concept of machine learning
- Learn about data wrangling & preparing data for machine learning
- Understanding machine learning techniques: Classification & Regression
- Learn about metrics for validating these techniques
- Working with hyper parameters, cross validation
- Build machine learning pipelines
- Multiple Demos and Labs
- Machine Learning pipelines with Snowpark
- What are Snowpark components?
- Snowflake Python connector vs Snowpark – what is the difference?
- Working with UDF, Vectorized UDF, Functions, Procedures
- Pandas vs Snowpark Dataframes
- ML Pipelines with Snowpark
- Multiple Demos and Labs
- Machine Learning pipelines with Snowpark ML (Cortex)
- Introduction to Snowpark ML APIs
- Data Collections with Filesystem and FileSet
- Distributed pipelines with Snowpark ML
- Hyper parameter tuning with Snowpark ML
- Model predictions with registered models
- Multiple Demos and Labs
- Machine Learning Functions with Cortex
- What are machine learning functions in cortex
- Performing Time-series forecasting, anomaly detection, classification with ML Functions
- Using Snowflake SQL classes and instances
- Building Classification and Regression Models with Spark machine learning library
- Understanding the costs involved in using Machine Learning Functions
- Multiple Demos and Labs
- LLM (Large Language Model) Functions with Cortex
- Understand support for various LLM models on Snowflake
- Integrate with ChatGPT models
- Using LLM functions: COMPLETE, SENTIMENT, SUMMARIZE, TRANSLATE, etc
- Understanding the costs involved in using these LLMs
- Multiple Demos and Labs
Training material provided: Yes (Digital format)
Hands-on Lab: Students will create a trial Snowflake account for the hands-on labs. If required virtual machines will be provided