What is snowflake
Snowflake has emerged as the only cloud data warehouse solution powering the data cloud, allowing businesses to build best-in-class analytical environments with unparalleled power, elasticity, and customization – all at a fraction of the cost. Learning Snowflake is simple and can bring exceptional opportunities for career advancement. This course will teach you all you need to know about Snowflake as well as how to get started with this data platform.
Snowflake course content
Snowflake course content is designed to help you acquire hands-on skills needed for Snowflake Data Analyst and Data Engineer Job Roles and also includes the topics required to clear the Snowflake Certification exams.
Snowflake Architecture and Overview
- Snowflake Overview
- Architecture
- How to use the Snowflake UI & ecosystem
Introduction to Cloud
- What is Cloud
- Different Cloud Vendors
- Advantages of Cloud over On-Premise
Introduction to Data Warehousing Concepts
- What is a Data Warehouse, and Why do we need a Data Warehouse?
- Database Vs Data Warehouse.
- Data Warehouse Architecture
- OLTP Vs OLAP
- What is ETL
Introduction to Snowflake
- How different from traditional DB
- Quick start to the snowflake and accessing trial account
- Creating warehouse, DB, Schema, and tables
- Accessing different roles and using it
- Working with worksheets
- Understanding different type of accounts
Understand Cloud platforms
- AWS and understanding S3 storage
- Snowflake architecture and caching
- AZURE and understanding blob storage
- GCP and understanding Bucket storage
Data Loading and Unloading
- File formats
- Internal and external storage
- Internal and external stage
- Copy into usage
- Snowflake internal storage
- Accessing Cloud storage data into Snowflake (GCP, AZURE and AWS)
- Data unloading
SnowPipe
- Accessing Snowpipe
- PUT and GET commands
- Bulk loading from cloud storage
- Continuous loading
Partner connect with snowflake
- Snowflake Connector and use cases Python
- BI connectors use cases
- Other connectors hands-on
Handling JSON and Semi structured data
- Variant Data Type
- File format options
- Creating stages
- Loading JSON semi-structured data into SF tables
- Accessing JSON with select statement
Scheduling with Snowflake
- Creating Tasks
- Streams
- Accessing procedures with tasks
- Scheduling as per time with Different time zones
- Automate loading process Daily and Weekly
Secure Data sharing
- Usage of sharing data
- Sharing data with different accounts
- Sharing data with non-SF accounts using reader accounts
- Importance of reader accounts
- Privileges in data sharing
- Challenges with cross-region sharing and understanding replication
- Connecting shared objects with BI tools
- Limitations with Data sharing
Cloning
- Limitations with Data sharing
- Access Control Privileges for Cloned Objects
- Cloning and Snowflake Objects
- Impact of DDL on Cloning
- Impact of DML and Data Retention on Cloning
Time travel & Failsafe
- Introduction to Time Travel
- Querying Historical Data
- Enabling and Disabling Time Travel
- Data Retention Period
- Cloning Using Time Travel (Databases, Schemas, and Tables Only)
Query Caching, Clustering, and Performance techniques
- Creating multi-users on large tables
- Performance techniques
- Result set cache
- Metadata cache
- Query data cache
- Best practices of using caching for performance and cost optimization
Advanced Topics
- Error Handling and Validations
- Snowflake Pricing model and selecting best Edition and Calculation of Credits usage
- Resource Monitoring
- Data Masking
- Partitioning and Clustering in snowflake
- Materialized View and Normal View
- Integration with Python
- Integration with AWS, Azure and Google
- Cloud Best Practices to follow
Instructor
