Logging, Monitoring and Observability in Google Cloud

This two-day course will help you master fundamental techniques, modelled on site reliability engineering (SRE) principles, for monitoring, troubleshooting and improving infrastructure and application performance on Google Cloud.

A private training session for your team. Groups can be of any size, and the session can run at a location of your choice, including our training centres.

Course Credits

Select the pre-paid training investment that’s right for you and help your money stretch a little further with our course credits.

As a Google Cloud Partner, Jellyfish provides world-leading cloud-based training solutions to help clients succeed. This course covers monitoring, troubleshooting, and improving infrastructure and application performance.

Using a combination of presentations, demos, hands-on labs, and real-world case studies, you’ll gain experience across all the techniques required to support a live Google Cloud environment.

We’ll start with Google Cloud’s operations suite and set up monitoring and alerting policies. You’ll then install and manage the Ops Agent to collect Compute Engine logs, before delving further into GKE operations. After this, we’ll move on to supporting and monitoring the networking capabilities of Google Cloud, including VPC Flow Logs and firewall rules logs. Finally, you’ll look at supporting and managing application performance.

Our Logging, Monitoring and Observability in Google Cloud course is available as a private training session that can be delivered via Virtual Classroom or at a location of your choice in South Africa.

Course overview

Who should attend:

This course is suitable for cloud architects, administrators and SysOps personnel, as well as cloud developers and DevOps personnel.

What you'll learn:

By the end of this course, you will be able to:

  • Explain the purpose and capabilities of Google Cloud’s operations suite
  • Implement monitoring for multiple cloud projects
  • Create alerting policies, uptime checks and alerts
  • Install and manage Ops Agent to collect logs for Compute Engine
  • Explain Cloud operations for GKE
  • Analyze VPC Flow Logs and firewall rules logs
  • Analyze and export Cloud Audit Logs
  • Profile and identify resource-intensive functions in an application
  • Analyze resource utilization cost for monitoring-related components within Google Cloud

Prerequisites

To get the most out of this course, you should have completed the Google Cloud Fundamentals: Core Infrastructure course or have equivalent experience. You should also be familiar with basic scripting or coding, and be proficient with command-line tools and Linux operating system environments.

Course agenda

Module 1: Introduction to Google Cloud Operations Suite
  • Describe the purpose and capabilities of Google Cloud’s operations suite
  • Explain the purpose of the Cloud Monitoring tool
  • Explain the purpose of Cloud Logging and Error Reporting tools
  • Explain the purpose of Application Performance Management tools
Module 2: Monitoring Critical Systems
  • Use Cloud Monitoring to view metrics for multiple cloud projects
  • Explain the different types of dashboards and charts that can be built
  • Create an uptime check
  • Explain the cloud operations architecture
  • Explain and demonstrate the purpose of using Monitoring Query Language (MQL) for monitoring
Module 3: Alerting Policies
  • Explain alerting strategies
  • Explain alerting policies
  • Explain error budget
  • Explain why service-level indicators (SLIs), service-level objectives (SLOs), and service-level agreements (SLAs) are important
  • Identify types of alerts and common uses for each
  • Use Cloud Monitoring to manage services
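To give a flavour of the error budget idea named in Module 3, here is a minimal Python sketch of the arithmetic. It is not course material: the function names, the 30-day window and the sample numbers are illustrative assumptions, and it uses no Google Cloud API.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Minutes of allowed unavailability for an availability SLO over a window.

    slo_target is a fraction such as 0.999 (99.9%). The 30-day default
    window is an assumption made for illustration.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    window_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * window_minutes


def budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Fraction of a request-based error budget still unspent.

    A negative result means the budget has been overspent.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Hypothetical service with a 99.9% SLO over 30 days.
    print(round(error_budget_minutes(0.999), 1))
    print(round(budget_remaining(0.999, 1_000_000, 250), 2))
```

Under these assumptions, a 99.9% target over 30 days leaves roughly 43 minutes of allowed downtime, and an alerting policy would typically watch how quickly that allowance is being consumed.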
Module 4: Advanced Logging & Analysis
  • Use Logs Explorer features
  • Explain the features and benefits of logs-based metrics
  • Define log sinks (inclusion filters) and exclusion filters
  • Explain how BigQuery can be used to analyze logs
  • Export logs to BigQuery for analysis
  • Use Log Analytics on Google Cloud
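As a rough illustration of how a sink's inclusion filter and exclusion filters combine, the sketch below models the routing decision in plain Python. It is a simplified model, not the Cloud Logging API or its filter language: the `LogEntry` shape, the `sink_routes` function and the sample predicates are all hypothetical.

```python
from typing import Callable, Iterable, Mapping, Optional

# Hypothetical, simplified log entry: a flat mapping of field names to strings.
LogEntry = Mapping[str, str]
Predicate = Callable[[LogEntry], bool]


def sink_routes(
    entry: LogEntry,
    inclusion: Optional[Predicate] = None,
    exclusions: Iterable[Predicate] = (),
) -> bool:
    """Return True if the sink would route this entry to its destination.

    Modelled rule: the entry must match the inclusion filter (a missing
    filter matches everything) and must not match any exclusion filter.
    """
    if inclusion is not None and not inclusion(entry):
        return False
    return not any(excluded(entry) for excluded in exclusions)


def is_error(entry: LogEntry) -> bool:
    # Illustrative inclusion predicate: keep only ERROR entries.
    return entry.get("severity") == "ERROR"


def is_health_check(entry: LogEntry) -> bool:
    # Illustrative exclusion predicate: drop health-check noise.
    return entry.get("path") == "/healthz"


if __name__ == "__main__":
    entry = {"severity": "ERROR", "path": "/checkout"}
    print(sink_routes(entry, is_error, [is_health_check]))
```

In this model an exclusion always wins over the inclusion filter, which is why exclusion filters are a common way to trim noisy entries before they reach a destination such as BigQuery.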
Module 5: Working with Audit Logs
  • Explain Cloud Audit Logs
  • List and explain different audit logs
  • Explain the features and functionalities of the different audit logs
  • List the best practices to implement audit logs
Module 6: Configuring Google Cloud Services for Observability
  • Use the Ops Agent with Compute Engine
  • Enable and use Kubernetes Monitoring
  • Explain the benefits of using Google Cloud Managed Service for Prometheus
  • Explain the usage of PromQL to query Cloud Monitoring metrics
  • Explain the uses of OpenTelemetry
  • Explain custom metrics
Module 7: Monitoring Google Cloud Network & Data Access
  • Collect and analyze VPC Flow Logs and firewall rules logs
  • Enable and monitor Packet Mirroring
  • Explain the capabilities of the Network Intelligence Center
Module 8: Investigating Application Performance Issues
  • Explain the features and benefits of Error Reporting, Cloud Trace, and Cloud Profiler
  • Explain the functionality of Error Reporting, Cloud Trace, and Cloud Profiler
Module 9: Optimizing the Costs for Operations Suite
  • Analyze resource utilization cost for monitoring-related components within Google Cloud
  • Implement best practices for controlling the cost of monitoring within Google Cloud