Logging, Monitoring and Observability in Google Cloud

This three-day instructor-led course covers techniques for monitoring, troubleshooting, and improving infrastructure and application performance on Google Cloud.
Available as a private training session for your team. Groups can be of any size, and the course can be delivered at a location of your choice, including our training centres.

This course covers techniques for monitoring, troubleshooting, and improving infrastructure and application performance in Google Cloud, guided by the principles of Site Reliability Engineering (SRE).

Using a combination of presentations, demos, hands-on labs, and real-world case studies, attendees will gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, tracing application performance bottlenecks, and profiling CPU and memory usage.

Jellyfish has recently been named a Google Cloud Specialisation Partner of the Year. This title recognises our commitment to providing world-leading cloud-based training solutions that help our clients succeed. Our Logging, Monitoring and Observability in Google Cloud course is available as a private training session, which runs over three consecutive days and can be delivered at a location of your choice.

Course overview
Who should attend:

This course is intended for cloud architects, administrators, and SysOps personnel, as well as cloud developers and DevOps personnel.

Walk away with the ability to:
  • Plan and implement a well-architected logging and monitoring infrastructure
  • Define Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
  • Create effective monitoring dashboards and alerts
  • Monitor, troubleshoot, and improve Google Cloud infrastructure
  • Analyse and export Google Cloud audit logs
  • Find production code defects, identify bottlenecks, and improve performance
  • Optimise monitoring costs

Attendees should have basic scripting or coding ability and be proficient with command-line tools and Linux operating system environments. They should also understand the principles of Google Cloud as covered in the one-day course Google Cloud Fundamentals: Core Infrastructure, or have equivalent experience.

Course agenda
Module 1: Introduction to Google Cloud Monitoring Tools
  • Understand the purpose and capabilities of Google Cloud operations-focused components (Logging, Monitoring, Error Reporting, and Service Monitoring)
  • Understand the purpose and capabilities of Google Cloud application performance management focused components (Debugger, Trace, and Profiler)
Module 2: Avoiding Customer Pain
  • Construct a monitoring base around the four golden signals: latency, traffic, errors, and saturation
  • Measure customer pain with SLIs
  • Define critical performance measures
  • Create and use SLOs and SLAs
  • Achieve developer and operations harmony with error budgets
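
As an illustration of how an SLI, an SLO, and an error budget relate, here is a minimal sketch. It is not course material: the function name, the request counts, and the 99.9% target are all hypothetical values chosen for the example, and the SLI is assumed to be a simple ratio of successful requests to total requests.

```python
def error_budget_report(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarise an availability SLI against an SLO (hypothetical helper)."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    # SLI: fraction of requests that succeeded in the measurement window.
    sli = (total_requests - failed_requests) / total_requests
    # Error budget: the number of failures the SLO tolerates in the window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return {
        "sli": sli,
        "slo_met": sli >= slo_target,
        "allowed_failures": allowed_failures,
        "budget_remaining": allowed_failures - failed_requests,
        "budget_consumed_fraction": failed_requests / allowed_failures,
    }


# Assumed example: 1,000,000 requests, 600 failures, 99.9% availability SLO.
report = error_budget_report(slo_target=0.999, total_requests=1_000_000, failed_requests=600)
print(report)
```

With these assumed numbers the service is within its SLO and has used roughly 60% of its error budget. A team could use the remaining budget to decide how much release risk to take on.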
Module 3: Alerting Policies
  • Develop alerting strategies
  • Define alerting policies
  • Add notification channels
  • Identify types of alerts and common uses for each
  • Construct and alert on resource groups
  • Manage alerting policies programmatically
Module 4: Monitoring Critical Systems
  • Choose best-practice monitoring project architectures
  • Differentiate Cloud IAM roles for monitoring
  • Use the default dashboards appropriately
  • Build custom dashboards to show resource consumption and application load
  • Define uptime checks to track aliveness and latency
Module 5: Configuring Google Cloud Services for Observability
  • Integrate logging and monitoring agents into Compute Engine VMs and images
  • Enable and use Kubernetes Monitoring
  • Extend and clarify Kubernetes monitoring with Prometheus
  • Expose custom metrics through code, and with the help of OpenCensus
Module 6: Advanced Logging and Analysis
  • Identify and choose among resource tagging approaches
  • Define log sinks (inclusion filters) and exclusion filters
  • Create metrics based on logs
  • Define custom metrics
  • Link application errors to Logging using Error Reporting
  • Export logs to BigQuery
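
To make the ideas of inclusion filters, exclusion filters, and log-based metrics concrete, the sketch below models them in plain Python. It does not use the Cloud Logging API or its query language; the log entries, field names, and helper functions are all hypothetical and exist only to show the logic.

```python
from collections import Counter

# Hypothetical log entries; real Cloud Logging entries have a richer schema.
LOG_ENTRIES = [
    {"severity": "INFO", "resource": "gce_instance", "message": "health check ok"},
    {"severity": "ERROR", "resource": "gce_instance", "message": "disk full"},
    {"severity": "DEBUG", "resource": "k8s_container", "message": "cache miss"},
    {"severity": "ERROR", "resource": "k8s_container", "message": "timeout"},
    {"severity": "INFO", "resource": "k8s_container", "message": "request served"},
]


def route(entries, include, exclude):
    """Keep entries that match the inclusion filter and not the exclusion filter."""
    return [e for e in entries if include(e) and not exclude(e)]


def count_by(entries, field):
    """A counter-style log-based metric: number of entries per value of `field`."""
    return Counter(e[field] for e in entries)


# Inclusion filter: only container logs. Exclusion filter: drop noisy DEBUG entries.
routed = route(
    LOG_ENTRIES,
    include=lambda e: e["resource"] == "k8s_container",
    exclude=lambda e: e["severity"] == "DEBUG",
)
errors_per_resource = count_by(
    [e for e in LOG_ENTRIES if e["severity"] == "ERROR"], "resource"
)
print(len(routed), dict(errors_per_resource))
```

The same split applies conceptually in a real deployment: an inclusion filter decides which entries a sink routes to its destination, an exclusion filter drops entries that would otherwise be stored, and a log-based metric counts the entries that match a filter.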
Module 7: Monitoring Network Security and Audit Logs
  • Collect and analyse VPC Flow logs and Firewall Rules logs
  • Enable and monitor Packet Mirroring
  • Explain the capabilities of Network Intelligence Center
  • Use Admin Activity audit logs to track changes to the configuration or metadata of resources
  • Use Data Access audit logs to track accesses or changes to user-provided resource data
  • Use System Event audit logs to track Google Cloud administrative actions
Module 8: Managing Incidents
  • Define incident management roles and communication channels
  • Mitigate incident impact
  • Troubleshoot root causes
  • Resolve incidents
  • Document incidents in a post-mortem process
Module 9: Investigating Application Performance Issues
  • Debug production code to correct code defects
  • Trace latency through layers of service interaction to eliminate performance bottlenecks
  • Profile and identify resource-intensive functions in an application
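
The idea of profiling to find resource-intensive functions can be tried locally before using any managed tooling. The sketch below uses Python's standard-library cProfile and pstats modules, not Cloud Profiler; the functions `slow_sum`, `fast_sum`, and `handle_request` are hypothetical stand-ins for application code.

```python
import cProfile
import io
import pstats


def slow_sum(n):
    """Sum of squares below n, computed with an explicit loop (the hot spot)."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def fast_sum(n):
    """The same sum of squares, computed with the closed-form formula."""
    return (n - 1) * n * (2 * n - 1) // 6


def handle_request(n):
    """Hypothetical request handler that calls both implementations."""
    return slow_sum(n), fast_sum(n)


def profile_call(func, *args):
    """Run func under cProfile and return its result plus a text report."""
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats()
    return result, buffer.getvalue()


result, report = profile_call(handle_request, 200_000)
print(report)
```

In the printed report, the loop-based function should account for most of the cumulative time, which is the kind of signal a profiler gives when looking for functions worth optimising.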
Module 10: Optimising the Costs of Monitoring
  • Analyse resource utilisation costs for monitoring-related components within Google Cloud
  • Implement best practices for controlling the cost of monitoring within Google Cloud