What You Need to Know Before You Start

Starts 23 June 2025 02:50

Ends 23 June 2025


Codifying K8s Knowledge: How We Built the Ultimate SRE Companion with Bedrock

DevOpsDays Tel Aviv via YouTube


30 minutes

Optional upgrade available

Not Specified

Progress at your own speed

Free Video


Syllabus

  • Introduction to Kubernetes and SRE
      • Overview of Kubernetes architecture
      • Role of Site Reliability Engineering (SRE) in managing Kubernetes
      • Introduction to common Kubernetes issues and troubleshooting
  • Understanding Tribal Knowledge in SRE
      • Definition and examples of tribal knowledge
      • Challenges of relying on undocumented expertise
      • Importance of codifying knowledge
  • Introduction to Amazon Bedrock
      • Overview of Amazon Bedrock and its capabilities
      • Benefits of using Amazon Bedrock for AI solutions
      • Integration of Amazon Bedrock with Kubernetes environments
  • Building an AI-powered SRE Companion
      • Key objectives of the AI companion
      • Designing the architecture for AI integration
      • Utilization of AI to transform tribal knowledge
  • Data Collection and Analysis
      • Methods for gathering SRE and Kubernetes data
      • Analyzing data for patterns and insights
      • Turning data insights into actionable knowledge
  • Developing AI Models for Troubleshooting
      • Creating and training AI models with Bedrock
      • Tailoring models to Kubernetes-specific issues
      • Testing model accuracy and performance
  • Implementing the SRE Companion for Incident Response
      • Integrating the AI companion into existing workflows
      • Enhancing incident response with actionable insights
      • Case studies of improved incident response times
  • Monitoring and Continuous Improvement
      • Setting up monitoring for AI model performance
      • Strategies for continuous learning and model updates
      • Gathering feedback and iterating on the AI companion
  • Best Practices and Lessons Learned
      • Best practices for deploying AI in SRE contexts
      • Challenges faced and solutions implemented
      • Lessons learned for future AI projects in Kubernetes environments
  • Conclusion and Future Directions
      • Summary of key learnings
      • Future possibilities for AI in Kubernetes and SRE
      • Closing thoughts on transforming SRE with AI technology

Subjects

Computer Science