Skip to content
#

site-reliability-engineering

Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

Here are 19 public repositories matching this topic...

litmus

🏥 A production-ready CLI tool for monitoring HTTP endpoints and automatically reporting success/failure to healthchecks.io. Single binary & cross-platform.

  • Updated Sep 5, 2025
  • Go
Followers
135 followers
Website
github.com/topics/sre
Wikipedia
Wikipedia