SRE (Site Reliability Engineering)

Concept

  • The practice of using software tools to automate IT infrastructure work such as system management and application monitoring.
  • Keeps applications stable amid the development team's frequent updates.
  • Improves the reliability of scalable systems, because managing a large system sustainably through software is more stable than manually managing hundreds of servers.

SRE์˜ ์ค‘์š”์„ฑโ€‹

  • ์‚ฌ์ดํŠธ ์‹ ๋ขฐ์„ฑ์€ ์ตœ์ข… ์‚ฌ์šฉ์ž์—๊ฒŒ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์ด ์ œ๊ณตํ•˜๋Š” ์„œ๋น„์Šค์˜ ์‹ ๋ขฐ์„ฑ๊ณผ ํ’ˆ์งˆ์„ ์˜๋ฏธํ•œ๋‹ค.
  • ๊ฐœ๋ฐœ, ์šด์˜ ํŒ€๊ฐ„ ํ˜‘์—…์ด ๊ฐ•ํ™”๋œ๋‹ค.
  • ๊ณ ๊ฐ์˜ ๊ฒฝํ—˜(UX)์— ์˜ํ–ฅ์ด ๋ฏธ์น˜์ง€ ์•Š๊ฒŒ๋œ๋‹ค.

Core Principles of SRE

  1. Application monitoring
  2. Gradual change implementation
  3. Automation for improved reliability
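
The three principles can be sketched together as a staged rollout. This is a minimal illustration, not a prescribed SRE implementation: the function `staged_rollout`, the stage percentages, the 1% error limit, and the `observed` measurements are all hypothetical.

```python
from typing import Callable, Sequence


def staged_rollout(
    stages: Sequence[int],
    error_rate_at: Callable[[int], float],
    max_error_rate: float = 0.01,
) -> int:
    """Raise the traffic share stage by stage; return the final share (0 = rolled back)."""
    for percent in stages:
        # Monitoring: observe the new version's error rate at this stage.
        rate = error_rate_at(percent)
        if rate > max_error_rate:
            # Automation: roll back without waiting for a human.
            return 0
    # Gradual change: every stage passed, so the last share stays in place.
    return stages[-1] if stages else 0


# Hypothetical measurements: the new version degrades once it takes half the traffic.
observed = {1: 0.001, 10: 0.002, 50: 0.030, 100: 0.030}

healthy_result = staged_rollout([1, 10], observed.__getitem__)
failing_result = staged_rollout([1, 10, 50, 100], observed.__getitem__)
print(healthy_result, failing_result)  # prints "10 0"
```

Because the change is exposed in small steps, the failing run is stopped at the 50% stage and never reaches all users.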

Observability

  • The process of preparing a team to cope with uncertainty when software is deployed to end users.
  1. Metrics
  2. Logs
  3. Traces
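
To make the difference between the three signals concrete, the sketch below emits one of each using only the Python standard library. It assumes a made-up service (`checkout`) and made-up names (`requests_total`, `order_placed`, `load_cart`); a real system would more likely use a dedicated telemetry library.

```python
import json
import logging
import time
from collections import Counter
from contextlib import contextmanager

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger("checkout")  # hypothetical service name

metrics = Counter()  # metrics: numeric values aggregated over time
spans = []           # traces: timed steps that share one trace id


@contextmanager
def span(trace_id, name):
    """Record how long one step of a request took."""
    start = time.perf_counter()
    try:
        yield
    finally:
        spans.append({"trace_id": trace_id, "name": name,
                      "seconds": time.perf_counter() - start})


def handle_request(trace_id, user):
    metrics["requests_total"] += 1
    with span(trace_id, "handle_request"):
        with span(trace_id, "load_cart"):
            pass  # real work would go here
        # logs: a discrete event with context, emitted as one JSON line
        log.info(json.dumps({"event": "order_placed", "user": user,
                             "trace_id": trace_id}))


handle_request("t-1", "alice")
handle_request("t-2", "bob")
print(metrics["requests_total"], [s["name"] for s in spans])
```

The shared `trace_id` is what lets a log line be matched to the spans of the same request.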

Monitoring

  • The process of observing predefined metrics from an application.
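
As a rough illustration of "predefined metrics", the sketch below compares one sample against fixed limits. The metric names, the limits in `THRESHOLDS`, and the `check` function are assumptions chosen for the example and do not come from any particular monitoring tool.

```python
# Hypothetical predefined metrics and the limit each must stay under.
THRESHOLDS = {
    "error_rate": 0.01,       # fraction of failed requests
    "p95_latency_ms": 300.0,  # 95th-percentile latency in milliseconds
    "cpu_utilization": 0.85,  # fraction of CPU in use
}


def check(sample: dict) -> list:
    """Return an alert message for each predefined metric that is over its limit or missing."""
    alerts = []
    for name, limit in THRESHOLDS.items():
        value = sample.get(name)
        if value is None:
            # A metric that stops reporting is itself worth an alert.
            alerts.append(f"{name}: no data")
        elif value > limit:
            alerts.append(f"{name}: {value} exceeds {limit}")
    return alerts


alerts = check({"error_rate": 0.002, "p95_latency_ms": 450.0})
print(alerts)
```

Monitoring of this kind only answers questions that were decided in advance, which is why the metric list is fixed before any data arrives.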