Master On-Call Engineering
in a Safe Environment

Train on real infrastructure scenarios. Build confidence. Reduce onboarding time from months to days.

Get early access and lifelong perks as an early adopter.

Kafka MirrorMaker Lag Alert
⚠ INCIDENT ALERT - Production Down
You are working as a data engineer for an e-commerce platform using Kafka to handle
real-time data across multiple regions. An alert about growing MirrorMaker lag requires
your immediate attention.
user@oncalm:~$ kafka-topics --describe --topic order-events --bootstrap-server primary:9092
Error: Topic 'order-events' not found.
user@oncalm:~$ _
Time Elapsed: 02:47 ⏱ Your target: 15:00

The On-Call Reality

Production incidents don't wait for you to be ready

Shadowing takes weeks or months

Senior engineers spend valuable time walking juniors through scenarios they might never see

First production incident = Learning on the job

Your first real incident shouldn't be when thousands of users are affected

Mastery takes years of stress

On-call expertise comes from experience which means years of high-pressure incidents

Downtime costs thousands per minute

Every mistake in production means lost revenue and damaged customer trust

How OnCalm Works

Practice real incidents, zero risk

Real Infrastructure

Work with actual Kubernetes clusters, PostgreSQL databases, Kafka message queues and other technologies deployed on real servers.

Work Under Pressure

Timed challenges simulate the stress of production incidents. Build muscle memory for high-pressure decision making.

Track Your Progress

Get scored on approach, speed, and solution quality. Receive insights into your actions and learn from every challenge.

New Challenges Weekly

Fresh challenges every week covering diverse technologies and failure modes. Always something new to master.

Learn by Troubleshooting

Master technologies from the on-call perspective. Build intuition for what breaks, why, and how to fix it fast.

Interactive Terminal

Embedded browser terminal gives you direct access to infrastructure without cluttering your environment.

Perfect For

On-Call Preparation

Get ready for your first on-call rotation with confidence

Junior/Mid SREs

Accelerate your learning curve and build practical skills

Career Transitions

Software engineers moving into on-call or DevOps roles

Engineering Teams

Reduce shadowing overhead and standardize training

Continuous Learning

Keep skills sharp with regular practice challenges

Tech Skill Mastery

Deep dive into Kubernetes, databases, messaging, and more

Real Scenarios You'll Face

Practice challenges based on actual production incidents

Website Down

Investigate Kubernetes deployment failures causing complete service outage. Users can't access the platform.

Kubernetes Networking Medium

Payment Processing Stopped

Debug database connection pool exhaustion preventing payment transactions. Revenue is bleeding.

PostgreSQL Database Hard

Kafka MirrorMaker Lag

Fix growing cross-region data replication lag. Secondary region is falling behind primary by hours.

Kafka Streaming Medium

Disk Space Critical

Server disk is full causing service crashes. Identify what's consuming space and recover quickly.

Linux Infrastructure Easy

Ready to Level Up Your On-Call Skills?

Join the closed beta. Get early access to 25+ infrastructure challenges.

No credit card required. Early access guaranteed.