System Design Blueprint.

Interactive breakdowns of structural design decisions, cloud scaling models, cost efficiency boundaries, and deployment configurations.

Event-Driven Global Uptime & SSL Monitoring Engine
Trigger Scheduler: AWS ECS Scheduled Task (cron checks at 30s intervals)
↓ Distributes Checks
Queue Broker: Redis Queue (BullMQ), managed task queues
↓ Polls Jobs (Auto-scales)
Fargate Node A: async DNS override (dns.resolve4 resolver)
Fargate Node B: TLS socket auditor (SNI certificate inspection)
↓ Persists Logs
Telemetry Storage: PostgreSQL (TimescaleDB), optimized time-series metrics

Scaling Decisions

  • Eliminated libuv thread-pool contention on concurrent uptime queries by replacing the default dns.lookup path (blocking getaddrinfo calls on the libuv thread pool) with the asynchronous, network-based dns.resolve4.
  • Scaled the job-polling worker fleet dynamically by running workers as Fargate tasks whose count follows BullMQ queue-size monitors.
  • Ran certificate and response checks over direct, non-blocking socket connections, so the event loop handles them without extra threads.
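
The DNS override in the first bullet could look roughly like the sketch below. It is an illustration under assumptions, not the project's actual code: makeResolve4Lookup is a hypothetical name, and the resolver is injectable only so the function can be exercised without network access. It adapts dns.resolve4 to the callback shape that Node's `lookup` connection option expects.

```javascript
const dns = require("node:dns");
const net = require("node:net");

// Hypothetical helper: builds a `lookup`-compatible function backed by
// dns.resolve4, which queries DNS servers directly instead of calling
// getaddrinfo on the libuv thread pool.
function makeResolve4Lookup(resolve4 = dns.resolve4) {
  return function lookup(hostname, options, callback) {
    // dns.lookup allows the options argument to be omitted.
    if (typeof options === "function") {
      callback = options;
      options = {};
    }
    const wantAll = Boolean(options && options.all);
    // Answer in whichever shape the caller asked for.
    const reply = (entries) =>
      wantAll
        ? callback(null, entries)
        : callback(null, entries[0].address, entries[0].family);

    // IP literals need no DNS query at all.
    const ipVersion = net.isIP(hostname);
    if (ipVersion !== 0) {
      process.nextTick(reply, [{ address: hostname, family: ipVersion }]);
      return;
    }

    resolve4(hostname, (err, addresses) => {
      if (err) return callback(err);
      reply(addresses.map((address) => ({ address, family: 4 })));
    });
  };
}
```

A function like this can be passed as the `lookup` option of http.request, https.request, or tls.connect. One trade-off to keep in mind: dns.resolve4 always queries the network and does not consult /etc/hosts, and this sketch resolves IPv4 addresses only.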

Cost Optimization

  • Ran staging and dev workers on Fargate Spot capacity, cutting active worker node costs by 70% in those environments.
  • Added in-process DNS and connection caching, reducing external lookups and repeated API requests.
  • Configured automatic TimescaleDB compression policies for raw records older than 7 days, reducing the storage footprint.
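
The DNS caching mentioned above can be sketched as a small in-memory cache with a time-to-live. This is a minimal illustration, not the project's implementation: withTtlCache, the TTL value, and the injectable clock are all assumptions made for the example.

```javascript
// Hypothetical helper: wraps any resolver function with an in-memory TTL cache.
// The cache stores the promise itself, so concurrent requests for the same key
// share a single in-flight lookup.
function withTtlCache(resolveFn, ttlMs, now = Date.now) {
  const cache = new Map(); // key -> { value: Promise, expiresAt: number }

  return function cached(key) {
    const hit = cache.get(key);
    if (hit && hit.expiresAt > now()) return hit.value;

    const entry = { value: null, expiresAt: now() + ttlMs };
    // Wrapping in `new Promise` turns synchronous throws into rejections.
    entry.value = new Promise((resolve) => resolve(resolveFn(key))).catch((err) => {
      // Do not keep failed lookups around; the next call should retry.
      if (cache.get(key) === entry) cache.delete(key);
      throw err;
    });
    cache.set(key, entry);
    return entry.value;
  };
}
```

For DNS, it could wrap the promise-based resolver, for example `withTtlCache((host) => require("node:dns").promises.resolve4(host), 30000)`. The 30-second TTL is an arbitrary placeholder; a production version might honor the record's own TTL instead.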

Monitoring Stack

  • Tracks endpoint status (uptime, TLS configuration, SSL certificate expiry, response time).
  • Applies a moving Z-score check that compares each latency sample with the distribution of recent checks, so ordinary network jitter is filtered out instead of raising alerts.
  • Integrates multi-channel notifications (Email, SMS, Slack, Discord webhooks) to warn developers of outages.
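
The moving Z-score check can be illustrated with the sketch below. The window size, threshold, minimum sample count, and the name createLatencyMonitor are assumptions chosen for the example; the source does not state the actual parameters. Each new sample is scored against the mean and standard deviation of the previous samples in a sliding window, and only slow outliers are flagged.

```javascript
// Hypothetical sketch of a moving Z-score latency check.
function createLatencyMonitor({ windowSize = 30, threshold = 3, minSamples = 5 } = {}) {
  const samples = []; // sliding window of recent latencies in milliseconds

  return function observe(latencyMs) {
    let z = 0;
    // Score only once there is enough history to estimate a baseline.
    if (samples.length >= minSamples) {
      const mean = samples.reduce((sum, x) => sum + x, 0) / samples.length;
      const variance =
        samples.reduce((sum, x) => sum + (x - mean) ** 2, 0) / samples.length;
      const std = Math.sqrt(variance);
      // A perfectly flat window gives no spread to compare against; treat as no signal.
      z = std > 0 ? (latencyMs - mean) / std : 0;
    }

    // The new sample joins the window after it has been scored.
    samples.push(latencyMs);
    if (samples.length > windowSize) samples.shift();

    // One-sided test: only unusually slow responses count as anomalies.
    return { z, anomalous: z > threshold };
  };
}
```

With a threshold of 3, a sample must sit more than three standard deviations above the recent mean before it is flagged, which is what lets small fluctuations pass silently. This sketch also keeps flagged samples in the window, so a sustained slowdown gradually becomes the new baseline; whether that is desirable depends on the alerting policy.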

CI/CD Pipeline

  • GitHub Actions workflows build multi-platform Docker images and push them to Amazon ECR.
  • Triggers rolling blue-green container service rollouts on ECS via task definition updates.