Cloud Architect & Engineering Leader
Hi, I'm Tebogo
Cloud architect and engineering leader building scalable platforms on AWS. Writing about architecture, DevOps, AI, and the journey to senior technical leadership.
Stay in the loop
Get notified when I publish new articles on cloud architecture, DevOps, and technical leadership.
No spam. Unsubscribe anytime.
Recent Articles
Thoughts on cloud, code, and leadership
AI Governance in Practice: FastAPI on EKS with Model Cards, Audit Logging, and Helm
How I built an AI governance platform on AWS EKS — FastAPI inference endpoint with per-request audit logging, model card endpoint, fairness metadata, and Helm-packaged deployment with HPA.
Building a Production MLOps Pipeline on AWS SageMaker for Telecom Churn
How I built an end-to-end MLOps pipeline with SageMaker Pipelines, automated retraining via EventBridge, and drift monitoring using KS tests and CloudWatch — for a telecom churn use case.
Predicting Telecom Customer Churn with scikit-learn, Keras, and Amazon SageMaker
Learn how to build a telecom customer churn predictor using Random Forest, Keras neural networks, and deploy it to a real-time SageMaker endpoint. Full code included.
Featured Projects
Things I've built and shipped
Telecom Churn Predictor
Binary churn classifier built with scikit-learn (Random Forest, Logistic Regression) and Keras (deep neural network), evaluated against a real-world telecoms dataset, and deployed as a SageMaker real-time endpoint. Includes full model comparison pipeline and Jupyter exploratory analysis.
MLOps Pipeline — SageMaker
End-to-end MLOps pipeline on AWS SageMaker: automated preprocessing, training, evaluation, conditional model registration, and Lambda-triggered retraining on drift detection. Built with the SageMaker Pipelines SDK and Terraform infrastructure.
Real-Time IoT Telemetry Pipeline
Real-time IoT telemetry pipeline for telecoms infrastructure monitoring: multi-device simulator publishes sensor readings to Kinesis, a Lambda consumer writes to DynamoDB with partial batch failure handling, and an async alert handler fires SNS notifications and CloudWatch metrics on threshold breaches.