Introduction When we started with Amazon ECS on AWS Fargate, it felt simple. No EC2 to manage. No AMIs. No cluster scaling headaches. Then the number of services grew. Working for the ad-tech client from last 5 years and running their workload on ECS Fargate has taught us many things. Different traffic patterns. Different...
When we run Elasticsearch in production, one of the common issues is imbalance in "shards". There may be one node in the cluster that is out of disk space, while a few nodes with no shards on them. For example, here is a node with all the shards: Node Shards Disk Used Disk % Free Space PESD222 957 329.1 GB 32% 694.2...
Introduction What if upgrading your Kubernetes cluster required no downtime at all? Imagine if you could upgrade your Kubernetes cluster and keep everything running smoothly, with zero downtime. Sounds pretty great, right? A lot of teams worry that upgrading will mean their apps go offline, but with solid planning, it's actually...
It is painfully inefficient to check metrics across a large collection of AWS accounts (development, staging, uat, production, etc.). This is a major time waster, not just a small irritation. In addition to wasting valuable engineering time, you run a much higher risk of missing an alert that could result in a full-blown outage every time...
Introduction If you have a Java application running in Kubernetes, sooner or later you will want to know what’s really going on inside the JVM. And, is heap memory close to exhaustion? Is the garbage collection process busy? Are we slowly moving towards an OOM error? Without oversight, you’re essentially flying blind. In this guide,...
Introduction Managing MongoDB users through command-line can be complex and time-consuming. Our MongoDB User Management Console provides a modern web-based GUI that makes database user administration simple and accessible for teams. What It Does This Node.js web application offers a complete solution for MongoDB user management...
Introduction When you work with AWS infrastructure for some time, you realise that not all problems announce themselves with alerts or outages. Some problems stay quiet, blend into the background, and only reveal themselves later-usually when someone asks a question you can’t answer clearly. This is one such experience from my early...
In this BLOG I will share an incident that taught me how containers really work under the hood. Production Down - Once I received production website down alert for one of my customer. As I checked the website was giving 502 [caption id="attachment_77705" align="alignnone" width="846"] website-down[/caption] Initial Checks - I...
Introduction DNS is rarely the first thing teams modernise. In most client environments we work with at To The New, CI/CD, cloud infrastructure, and observability mature quickly. DNS, however, often remains manually managed through dashboards, handled by a few people, and changed mostly during incidents. That gap usually goes unnoticed...
Introduction In DevOps, upgrades are rarely exciting. They don’t ship new features (most of the time). They don’t impress clients. They don’t always get leadership applause. And yet, over the years at To The New, one thing has become very clear to us: DevOps teams that do upgrades regularly move faster, stay safer,...
Introduction When teams start on their DevOps journey, the excitement is real. CI/CD pipelines, faster deployments, cloud-native tools, automation everywhere - it feels like everything is finally going to be smooth. But in reality, the first year of DevOps is rarely smooth. It’s messy, experimental, and full of learning. [caption...
Introduction Reducing cloud costs is always the top priority and biggest headache for Devops Engineers, especially when using managed AWS services like ECS Fargate. For one of our Ad-Tech clients at TO THE NEW, we were already utilising Fargate Spot to reduce the ECS bill significantly. But we found that we could save even more money if...