DevOps

Fixing Unbalanced Shards in Elasticsearch: Why One Node Holds All Your Data

When we run Elasticsearch in production, one of the common issues is imbalance in “shards”. There may be one node in the cluster that is out of disk space, while a few nodes with no shards on them. For example, here is a node with all the shards: Node Shards Disk Used Disk % Free […]

DevOps

Rolling Node Replacement: The Safest Way to Upgrade Kubernetes

Introduction What if upgrading your Kubernetes cluster required no downtime at all? Imagine if you could upgrade your Kubernetes cluster and keep everything running smoothly, with zero downtime. Sounds pretty great, right? A lot of teams worry that upgrading will mean their apps go offline, but with solid planning, it’s actually possible to have safe […]

DevOps

How to Centralize AWS Monitoring: A Guide to CloudWatch Cross-Account Metrics

It is painfully inefficient to check metrics across a large collection of AWS accounts (development, staging, uat, production, etc.). This is a major time waster, not just a small irritation. In addition to wasting valuable engineering time, you run a much higher risk of missing an alert that could result in a full-blown outage every […]

DevOps

How to Make Your Java App Monitoring with JMX Exporter & Prometheus

Introduction If you have a Java application running in Kubernetes, sooner or later you will want to know what’s really going on inside the JVM. And, is heap memory close to exhaustion? Is the garbage collection process busy? Are we slowly moving towards an OOM error? Without oversight, you’re essentially flying blind. In this guide, […]

DevOps

DNS Migration Done Right: Lessons from Moving to Route 53

Introduction DNS migrations don’t usually get much attention. They’re invisible when done right and very loud when done wrong. At TO THE NEW, we recently migrated DNS for an ad tech client from NS1 (an IBM Product) to AWS Route 53 as part of their large move to AWS and cost savings. On paper, this […]

DevOps

I Left This AWS Task Half-Done for 2 Weeks – Here’s What It Taught Me

Introduction When you work with AWS infrastructure for some time, you realise that not all problems announce themselves with alerts or outages. Some problems stay quiet, blend into the background, and only reveal themselves later-usually when someone asks a question you can’t answer clearly. This is one such experience from my early days of working […]

Vivek Tiwary
Vivek Tiwary
Read

DevOps

Powering Real-Time Multiplayer Games on AWS: From Chaos to a Managed Backbone

Real-time multiplayer games are unforgiving. Players don’t care for your flashy server hardware and state of the art network infrastructure, they care about quick response matches, fair competition, and smooth gameplay. When that breaks, they don’t blame latency graphs; they blame the game. At TO THE NEW, we’ve seen this pattern repeatedly: studios that treat […]

Saurabh Jain
Saurabh Jain
Read

DevOps

DNS as Code in Action: Lessons from a Client Project with NS1 and Terraform

Introduction DNS is rarely the first thing teams modernise. In most client environments we work with at To The New, CI/CD, cloud infrastructure, and observability mature quickly. DNS, however, often remains manually managed through dashboards, handled by a few people, and changed mostly during incidents. That gap usually goes unnoticed until traffic needs to be […]

DevOps

Why Regular Upgrades Are Non-Negotiable in Modern DevOps

Introduction In DevOps, upgrades are rarely exciting. They don’t ship new features (most of the time). They don’t impress clients. They don’t always get leadership applause. And yet, over the years at To The New, one thing has become very clear to us: DevOps teams that do upgrades regularly move faster, stay safer, and break […]

Services