MSP

Patching Azure Virtual Machines from AWS Systems Manager using Hybrid Activation

Patching Azure VMs from AWS Systems Manager using Hybrid Activation Each cloud platform provides its own native tools, which can lead to fragmented processes and increased administrative overhead. To address this challenge, AWS Systems Manager (SSM) offers a powerful solution through its Hybrid Activation feature. This capability allows non-AWS machines, such as Azure Virtual Machines […]

April 1, 2026

MSP

Designing Scalable AI Systems: RAG vs Long Context Trade-offs Explained

Introduction Every LLM you’ve ever used is stuck in the past. It knows everything up to its training cutoff and nothing after. It also does not know about your internal documents,your private codebase,or what just happened 5 mis ago. So if you want a model to actually be useful in a real time production environment,you […]

March 26, 2026

MSP

Cross-Account Centralised Logging in AWS Using S3, KMS, and SQS for SIEM Integration

Introduction In a multi-account AWS environment, log management for services such as Cloud Trail, VPC Flow Logs, and WAF is a complex and fragmented process. This is due to the fact that each account has its own log data, which is not easy for security and operations teams to manage centrally. This issue, however, can […]

March 26, 2026

MSP

Incident Management in Cloud MSP: From Alert to Resolution (A Real World Approach)

1. Introduction In a Cloud Managed Services Provider (MSP) ecosystem, incident management is a critical function that directly impacts service availability, SLA adherence, and customer experience. With modern cloud architectures (AWS, hybrid, microservices), incidents are no longer isolated—they are multi-layered and interdependent. This demands a structured, fast, and practical approach to incident handling. This paper […]

March 25, 2026

MSP

Storing SNS Messages in S3 using Kinesis Data Firehose – step by step implementation with Real-World Use Cases

Introduction We know that many applications generate large amounts of event data such as alerts, application events, logs, and notifications. This data is usually unstructured and arrives in a continuous manner. The initial step in creating a data engineering pipeline is to store this event data into a reliable and long term storage system in […]

March 20, 2026

MSP

CloudWatch vs CloudTrail vs AWS Config – When to Use What?

In AWS environments, visibility is critical. When applications run across multiple services, engineers need tools that help them monitor performance, track user activity, and maintain configuration compliance. Three AWS services commonly used for this purpose are Amazon CloudWatch, AWS CloudTrail, and AWS Config. Although these services are related to monitoring and auditing, they solve different […]

March 20, 2026

MSP

How VPC Components Work Together (Subnet, IGW, NAT, Route Tables)

In AWS networking, it is common to configure all required components—subnets, gateways, and route tables—yet still encounter connectivity issues. In most cases, the problem is not with individual components, but with a lack of understanding of how these components interact with each other. This article explains how key VPC components—subnets, route tables, Internet Gateway, and […]

March 20, 2026

MSP

Running GitLab CI at Scale: Setting Up Kubernetes-Based Runners

Introduction As the project grows and traditional CI runners running on static virtual machines or shared runners often struggle to keep up with increasing workloads. Jobs may queue up, builds start taking longer than expected and developers are left waiting. what’s the solution? So, instead of using traditional executioners we can leverage the GitLab Runner […]

March 11, 2026

MSP

Chaos Engineering: Simulating Network Latency using AWS FIS

Introduction Modern applications have distributed systems consisting of multiple services, containers, and infrastructure components. While it improves scalability, security and reliability, it also increases the chances of unexpected failures and downtime. Application testing methods majorly focus on application functionality, but they rarely test how systems behave in real-world failures such as instance crashes, network latency, […]

March 10, 2026