AnayticsBig DataCloud

Improving query performance in Snowflake and its related costs

In the previous blog, we understood how to Optimally use Snowflake Warehouse and Tables. Now, we continue the series by diving into Snowflake performance tuning, focusing on how to enhance query performance and manage associated costs in Snowflake cloud services. So let’s continue the blog series, where we will now focus on improving the performance […]

Big DataData & AnalyticsIndustry Buzz

Our experience through Data Engineering Summit 2023 in Bengaluru

AI is everywhere, but there’s less attention to the engine running behind it. Data Engineering is what makes the right data available at the right time for all these AI use cases. Happy that someone thought about it and organized this 2-day event DES2023, focused on Data Engineering in Bengaluru, where we got to hear […]

AnayticsBig DataCloud

Optimal use from Snowflake Warehouse and Tables

In the previous blog, we discussed the Best practices to be followed while Data loading into Snowflake from Stages. Continuing the snowflake blog series lets us understand how to use Snowflake Warehouse and Tables optimally. Snowflakes Virtual Warehouses Virtual Warehouses is one of the critical components in Snowflake architecture and deciding the correct configurations for […]

Big Data

No Code Data Ingestion Framework using NiFi

Data Ingestion: Data ingestion is the transportation of data from assorted sources to a storage medium where it can be accessed, used, and analyzed by an organization. The destination is typically a data warehouse, data mart, database, or a document store.Sources can be from  RDBMS like MySql, Oracle, Postgres, File based like FTP,SFTP,Rest api’s,Streaming .The […]

AnayticsBig DataCloud

Best practices and hacks for Data Loading in Snowflake from Stages

Continuing our Snowflake blog series, after learning about setting up a  Snowflake account using System-defined Roles, we will explore the best practices for loading data from a file into Snowflake. Let’s begin. Snowflake supports file-based data ingestion through Internal and External stages.  However, there are various factors to consider when performing data ingestion, including the […]

AnayticsBig DataCloud

Snowflake Account setup using System defined roles

This is the first blog in a series that will focus on Snowflake, where we’ll cover best practices for using Snowflake, explore various Snowflake functionalities, discuss how to maximize the benefits of Snowflake, and address the challenges that come with its implementation or migration. In this blog, we’ll start by discussing setting up a Snowflake […]

Kedhar Natekar
Kedhar Natekar
Read

AWSBig DataCloud

Mirror Maker for Kafka Migration

For one of our Global Advertising Management Platform clients, we did one migration project with zero downtime for components like Platform DB, Ceph, Aerospike, Kafka (Zookeeper +data nodes), MapR (hive, oozie, hue), Druid (Zookeeper +data nodes), Flink (Zookeeper +data nodes), Monitoring (Icinga,collectd, cloudwatch), Logging (logstash & Opensearch) & Other Components ( Nexus, SFTP, Jenkins ). […]

AWSBig DataDevOps

Migration of Hbase Running on EMR

 Introduction Amazon Elastic Map Reduce is a managed platform. We can run big data frameworks like Apache Hadoop and Apache Spark on AWS to process and analyze large volumes of data.  We can process huge amounts of data for analytics purposes and business intelligence workloads with help of this framework. Amazon Elastic Map Reduce also […]

AnayticsB2BBig Data

GA4 Migration – Step up your Analytics Game

What’s up with this Google Analytics 4? Is it worth implementing? How exactly will it enhance our existing processes? How to get started with it?  Announcing GA4, Google had this to say: “To help you get better ROI from your marketing for the long term, we’re creating a new, more intelligent Google Analytics that builds […]