AWSBig DataDevOps

What is Amazon Redshift and why you should definitely use it?

So you have spent some odd years of your software development career and now you know many of those RDBMS implementations in and out. In fact, you also already know that RDBMS is not the only enterprise storage and due to frequent scalability issues you encountered, someday you found about Big Data tools. Chances are […]

Ajay Sharma
Ajay Sharma
Read

Big DataTechnology

DataSafe – A Data Archival Tool

#fame is India’s first (and now the biggest) live-streaming app on IOS and Android platforms. This app allows people to create their own beam and go live immediately, or book a slot for future. As time passed, the operational databases of #fame kept on increasing at a great speed. As a result, the disk space […]

Rohan Kalra
Rohan Kalra
Read

Big DataDigital AnalyticsIndustry Buzz

Your Mobile Phone’s Power to Make or Break Your Favorite Brand’s Future

Mobile phone penetration worldwide is at 62.9%, which is more than half of the world’s population (Statista, n.d.). With the rise of technological advancements, we cannot deny the fact that our lives have been transformed from a television-driven generation to multiple medium owners such as the desktop, laptop, tablet, and the phone. In the 2016 […]

Big DataIndustry BuzzJava/JVM

Our experience through GIDS16 in Bengaluru

I know it’s too late, almost 2 weeks, since the GIDS16 (Great Indian Developer Summit 2016) ended – But the thought that “it’s better to be late than never” is motivating me to write down this post. It was a 5 days event – initial 4 days in Bengaluru and the last day in Pune. […]

Big DataManual TestingTechnology

Scope of Testing in Big Data and Hadoop

We have entered into a digital world where data is scaling every single second. Every digital process and social media platform produce it. Systems, sensors, and mobile devices convey it. The amount of data that’s being created and stored globally is almost inconceivable, and it just keeps on multiplying and multiplying. Is all the data […]

Big Data

Cassandra Authentication and Create User

Introduction : The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Not having to distinguish between a Master and a Slave node allows you to add any number of machines to any cluster in any datacenter, without having to worry about what type of machine you need […]

Rohit Verma
Rohit Verma
Read

Big DataTechnology

Yarn Cluster Optimization for Spark Jobs

Overview This blog is related to the Yarn Cluster Optimizations for executing the spark jobs on yarn cluster. In this blog post I will be discussing about the YARN Optimizations for the efficient utilization of available resources to execute the spark jobs on yarn cluster. These optimization configurations could be done either in the config […]

Rohit Verma
Rohit Verma
Read

Big DataJava/JVM

ZooKeeper Leader-Election simplified

Background Recently (in a project) we were required to determine the master node from a pool of similar type of nodes. And if master node fails, any other node should take on the responsibility – so that the service remains available. So, the use-case was something like – Only single node should behave as a […]

Salil Kumar
Salil Kumar
Read

Big DataJava/JVM

Curator Framework for Apache ZooKeeper

Objective My purpose to write this blog post is just to briefly introduce you to Apache Curator framework, esp. in case you have been using Apache ZooKeeper Java API but never tried Curator framework. In such case, I strongly recommend you to try Curator. Difficulties while using ZooKeeper API Handling the client/server connections. Writing so […]

Salil Kumar
Salil Kumar
Read