Blog posts around Big Data | TO THE NEW Blog

What is Amazon Redshift and why you should definitely use it?

So you have spent some odd years of your software development career and now you know many of those RDBMS implementations in and out. In fact, you also already know that RDBMS is not the only enterprise storage and due to frequent scalability issues you encountered, someday you found about Big Data tools. Chances are […]

Ajay Sharma September 26, 2016

Read

Big Data Technology

DataSafe – A Data Archival Tool

#fame is India’s first (and now the biggest) live-streaming app on IOS and Android platforms. This app allows people to create their own beam and go live immediately, or book a slot for future. As time passed, the operational databases of #fame kept on increasing at a great speed. As a result, the disk space […]

Rohan Kalra August 16, 2016

Read

Big Data Digital Analytics Industry Buzz

Your Mobile Phone’s Power to Make or Break Your Favorite Brand’s Future

Mobile phone penetration worldwide is at 62.9%, which is more than half of the world’s population (Statista, n.d.). With the rise of technological advancements, we cannot deny the fact that our lives have been transformed from a television-driven generation to multiple medium owners such as the desktop, laptop, tablet, and the phone. In the 2016 […]

Regine Remolano Villanueva August 1, 2016

Read

Big Data Industry Buzz Java/JVM

Our experience through GIDS16 in Bengaluru

I know it’s too late, almost 2 weeks, since the GIDS16 (Great Indian Developer Summit 2016) ended – But the thought that “it’s better to be late than never” is motivating me to write down this post. It was a 5 days event – initial 4 days in Bengaluru and the last day in Pune. […]

Salil Kumar May 18, 2016

Read

Big Data Manual Testing Technology

Scope of Testing in Big Data and Hadoop

We have entered into a digital world where data is scaling every single second. Every digital process and social media platform produce it. Systems, sensors, and mobile devices convey it. The amount of data that’s being created and stored globally is almost inconceivable, and it just keeps on multiplying and multiplying. Is all the data […]

Rachit Sethi April 28, 2016

Read

Big Data

Cassandra Authentication and Create User

Introduction : The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Not having to distinguish between a Master and a Slave node allows you to add any number of machines to any cluster in any datacenter, without having to worry about what type of machine you need […]

Rohit Verma January 16, 2016

Read

Big Data Technology

Yarn Cluster Optimization for Spark Jobs

Overview This blog is related to the Yarn Cluster Optimizations for executing the spark jobs on yarn cluster. In this blog post I will be discussing about the YARN Optimizations for the efficient utilization of available resources to execute the spark jobs on yarn cluster. These optimization configurations could be done either in the config […]

Rohit Verma November 10, 2015

Read

Big Data Java/JVM

ZooKeeper Leader-Election simplified

Background Recently (in a project) we were required to determine the master node from a pool of similar type of nodes. And if master node fails, any other node should take on the responsibility – so that the service remains available. So, the use-case was something like – Only single node should behave as a […]

Salil Kumar October 28, 2015

Read

Big Data Java/JVM

Curator Framework for Apache ZooKeeper

Objective My purpose to write this blog post is just to briefly introduce you to Apache Curator framework, esp. in case you have been using Apache ZooKeeper Java API but never tried Curator framework. In such case, I strongly recommend you to try Curator. Difficulties while using ZooKeeper API Handling the client/server connections. Writing so […]

Salil Kumar October 27, 2015

Read

Tips for writing a blog

Learn how to write a caption