So you have spent some odd years of your software development career and now you know many of those RDBMS implementations in and out. In fact, you also already know that RDBMS is not the only enterprise storage and due to frequent scalability issues you encountered, someday you found about Big Data tools. Chances are […]
#fame is India’s first (and now the biggest) live-streaming app on IOS and Android platforms. This app allows people to create their own beam and go live immediately, or book a slot for future. As time passed, the operational databases of #fame kept on increasing at a great speed. As a result, the disk space […]
Big DataDigital AnalyticsIndustry Buzz
Mobile phone penetration worldwide is at 62.9%, which is more than half of the world’s population (Statista, n.d.). With the rise of technological advancements, we cannot deny the fact that our lives have been transformed from a television-driven generation to multiple medium owners such as the desktop, laptop, tablet, and the phone. In the 2016 […]
I know it’s too late, almost 2 weeks, since the GIDS16 (Great Indian Developer Summit 2016) ended – But the thought that “it’s better to be late than never” is motivating me to write down this post. It was a 5 days event – initial 4 days in Bengaluru and the last day in Pune. […]
Big DataManual TestingTechnology
We have entered into a digital world where data is scaling every single second. Every digital process and social media platform produce it. Systems, sensors, and mobile devices convey it. The amount of data that’s being created and stored globally is almost inconceivable, and it just keeps on multiplying and multiplying. Is all the data […]
Introduction : The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Not having to distinguish between a Master and a Slave node allows you to add any number of machines to any cluster in any datacenter, without having to worry about what type of machine you need […]
Overview This blog is related to the Yarn Cluster Optimizations for executing the spark jobs on yarn cluster. In this blog post I will be discussing about the YARN Optimizations for the efficient utilization of available resources to execute the spark jobs on yarn cluster. These optimization configurations could be done either in the config […]
Background Recently (in a project) we were required to determine the master node from a pool of similar type of nodes. And if master node fails, any other node should take on the responsibility – so that the service remains available. So, the use-case was something like – Only single node should behave as a […]
Objective My purpose to write this blog post is just to briefly introduce you to Apache Curator framework, esp. in case you have been using Apache ZooKeeper Java API but never tried Curator framework. In such case, I strongly recommend you to try Curator. Difficulties while using ZooKeeper API Handling the client/server connections. Writing so […]