A business’s ability to leverage digital technology and innovate in a rapidly changing business environment is crucial for success. Digitally mature companies are growing at a rapid pace. Leveraging cutting-edge digital technologies, they are able to transform business models and operational processes. Legacy companies have the fear to...
Digital data is growing at an astounding rate, outpacing our ability to analyze it. With the proliferation of smartphones, network connectivity, smart devices, and IoT, the variety and volume of both structured and unstructured data have grown manifold, turning big data into bigger and bigger data. A study by IDC echoes that “By...
Video is the future of content marketing and reaches a large audience in no time. The digital media environment is evolving rapidly, and the increasing use of smartphones and tablets has changed the consumption habits of audiences across the globe. Not only are social networking sites swamped with videos; even the...
Offline, in-store, mobile, website, and many more: customers have multiple touchpoints through which to interact with a brand. Customers expect a positive experience across these channels. Brands are striving hard to provide a superior multi-channel customer experience. However, analyzing the customer’s mindset and improving the experience requires...
You have probably come across terms such as “machine learning” (or “data mining”), “cognitive intelligence”, “big data”, or “data science” and wondered what they actually mean. You might have used a tool or package at work for data analytics, data modeling, etc., without really knowing the underlying principles. Machine...
Hashing is one of the core concepts we are introduced to when we start out as programmers. Whether in data structures or in the simple notion of an object, hashing has a role to play everywhere. But when it comes to Big Data, the hashing mechanism, like everything else, is exposed to some challenges which we generally don’t...
In this post, I will present a technical “deep-dive” into Spark internals, including RDDs and Shared Variables. If you want to know more about Spark and Spark setup on a single node, please refer to the previous posts of the Spark series, including Spark 1O1 and Spark 1O2. Resilient Distributed Datasets (RDD) - An RDD is the primary abstraction...
Big Data in itself brings many challenges, as is the case with anything related to data. Predictive analysis is one area that takes up considerable effort and attention. One of the foremost challenges one comes across is how to get started with the "subject". I would first like to highlight the basic things which one must know, to...
This is the second blog of the Spark series. This post covers the setup of a Spark environment, followed by a small word count program. The idea behind the blog is to get hands-on with Spark setup and with running a simple program on Spark. If you want to know more about Spark's history and its comparison with Hadoop, please refer to Spark 1o1. ...
We at IntelliGrape divide Big Data into four major sectors, which we commonly refer to as the 4C's of Big Data. These 4C's are: Capture (Data Ingestion), Contain (Data Persistence, NoSQL), Compute (Data Processing), and Comprehend (Data Analytics and Visualization). In this blog, I'll be focusing on the last point, i.e....
Overview: The big data space has been evolving continuously, and each day more technologies are added to the ecosystem. Hadoop Hive is one of the technologies that has been around for a long time. It provides a SQL wrapper for querying data on Hadoop. It also has some optimization techniques built in. Through this blog, I thought...
A Brief History of Hadoop: Hadoop was created by Doug Cutting, the creator of Apache Lucene, the widely used text search library. Hadoop has its origins in Apache Nutch, an open source web search engine, itself a part of the Lucene project. The Origin of the Name “Hadoop”: Hadoop is not an acronym; it’s a made-up name. The...