info@sparkhadoop.com +91-8056293569
A brief revision of my vision
blog

Is MapReduce coming to an end? DataBricks recently published their benchmarks results for sorting 100 TB of Data over AWS Ec2 Machines, The results have clearly proven it as general purpose distributed processing framework which is meant for both in-memory and on-disk. They have used recent version of Apache Spark(Spark 1.1).

blog
Sharding in MongoDB
11th January 2015

MongoDB is one of several database types to arise in mid 2000’s under the banner of NoSQL. Rather than using the rows and columns as in the relational databases, MongoDB is built over architecture of collections and documents. Documents contain sets of Key-Value pairs and are the basic unit of data in mongoDB. Collections contain […]