info@sparkhadoop.com +91-8056293569
A brief revision of my vision

Big Data and Hadoop course has been designed by a team of highly experienced industry professionals to provide in-depth knowledge and skills to the learner in order to become a successful Hadoop Developer. The complete curriculum extensively covers all the topics required to gain an expertise in Hadoop Ecosystem. Course Highlights 80 hours of instructor […]

Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark’s standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive […]

Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark’s standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive […]

Big Data and Hadoop course has been designed by a team of highly experienced industry professionals to provide in-depth knowledge and skills to the learner in order to become a successful Hadoop Developer. The complete curriculum extensively covers all the topics required to gain an expertise in Hadoop Ecosystem. Course Highlights 80 hours of instructor […]

blog

Is MapReduce coming to an end? DataBricks recently published their benchmarks results for sorting 100 TB of Data over AWS Ec2 Machines, The results have clearly proven it as general purpose distributed processing framework which is meant for both in-memory and on-disk. They have used recent version of Apache Spark(Spark 1.1).

Big Data and Hadoop course has been designed by a team of highly experienced industry professionals to provide in-depth knowledge and skills to the learner in order to become a successful Hadoop Developer. The complete curriculum extensively covers all the topics required to gain an expertise in Hadoop Ecosystem. Course Highlights A Single Course covers […]

Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark’s standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive […]

blog
Sharding in MongoDB
11th January 2015

MongoDB is one of several database types to arise in mid 2000’s under the banner of NoSQL. Rather than using the rows and columns as in the relational databases, MongoDB is built over architecture of collections and documents. Documents contain sets of Key-Value pairs and are the basic unit of data in mongoDB. Collections contain […]