Saturday, March 23, 2019

0 comments
RDD (Resilient Distributed Data set) is the core of Apache Spark. It is the fundamental data structure on top of which all the spark com...

Thursday, March 21, 2019

0 comments
You can find enough theory on internet related to Big Data but theories won’t give you real hands on experience, for that you need a plat...

Wednesday, March 20, 2019

0 comments
If you are beginner in Spark you must be confused about starting with PySpark or Scala Spark. I was in the same situation couple of yea...

Sunday, March 17, 2019

0 comments
Apache Spark is a cluster computing system. It is lightning fast in-memory* parallel processing engine. Though it is based on Hadoop Map-...

0 comments
Google has introduced a new dimension to the world of    Machine learning and Artificial Intelligence. Its smart devices are capable of...