Category : Analytics

Basic descriptive statistics in Apache Spark

Update: this blog is migrated to Medium https://medium.com/spark-experts. To continue access good content, please subscribe it. Spark core module provides basic descriptive statistics operations for RDD of numeric data. More complex statistics operations are available in MLlib module which is beyond the scope of this post. The descriptive statistics operations are only available under a

Read More →

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 194 other subscribers