This post explains the best practices Scala libraries should follow. Here are the most important best practices to follow: Great README Clearly defined API Accessible API documentation Clean JAR file […]

Spark DataFrame columns support maps, which are great for key / value pairs with an arbitrary length. This blog post describes how to create MapType columns, demonstrates built-in functions to […]

Apache Spark is a big data engine that has quickly become one of the biggest distributed processing frameworks in the world. It’s used by all the big financial institutions and […]