DataFrames in Go with gota, qframe, and dataframe-go
Go has great DataFrame libraries that let you easily manipulate data that’s stored in CSV files and databases. Working with CSV files directly can be burdensome. DataFrames are easier because […]
Go has great DataFrame libraries that let you easily manipulate data that’s stored in CSV files and databases. Working with CSV files directly can be burdensome. DataFrames are easier because […]
This blog post shows how to convert a CSV file to Parquet with Pandas, Spark, PyArrow and Dask. It discusses the pros and cons of each approach and explains how […]
This blog post explains how to read data from a CSV file and write it out as a Parquet file. The Parquet file format is better than CSV for a […]
You can execute Spark column functions with a genius combination of expr and eval(). This technique lets you execute Spark functions without having to create a DataFrame. This makes it […]
The term “column equality” refers to two different things in Spark: When a column is equal to a particular value (typically when filtering) When all the values in two columns […]
MUnit is a test library with a collection of features that are not available in other Scala testing libraries. MUnit draws inspiration from Scalatest, utest, and ava (JavaScript testing library). […]
Scalatest makes it easy to test your Scala code. This blog post shows how to add Scalatest to a sbt project and write some basic tests. Writing a simple test […]