For some time I did not update the blog, so I want to add a small repository in github on how to create a parquet file using the spark-csv library of databricks. Super easy to use, so simple that the code is self-explanatory.

https://github.com/alonsoir/parquet-utils

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s