2022#13 Readings 🇺🇦🌻

    4 minutes read | 690 words

2022#12 Readings 🇺🇦🌻

    3 minutes read | 564 words

2022#11 Readings 🇺🇦🌻

    3 minutes read | 600 words

2022#09 Readings 🇺🇦🌻

    5 minutes read | 911 words

2022#08 Readings 🇺🇦🌻

    3 minutes read | 573 words

2022#04 Readings

    5 minutes read | 926 words

2022#03 Readings

    4 minutes read | 682 words

2021#22 Readings

    4 minutes read | 674 words

2021#21 Readings

    4 minutes read | 772 words

Setting up Kafka with SSL and accessing it with Go

    5 minutes read | 948 words

JSON woes in Apache Spark

    4 minutes read | 667 words

2021#20 Readings

    5 minutes read | 1046 words

2020#14 Readings

    5 minutes read | 906 words

2021#12 Readings

    4 minutes read | 796 words

UTF-8 Issues between AWS Redshift and Apache Spark when COPY PARQUET

    2 minutes read | 343 words

2021#07 Readings

    3 minutes read | 572 words

2021#06 Readings

    3 minutes read | 603 words

2021#05 Readings

    4 minutes read | 823 words

2020#04 Readings

    4 minutes read | 672 words

Lakehouse: It's like Delta Lake, but not really

    5 minutes read | 1041 words

2021#03 Readings

    4 minutes read | 803 words

Down memory lane: the Hive paper

    6 minutes read | 1124 words

2021#02 Readings

    5 minutes read | 915 words

2021#01 Readings

    4 minutes read | 690 words

Configuring log4j properties in Databricks (and EMR)

    4 minutes read | 648 words

2020#66 Readings

    3 minutes read | 619 words

2020#64 Readings

    4 minutes read | 774 words

2020#62 Readings

    3 minutes read | 602 words

2020#61 Readings

    4 minutes read | 774 words

The RDD paper: introducing the Spark general purpose framework

    9 minutes read | 1909 words

2020#60 Readings

    4 minutes read | 796 words

2020#59 Readings

    4 minutes read | 764 words

Databricks' Delta Lake: high on ACID

    15 minutes read | 3024 words

2020#57 Readings

    5 minutes read | 976 words

Running SparkSQL on Databricks via Airflow's JDBC operator

    4 minutes read | 682 words

2020#55 Readings of the Week

    4 minutes read | 845 words

2020#54 Readings of the Week

    5 minutes read | 896 words

2020#51 Readings of the Week

    6 minutes read | 1130 words

2020#50 Readings of the Week

    6 minutes read | 1198 words

2020#49 Readings of the Week

    5 minutes read | 942 words

2020#48 Readings of the Week

    8 minutes read | 1548 words

2020#47 Readings of the Week

    7 minutes read | 1366 words

2020#46 Readings of the Week

    5 minutes read | 900 words

2020#45 Readings of the Week

    5 minutes read | 882 words

2020#44 Readings of the Week

    5 minutes read | 915 words

2020#43 Readings of the Week

    4 minutes read | 682 words

2020#42 Readings of the Week

    3 minutes read | 598 words

2020#41 Readings of the Week

    4 minutes read | 713 words

2020#40 Readings of the Week

    3 minutes read | 566 words

2020#37 Readings of the Week

    6 minutes read | 1097 words

2019#36 Readings of the Week

    4 minutes read | 769 words

A (section) of a map of the data engineering space

    11 minutes read | 2151 words

2019#16 Readings of the week

    3 minutes read | 435 words

2019#15 Readings of the week

    3 minutes read | 439 words

2019#14 Readings of the week

    4 minutes read | 648 words

2019#13 Readings of the week

    3 minutes read | 550 words

2019#11 Readings of the week

    2 minutes read | 394 words

2019#9 Readings of the week (x4)

    9 minutes read | 1891 words

2019#4 Readings of the week

    3 minutes read | 570 words

2019#3 Readings of the week

    3 minutes read | 527 words

2018: Year in Review

    6 minutes read | 1154 words

Notifications from Spark on an Apple Watch (via IFTTT)

    3 minutes read | 428 words

Scala eXchange 2017

    7 minutes read | 1447 words

2017: Year in Review

    9 minutes read | 1743 words