Docker replacements (particularly in Mac M1)

    1 minutes read | 210 words

2021#23 Readings

    5 minutes read | 974 words

Concept Maps

    4 minutes read | 834 words

2021#22 Readings

    4 minutes read | 674 words

2021#21 Readings

    4 minutes read | 772 words

Setting up Kafka with SSL and accessing it with Go

    5 minutes read | 948 words

JSON woes in Apache Spark

    4 minutes read | 667 words

2021#20 Readings

    5 minutes read | 1046 words

2020#19 Readings

    4 minutes read | 771 words

2021#18 Readings

    4 minutes read | 694 words

2021#16 Readings

    3 minutes read | 498 words

2020#14 Readings

    5 minutes read | 906 words

Modelling data pipelines with Alloy

    9 minutes read | 1783 words

2021#10 Readings

    4 minutes read | 709 words

UTF-8 Issues between AWS Redshift and Apache Spark when COPY PARQUET

    2 minutes read | 343 words

2021#09 Readings

    4 minutes read | 733 words

2021#08 Readings

    3 minutes read | 538 words

2020#04 Readings

    4 minutes read | 672 words

Lakehouse: It's like Delta Lake, but not really

    5 minutes read | 1041 words

2021#03 Readings

    4 minutes read | 803 words

Down memory lane: the Hive paper

    6 minutes read | 1124 words

2021#02 Readings

    5 minutes read | 915 words

2021#01 Readings

    4 minutes read | 690 words

Configuring log4j properties in Databricks (and EMR)

    4 minutes read | 648 words

Programmatic adtech industry: where to?

    9 minutes read | 1871 words

2020#65 Readings

    4 minutes read | 851 words

2020#64 Readings

    4 minutes read | 774 words

2020#63 Readings

    4 minutes read | 649 words

Find-the-gap with SQL in AWS Redshift

    3 minutes read | 632 words

2020#62 Readings

    3 minutes read | 602 words

2020#61 Readings

    4 minutes read | 774 words

The RDD paper: introducing the Spark general purpose framework

    9 minutes read | 1909 words

2020#60 Readings

    4 minutes read | 796 words

2020#59 Readings

    4 minutes read | 764 words

2020#58 Readings

    4 minutes read | 739 words

Databricks' Delta Lake: high on ACID

    15 minutes read | 3024 words

2020#57 Readings

    5 minutes read | 976 words

Running SparkSQL on Databricks via Airflow's JDBC operator

    4 minutes read | 682 words

Does Snowflake have a technical moat worth 60 billion?

    15 minutes read | 3032 words

2020#56 Readings of the Week

    4 minutes read | 832 words

2019#24 Readings of the Week

    3 minutes read | 464 words

2019#23 Readings of the Week

    4 minutes read | 715 words

2019#20,21,22 Readings of the week

    3 minutes read | 626 words

2019#19 Readings of the week

    4 minutes read | 661 words

A (section) of a map of the data engineering space

    11 minutes read | 2151 words

2019#18 Readings of the week

    2 minutes read | 356 words

2019#17 Readings of the week

    2 minutes read | 335 words

2019#16 Readings of the week

    3 minutes read | 436 words

2019#15 Readings of the week

    3 minutes read | 440 words

2019#14 Readings of the week

    4 minutes read | 649 words

2019#9 Readings of the week (x4)

    9 minutes read | 1892 words

2019#8 Readings of the week

    3 minutes read | 428 words

2019#7 Readings of the week

    3 minutes read | 449 words

Apache Hive and java.lang.ClassCastException on start

    2 minutes read | 247 words

2019#6 Readings of the week

    3 minutes read | 551 words

2019#1 Readings of the week

    4 minutes read | 648 words

2016 in Review

    6 minutes read | 1230 words

Ruben Berenguel, PhD

    1 minutes read | 147 words

Find Search Engine Rankings... via the Command Line

    3 minutes read | 523 words