2023#01 Readings

    4 minutes read | 769 words

2022#25 Readings 🇺🇦🌻

    5 minutes read | 970 words

2022#22 Readings 🇺🇦🌻

    2 minutes read | 266 words

2022#21 Readings 🇺🇦🌻

    3 minutes read | 503 words

2022#19 Readings 🇺🇦🌻

    3 minutes read | 567 words

2022#17 Readings 🇺🇦🌻

    2 minutes read | 383 words

The Presto paper

    4 minutes read | 843 words

2022#16 Readings 🇺🇦🌻

    4 minutes read | 722 words

Ray: Another way to distribute work in a cluster

    8 minutes read | 1548 words

2022#15 Readings 🇺🇦🌻

    3 minutes read | 519 words

2022#14 Readings 🇺🇦🌻

    3 minutes read | 538 words

2022#13 Readings 🇺🇦🌻

    4 minutes read | 690 words

2022#11 Readings 🇺🇦🌻

    3 minutes read | 600 words

Apache Druid: analytical queries powered by magic

    6 minutes read | 1085 words

2022#10 Readings 🇺🇦🌻

    4 minutes read | 726 words

2022#09 Readings 🇺🇦🌻

    5 minutes read | 911 words

Winning stakeholders' trust

    4 minutes read | 751 words

2022#07 Readings 🇺🇦🌻

    4 minutes read | 647 words

2022#06 Readings

    4 minutes read | 665 words

2022#04 Readings

    5 minutes read | 926 words

2022#03 Readings

    4 minutes read | 682 words

2021#02 Readings

    5 minutes read | 1041 words

2021#27 Readings

    6 minutes read | 1074 words

2021#26 Readings

    7 minutes read | 1444 words

Data pipelines with Alloy, Take 2

    7 minutes read | 1407 words

Docker replacements (particularly in Mac M1)

    1 minute read | 210 words

2021#23 Readings

    5 minutes read | 974 words

Concept Maps

    4 minutes read | 852 words

2021#22 Readings

    4 minutes read | 674 words

2021#21 Readings

    4 minutes read | 772 words

Setting up Kafka with SSL and accessing it with Go

    5 minutes read | 948 words

JSON woes in Apache Spark

    4 minutes read | 667 words

2021#20 Readings

    5 minutes read | 1046 words

2020#19 Readings

    4 minutes read | 771 words

2021#18 Readings

    4 minutes read | 694 words

2021#16 Readings

    3 minutes read | 498 words

2020#14 Readings

    5 minutes read | 906 words

Modelling data pipelines with Alloy

    9 minutes read | 1783 words

2021#10 Readings

    4 minutes read | 709 words

UTF-8 Issues between AWS Redshift and Apache Spark when COPY PARQUET

    2 minutes read | 343 words

2021#09 Readings

    4 minutes read | 733 words

2021#08 Readings

    3 minutes read | 538 words

2020#04 Readings

    4 minutes read | 672 words

Lakehouse: It's like Delta Lake, but not really

    5 minutes read | 1041 words

2021#03 Readings

    4 minutes read | 803 words

Down memory lane: the Hive paper

    6 minutes read | 1124 words

2021#02 Readings

    5 minutes read | 915 words

2021#01 Readings

    4 minutes read | 690 words

Configuring log4j properties in Databricks (and EMR)

    4 minutes read | 648 words

Programmatic adtech industry: where to?

    9 minutes read | 1871 words

2020#65 Readings

    4 minutes read | 851 words

2020#64 Readings

    4 minutes read | 774 words

2020#63 Readings

    4 minutes read | 649 words

Find-the-gap with SQL in AWS Redshift

    3 minutes read | 632 words

2020#62 Readings

    3 minutes read | 602 words

2020#61 Readings

    4 minutes read | 774 words

The RDD paper: introducing the Spark general purpose framework

    9 minutes read | 1909 words

2020#60 Readings

    4 minutes read | 796 words

2020#59 Readings

    4 minutes read | 764 words

2020#58 Readings

    4 minutes read | 739 words

Databricks' Delta Lake: high on ACID

    15 minutes read | 3024 words

2020#57 Readings

    5 minutes read | 976 words

Running SparkSQL on Databricks via Airflow's JDBC operator

    4 minutes read | 682 words

Does Snowflake have a technical moat worth 60 billion?

    15 minutes read | 3032 words

2020#56 Readings of the Week

    4 minutes read | 832 words

2019#24 Readings of the Week

    3 minutes read | 463 words

2019#23 Readings of the Week

    4 minutes read | 714 words

2019#20,21,22 Readings of the week

    3 minutes read | 625 words

2019#19 Readings of the week

    4 minutes read | 660 words

A (section) of a map of the data engineering space

    11 minutes read | 2151 words

2019#18 Readings of the week

    2 minutes read | 355 words

2019#17 Readings of the week

    2 minutes read | 334 words

2019#16 Readings of the week

    3 minutes read | 435 words

2019#15 Readings of the week

    3 minutes read | 439 words

2019#14 Readings of the week

    4 minutes read | 648 words

2019#9 Readings of the week (x4)

    9 minutes read | 1891 words

2019#8 Readings of the week

    3 minutes read | 427 words

2019#7 Readings of the week

    3 minutes read | 448 words

Apache Hive and java.lang.ClassCastException on start

    2 minutes read | 247 words

2019#6 Readings of the week

    3 minutes read | 550 words

2019#1 Readings of the week

    4 minutes read | 648 words

2016 in Review

    6 minutes read | 1230 words

Ruben Berenguel, PhD

    1 minute read | 147 words

Find Search Engine Rankings... via the Command Line

    3 minutes read | 523 words