Home·Articles

Field notes from the road.

Writing from the people building Causeway: on contracts, lineage, governance, and the long work of paving a path our creators can actually walk.

Architecture·April 20, 20264 min

Dive into Caspian: understanding the modern data lake concept

Organizations generate data at accelerating rates and need robust systems to store, manage, and analyze it. Caspian is an illustrative concept for what a modern data lake looks like when it is built to last.

AD
Andrew Dean
Platform·April 20, 20263 min

Causeway: a federated data cloud powered by Caspian

Centralized architectures buckle under the scale and diversity of modern data. Causeway takes a federated approach, treating each business domain as an autonomous territory connected by a paved road.

AD
Andrew Dean
Engineering·April 20, 20264 min

Consider the terrain: common challenges in modern data systems

Engineers who anticipate common failure modes build more resilient infrastructure. Engineers who do not discover those failure modes in production.

AD
Andrew Dean
Engineering·April 20, 20264 min

How data pipelines work: architecture, patterns, and design

Raw data arrives in chaotic formats from dozens of sources. Structured insights power decisions. The pipeline bridges that gap by automating movement and transformation from origin to destination.

AD
Andrew Dean
Architecture·April 20, 20262 min

The best of both worlds: the rise of the lakehouse

Data lakes stored cheap, diverse data. Warehouses delivered the structure and performance analytics demanded. The lakehouse architecture merges both into a single platform without forcing a trade.

AD
Andrew Dean
Data Management·April 20, 20263 min

Beyond basic files: Parquet, Delta Lake, and Iceberg

Raw file storage alone fails to deliver analytical value. Parquet, Delta Lake, and Iceberg fill the gap at different layers of the stack, each doing one job well.

AD
Andrew Dean
Architecture·April 20, 20263 min

Beyond the banks: activating your data lake with Lakeshore Applications

Storing data delivers no value on its own. Lakeshore Applications operate at the perimeter of your data lake, converting stored bytes into decisions, analytics, and product surfaces.

AD
Andrew Dean
Platform·April 14, 20265 min

Introducing Causeway: the paved path for our data creators

A first look at why we are building Causeway, what it replaces, and the operator principles that will shape every release.

AD
Andrew Dean
Architecture·April 04, 20268 min

Beyond basic files: why a lakehouse contract beats a CSV on a share drive

Shared folders have carried us this far. They will not carry us any further. A case for contract-first datasets as the new default unit of work.

AD
Andrew Dean
Engineering·March 29, 20267 min

Promoting a dataset from Silver to Gold: a contract-first playbook

The exact freshness, quality, and retention checks we run before a dataset earns the Gold pill, and why we refuse to ship without them.

MK
Maya Kapoor
Architecture·March 21, 202612 min

Semantic lineage over the lake: tracing meaning, not bytes

Bytes-level lineage lies. Here is how Causeway reconstructs intent across dbt models, Snowflake views, and downstream metrics.

RT
Rafael Torres
Governance·March 04, 20266 min

Classification standards that do not rot: RAG the whole way down

Restricted, Internal, Public: three tiers, one masking policy each. Why fewer rules, applied harder, beats a taxonomy.

SL
Sana Lindqvist