Databricks

All content related to Databricks

News

Simplifying CDC from snapshots in Databricks Lakeflow Spark Declarative Pipelines

In data engineering, teams often receive daily full snapshots of data from legacy systems or third-party sources. Traditionally, ingesting these periodic snapshots and identifying what changed (inserts, updates, deletes) each day has been cumbersome. Engineers need to write and maintain source-specific Change Data Capture (CDC) logic for each data source. Databricks has introduced Lakeflow Spark Declarative […]

News

Databricks Apps: Bridging the Gap Between Data and Operations

The data industry has been focused on optimizing backend infrastructure, centralizing information into Data Lakes and Warehouses to break down data silos. While the ability to report on this data has improved significantly, we’ve encountered a “last mile” problem. Although data is easily accessible for analysis, delivering it into actionable, interactive user experiences has proven […]

News

Adding automation to Databricks and dbt with VaultSpeed

We’ve already established the synergy of dbt & Databricks. However, adding data warehouse automation to the mix might be the missing link in the story. The combination of dbt & Databricks provides a robust solution for streamlining data workflows and enhancing data lakehouse architectures. It leverages Databricks’ versatility in data processing and machine learning with […]

News

A unified platform with Databricks & dbt

Let’s talk about dbt (Data Build Tool) and Databricks. In previous posts we covered dbt and Databricks on their own, but what if we used these tools together? In this blog post, we’ll explore why using dbt and Databricks together is a great choice and will simplify […]

News

Unified governance solution with Databricks Unity Catalog

Databricks Unity Catalog is a pioneering unified governance tool designed for managing data and AI on the Lakehouse. Unity Catalog is Databricks’ answer to data governance challenges. It helps organizations maintain control over all data assets, manage data access, ensure data quality, and track lineage. // WHAT Databricks Unity Catalog offers the following capabilities: […]

News

Collaborate & Create with Databricks Notebooks

Notebooks are widely used in data science and machine learning to develop code and present results. Databricks notebooks support real-time collaboration with colleagues and let you create data science and machine learning workflows in multiple languages, with built-in data visualizations. // WHAT Databricks notebooks can be used for: Developing code using Python, SQL, […]