Scaling beyond limits: dbt, or data build tool, has become the cornerstone for organizations managing data transformations. From young start-ups to large corporations, dbt serves as the centerpiece for orchestrating data workflows. However, as organizations grow, the challenge of coordinating multiple requirements within a single dbt project becomes apparent. The solution? dbt Mesh, a pattern [...]
Blogs
Databricks Unity Catalog is a pioneering unified governance tool designed for managing data and AI on the Lakehouse. Unity Catalog is Databricks’ answer to data governance challenges. It assists organizations in maintaining control over all data assets, managing data access, ensuring data quality, and tracking lineage. // WHAT Databricks Unity Catalog offers the following capabilities: [...]
Data Streaming refers to the continuous process of transmitting, processing, and analyzing data in real-time as it is generated or received. It involves the constant flow of data from its source to its destination, where it can be processed, analyzed, and acted upon without significant delay, typically within milliseconds to seconds. Real-time data streaming is [...]
dbt Labs made several impressive new releases at the recent Coalesce 2023 conference. One of them is the dbt Explorer. This functionality gives you a clear and visually pleasing overview of your different dbt projects, all elements within them, and the different access levels. [...]
In the wake of the public introduction of ChatGPT, the field of artificial intelligence is experiencing a remarkable surge. Complex and large-scale AI solutions are being unveiled at a rapid pace. However, amid this era of innovation, it’s essential to revisit the fundamentals. In this white paper, our focus is on stripping away the complexity [...]
A 360-degree customer view is a comprehensive understanding of a customer, achieved by integrating data from various touchpoints and systems, providing insights into their preferences, behavior, and needs. // Challenges In today’s digitally driven world, the ability to deliver a personalized and seamless customer experience has become a crucial competitive advantage for businesses. However, several challenges stand [...]
Notebooks are widely used in data science and machine learning to develop code and present the results. Databricks notebooks support real-time collaboration with colleagues and let you create data science and machine learning workflows in multiple languages, with built-in data visualizations. // WHAT Databricks notebooks can be used for: Developing code using Python, SQL, [...]
In one of our previous blog posts we discussed one of the most vital tools in the modern data stack, dbt. Once you have chosen to use dbt, two questions remain: which version should I use, and what is the difference between the two? In this blog post we will dive into the differences [...]
With row-level security (RLS), you can restrict data access for given users at the row level, and you can also define filters within roles. With the new update (keep in mind that it is still in preview), you can now quickly and easily define row-level security roles and [...]
With this new feature, you can now remove all the filters you applied in just one click. You no longer waste time removing them one by one. // HOW The Clear all slicers button can be added in three ways: From the Optimize ribbon => Clear all slicers button From the insert [...]