Databricks Acquires Tabular to Bridge Apache Iceberg Gap

Published by: Insights Desk Released: Jun 05, 2024

Highlights:

Databricks stated that Delta Lake boasts over 500 code contributors and has been adopted by over 10,000 companies globally.
Delta Lake and Iceberg have been competing closely for dominance in the data lake market. Data lakes serve as centralized repositories for structured and unstructured data.

Databricks Inc. has acquired Tabular Technologies Inc., the creator of a universal storage platform built on the Apache Iceberg standard.

The move signals an intensified focus by Databricks on closing the compatibility divide between its Delta Lake storage format and Iceberg. Specific terms were not announced, but Databricks CEO Ali Ghodsi told a prominent media outlet that the acquisition price exceeded USD 1 billion. Snowflake Inc. and Confluent were reportedly also contenders in the bidding process.

Tabular was established by three ex-employees of Netflix Inc. who collaborated on the development of Iceberg during their tenure there. In 2020, the project was donated to the open-source community.

Databricks’ Delta Lake storage framework, launched in the same timeframe, shares similarities with Iceberg as both leverage Apache Parquet and uphold principles of atomicity, consistency, isolation, and durability in transactions. They offer scalable metadata management and integrate both streaming and batch data processing. Databricks reported that Delta Lake boasts over 500 code contributors and is adopted by over 10,000 companies globally.

Competition for Dominance

Delta Lake and Iceberg have been engaged in a tight competition for dominance in the data lake market. Data lakes serve as centralized storage for structured and unstructured data. According to Dremio Corp.’s 2024 State of the Data Lakehouse report, 31% of respondents currently use Apache Iceberg, while 39% use Delta Lake. Looking ahead, however, 29% anticipate adopting Iceberg over the next three years, compared to 23% for Delta Lake. SNS Insider Pvt Ltd. projects that the data lake market will grow at more than 21% annually, reaching USD 57 billion by 2030.

The competition between the two standards has posed challenges for both sides. In a blog post introducing the agreement, Tabular stated, “The problem isn’t about determining which standard is better. The problem is that the risk of investing in the wrong format prevents people from choosing at all.”

For the past two years, Databricks has been working to narrow the divide with Apache Iceberg. Last year’s release of Delta Lake 3.0 added Iceberg compatibility, and Iceberg support was also incorporated into the company’s UniForm universal lakehouse format the same year.

Progressing the Development of the Lakehouse

Consolidating the storage format is seen as a pivotal move in promoting the acceptance of data lakehouses, a term coined by Databricks to describe a blend of conventional data warehouses and data lakes. A lakehouse facilitates ACID transactions on data housed in object storage, ensuring robust reliability, performance, and compatibility with open-source engines like Apache Spark, Trino, and Presto.
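To make the idea of ACID transactions on object storage more concrete, here is a minimal toy sketch in Python. It is not the actual implementation of Delta Lake or Iceberg. It only illustrates one common approach, assumed here for illustration: table state is an ordered log of commits, and a writer succeeds only if it is the first to create the next log entry. The names ObjectStore, ToyTable, put_if_absent, and the file names are all hypothetical.

```python
class ObjectStore:
    """In-memory stand-in for object storage with an atomic put-if-absent."""

    def __init__(self):
        self._objects = {}

    def put_if_absent(self, key, value):
        # Only the first writer of a key succeeds; later writers get False.
        if key in self._objects:
            return False
        self._objects[key] = value
        return True

    def get(self, key):
        return self._objects.get(key)


class ToyTable:
    """Hypothetical table whose state is the replay of numbered log entries."""

    def __init__(self, store, name):
        self.store = store
        self.name = name

    def _log_key(self, version):
        return f"{self.name}/_log/{version:08d}.json"

    def latest_version(self):
        # Returns -1 for an empty table, otherwise the highest committed version.
        version = -1
        while self.store.get(self._log_key(version + 1)) is not None:
            version += 1
        return version

    def snapshot(self, version=None):
        # Replay the log up to `version` to get the set of live data files.
        if version is None:
            version = self.latest_version()
        files = set()
        for v in range(version + 1):
            entry = self.store.get(self._log_key(v))
            files -= set(entry["remove"])
            files |= set(entry["add"])
        return files

    def commit(self, read_version, add=(), remove=()):
        # Optimistic concurrency: succeed only if nobody else has already
        # written version read_version + 1.
        entry = {"add": list(add), "remove": list(remove)}
        return self.store.put_if_absent(self._log_key(read_version + 1), entry)


store = ObjectStore()
table = ToyTable(store, "events")
table.commit(-1, add=["part-0.parquet"])

# Two writers both start from the same version; only one commit can win.
seen = table.latest_version()
first_ok = table.commit(seen, add=["part-1.parquet"])
second_ok = table.commit(seen, add=["part-2.parquet"])
```

In this sketch the losing writer would have to re-read the latest version and retry, and readers always see a complete commit or none of it. Production table formats add much more, such as conflict detection, schema tracking, and metadata compaction.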

The lakehouse concept has gained significant traction due to its versatility and scalability. According to research released by Databricks last year, almost three-quarters of the 600 technology leaders surveyed have already embraced a lakehouse architecture, while the remaining respondents anticipate doing so within the next three years.

Ghodsi said in a statement, “Databricks and Tabular will work with the open-source community to bring the two formats closer to each other over time, increasing openness and reducing silos and friction for customers.”

In an interview with a media outlet, Matei Zaharia, co-founder and chief technology officer of Databricks, said that connecting the two formats will take several years. However, the collaboration between the creators of both standards represents a significant advancement.

The timing of the announcement was strategic, coinciding with the annual Data Cloud Summit user conference of Databricks’ competitor Snowflake. During the conference in San Francisco, Snowflake unveiled Polaris Catalog, an open catalog implementation that enables cross-engine access to Iceberg data and competes directly with Databricks’ Unity Catalog.
