Tech Talk

1,000+ Pipelines:

Decodable & Iceberg Power Medidata's Real-time Insights

On Demand
Mike Araujo
Staff Engineer, Medidata Solutions
Sharon Xie
Head of Product, Decodable

Real-time, actionable insights can transform clinical trials and operations. Yet outdated technology and complex data pipeline workflows often get in the way. Medidata tackled these obstacles by bringing continuous data processing with Decodable into their data lake backed by Apache Iceberg. Join Mike Araujo, Staff Engineer at Medidata, and Sharon Xie, Head of Product at Decodable, as they share how Medidata transitioned from managing cumbersome, decentralized batch pipelines to implementing a centralized, scalable, real-time data platform with Decodable.

In this session, you’ll learn how Medidata reduced engineering burdens, simplified Apache Iceberg integration, and empowered data teams to self-serve thousands of pipelines, delivering fresh, actionable insights in minutes instead of days.


Key takeaways include:

  • Unified approach: Explore how Medidata moved thousands of siloed batch pipelines to a centralized, scalable platform powered by Flink.
  • Continuous processing for Iceberg: Learn how Medidata leverages Decodable to continuously cleanse, transform, and format the data in Iceberg to provide gold-level data sets in real time and at scale.
  • Remove infrastructure blockers: Learn how Decodable’s BYOC deployment ensures data sovereignty and provides full visibility through integration with Medidata’s observability system.
  • Faster data, faster insights: Discover how Medidata reduced latency, enabling their data consumers to access critical insights in minutes.
  • Accelerate time-to-value: See how Medidata’s data analysts, scientists, and engineers built self-service pipelines to improve productivity and focus on innovation.