Data lake: how Red Hat maintains data quality across multiple Drupal sites

Data accuracy and consistency is an important goal for any organization.

Maintaining data quality across multiple websites and applications (Drupal or otherwise), with different teams managing the same data in multiple systems, becomes complex and difficult to manage. Having a pool of data becomes an attractive solution to resolve some of these issues and allow for greater transparency and consistency across an organization. But, creating a scalable, reliable, and useful system brings its own challenges.

Join us, as we explore several ways that Red Hat is using a data lake architecture to share data between different Drupal sites.

We’ll cover:

What is a data lake?
The benefits, challenges, and considerations of using a data lake.
Several ways Red Hat has integrated a data lake architecture with Drupal.
Lessons learned along the way.

April Sides
Asheville, NC

I am a Senior Software Engineer (Drupal Back-end Developer) at Red Hat. My hobbies are saying "yes" to too many volunteer opportunities and going on "adventures" with my step-granddaughter. I am a philosopher in the void.

Melissa Bent
Boise, ID

Melissa Bent is a Senior Software Developer focusing on Drupal at Red Hat. She's been working with Drupal for 16 years and built her first website in 1996 in Notepad. When she's not at work, she takes her dogs outside to visit her chickens and goats. Talk to her about 3D printing, music, or plants. Especially plants.

Drupal is a registered trademark of Dries Buytaert.