Published inData Engineer ThingsData Lakehouse or Data Mesh: Which One is Right for Your Data Strategy?When designing a modern data architecture, organizations often face a key decision: should they adopt a Data Lakehouse or embrace the…Jan 7Jan 7
Published inData Engineer ThingsBuilding Machine Learning Pipelines with the FTI Architecture: A Practical Step-by-Step GuideDesigning Scalable ML Pipelines with the FTI Architecture: A Step-by-Step Guide for Feature Engineering, Training, and Real-Time InferenceDec 2, 2024Dec 2, 2024
Published inData Engineer Thingsdbt Core and BigQuery: A Complete Guide to Automating Data Transformations with GitHub CI/CDStreamline your data transformations with dbt Core and BigQuery. A complete guide for setting up CI/CD and managing SQL workflows.Nov 16, 2024Nov 16, 2024
Published inData Engineer ThingsImplementing Change Data Capture (CDC) with Kafka Connect and Apache Kafka on Kubernetes: A…End-to-End CDC Pipeline on Kubernetes: Integrating MySQL, Kafka Connect, Apache Kafka and FastAPI for Real-Time Data StreamingNov 12, 20242Nov 12, 20242
Published inData Engineer ThingsCloud-Native Data Engineering: Orchestrating Spark on Kubernetes with Custom Airflow Operator and…Deploying Apache Airflow on Kubernetes with Custom Airflow Operator integrated with Apache Spark and Google Cloud StorageNov 9, 2024Nov 9, 2024
Running Apache Spark on Kubernetes Using Spark OperatorUsing Kubernetes Operator to run Apache SparkNov 7, 2024Nov 7, 2024