Confidential
Application Engineering
E-Commerce
➔ The client struggled to scale its catalog ingestion process as the number of retailers and the size of their catalogs grew.
➔ Keep catalog data up to date to give users a good shopping experience.
➔ Provide raw catalog and user demographic data to the Data Science team to generate personalized product recommendations.
➔ Minimize latency and optimize resource utilization while consuming millions of catalog products.
➔ Ensure fault tolerance and data consistency in end-to-end processing.
➔ Simplify the onboarding of new retailers to ease integration with the marketplace.
The architecture leveraged GCP’s managed services, along with open-source tools for data integration and orchestration. Below is an outline of the components:
GCP Composer was used to create a webhook-based job that extracts product catalog data from CSV files provided by the catalog provider(s) on a shared FTP server and loads it into a PostgreSQL database.
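The extraction step can be illustrated with a minimal Python sketch. It is not the client's implementation: the column names (sku, title, price), the catalog_products table, and the unique constraint on (retailer_id, sku) are all assumptions made for illustration. The sketch validates one retailer's CSV feed and prepares rows for an idempotent upsert, so re-running the job on the same file does not create duplicates.

```python
import csv
import io
from decimal import Decimal, InvalidOperation

# Hypothetical column names; the real feed layout is not given in the case study.
REQUIRED_COLUMNS = ("sku", "title", "price")

# Hypothetical table; assumes a unique constraint on (retailer_id, sku).
UPSERT_SQL = (
    "INSERT INTO catalog_products (retailer_id, sku, title, price) "
    "VALUES (%s, %s, %s, %s) "
    "ON CONFLICT (retailer_id, sku) DO UPDATE "
    "SET title = EXCLUDED.title, price = EXCLUDED.price"
)


def parse_catalog_csv(text, retailer_id):
    """Return (valid_rows, rejected_line_numbers) for one retailer's CSV feed."""
    reader = csv.DictReader(io.StringIO(text))
    missing = [c for c in REQUIRED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")

    rows, rejected = [], []
    for record in reader:
        sku = (record["sku"] or "").strip()
        title = (record["title"] or "").strip()
        try:
            price = Decimal((record["price"] or "").strip())
        except InvalidOperation:
            price = None
        # Reject rows with an empty key, empty title, or unusable price.
        if not sku or not title or price is None or not price.is_finite() or price < 0:
            rejected.append(reader.line_num)
            continue
        rows.append((retailer_id, sku, title, price))
    return rows, rejected
```

With a PostgreSQL driver such as psycopg2, the valid rows could then be written with cursor.executemany(UPSERT_SQL, rows), while the rejected line numbers are logged for the retailer to correct.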
Kafka served as the backbone for real-time data streaming. A GCP Composer batch job pushed PostgreSQL data to Kafka topics, decoupling the ingestion and processing layers.
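A rough sketch of how such a batch job might turn database rows into Kafka messages is shown here. The topic name, batch size, and the send and flush callables are hypothetical stand-ins for a real producer client; the case study does not say which client or message format was used. Keying each message by retailer and SKU is one common choice, because Kafka's default partitioner then routes all updates for a product to the same partition and preserves their order.

```python
import json
from itertools import islice


def batched(rows, size):
    """Yield lists of at most `size` rows from any iterable."""
    it = iter(rows)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def to_message(row):
    """Serialize one row; the key keeps updates to a product in one partition."""
    key = f"{row['retailer_id']}:{row['sku']}".encode("utf-8")
    value = json.dumps(row, sort_keys=True, default=str).encode("utf-8")
    return key, value


def publish(rows, send, flush, topic="catalog.products", batch_size=500):
    """Send rows in batches, flushing after each batch; return the count sent.

    `send(topic, key, value)` and `flush()` are assumed wrappers around
    whatever Kafka producer client is in use.
    """
    sent = 0
    for chunk in batched(rows, batch_size):
        for row in chunk:
            key, value = to_message(row)
            send(topic, key, value)
        flush()  # bound how much unacknowledged data is in flight
        sent += len(chunk)
    return sent
```

Because the producer is passed in as plain callables, the batching and serialization logic can be tested without a running broker.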
Data from Kafka was processed by lightweight transformation jobs implemented as Scala microservices.
Airbyte was used to ingest data from various sources (PostgreSQL, FTP, etc.) into GCP BigQuery, where the data science team consumed it to generate product recommendations.
Elasticsearch was used to store the product catalog and recommendations, populated by importing transformed data from BigQuery.
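One way to picture the import into the search index is through the request body of the bulk API, which takes newline-delimited JSON: an action line followed by the document itself. The sketch below only builds that body. The index name products and the use of sku as the document ID are assumptions for illustration, not details from the project.

```python
import json


def to_bulk_ndjson(docs, index="products", id_field="sku"):
    """Build a bulk-API body: one action line plus one source line per document."""
    lines = []
    for doc in docs:
        # Reusing a stable ID makes re-imports overwrite instead of duplicating.
        action = {"index": {"_index": index, "_id": str(doc[id_field])}}
        lines.append(json.dumps(action))
        lines.append(json.dumps(doc))
    if not lines:
        return ""
    # The bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"
```

The resulting string would be posted to the _bulk endpoint with the Content-Type header set to application/x-ndjson.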
GCP Composer coordinated the end-to-end pipeline.
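The ordering that an orchestrator enforces can be sketched with the standard library alone. The step names below are hypothetical labels for the components described above, and the dependencies are inferred from that description rather than taken from the actual Composer DAG.

```python
from graphlib import TopologicalSorter

# Each step maps to the set of steps that must finish before it starts.
# Step names are illustrative labels, not the project's real task IDs.
PIPELINE = {
    "extract_csv_to_postgres": set(),
    "publish_postgres_to_kafka": {"extract_csv_to_postgres"},
    "airbyte_sync_to_bigquery": {"extract_csv_to_postgres"},
    "import_bigquery_to_elasticsearch": {"airbyte_sync_to_bigquery"},
}


def execution_order(pipeline):
    """Return one valid run order; raises graphlib.CycleError on a cycle."""
    return list(TopologicalSorter(pipeline).static_order())
```

In Composer the same dependencies would be declared between tasks in a DAG, and the scheduler would also handle retries and run independent steps in parallel.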
Processed data was stored in PostgreSQL as the source of truth, in Google BigQuery for analytics, and in Elasticsearch for search and consumption. Partitioning and clustering of the BigQuery tables kept queries fast and costs under control.
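As an illustration of partitioning and clustering, the sketch below generates BigQuery DDL for a catalog table. The table schema, the ingested_at partition column, and the clustering columns are assumptions; the case study does not state which columns were used. Partitioning by date lets queries that filter on a date range scan only the matching partitions, and clustering co-locates rows for the same retailer and category within each partition.

```python
def build_catalog_ddl(
    table,
    partition_column="ingested_at",
    cluster_columns=("retailer_id", "category"),
):
    """Return BigQuery DDL for a date-partitioned, clustered catalog table."""
    if not 1 <= len(cluster_columns) <= 4:
        raise ValueError("BigQuery accepts between one and four clustering columns")
    return (
        f"CREATE TABLE IF NOT EXISTS `{table}` (\n"
        "  retailer_id STRING NOT NULL,\n"
        "  sku STRING NOT NULL,\n"
        "  category STRING,\n"
        "  price NUMERIC,\n"
        f"  {partition_column} TIMESTAMP NOT NULL\n"
        ")\n"
        f"PARTITION BY DATE({partition_column})\n"
        f"CLUSTER BY {', '.join(cluster_columns)}"
    )
```

For example, build_catalog_ddl("my_project.catalog.products") yields a statement that could be run once when a dataset is set up; the project and dataset names here are placeholders.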
Google Cloud Operations Suite and Airbyte’s monitoring tools were used to monitor the pipeline.
Client name: Confidential
Services: Marketplace, Personalized fashion
Technology: GCP Composer, Kafka, Airbyte
Industry: E-Commerce
Location: USA
The client is a consumer-facing fashion marketplace offering a personalized shopping experience.
Dedicated Team with Scala and Data Engineering expertise
Budget Optimisation
On-time Delivery
The architecture leveraged GCP’s managed services, along with open-source tools and Kafka for data ingestion and orchestration.
Pizenith Technologies IT Advisor
+1 647-356-6855