Scope of the project
Our client is a big ecommerce group comprising more than 150 online stores and brands. Daily, each of their stores produces a tremendous volume of data. Plus, the company uses lots of back-office systems for managing their operations. As a result, the client gets a number of independent data sources, which makes it hard for them to make data-driven decisions.
The main data sources used by the client are databases, PIM systems, ERP systems, Google Analytics, Google AdWords, tracking systems, performance testing systems, Excel (Google Spreadsheets), etc. The key data types include:
- Product data
- Marketing data
- Financial data
The client’s request was to migrate the crucial historical data to Google’s warehouse and enable real-time/regular data collection across the main channels. The problem with disintegrated data management was that the client could view metrics on each store separately, without seeing the whole picture of their 150+ stores, business systems, plus back-office tools.
The company wanted to enable consistent data collection in one place and get role-based access to different datasets. The task was complicated by the fact that each dataset was managed and controlled by a different stakeholder within the enterprise. Our goal was to bring it all together and set an effective mechanism for data storage and collection.
We’ve managed to deliver measurable project results. Firstly, our team migrated data from the main sources, including Google Analytics, to BigQuery. Secondly, we ensured that these data are connected and can be leveraged in different types of reports. Both historical and real-time data are now available to the client.
Thirdly, we securely store the data in one place and expose them to powerful BI and analytics tools. Depending on the client’s purpose, the team can access different datasets and manipulate them in a way that drives useful business insights.