Category: 4 SPL Big Data Computing
Data maintenance routine
Overview Composite table is an important file storage format of SPL. To ensure high performance, composite table often requires data to be stored in order. However, the order in which the data are generated is usually different from the order required by composite table, so it needs to adjust the order of data when maintaining composite table data. In addition, the composite table cannot be read when writing data to it, yet some query computation tasks cannot be stopped during data maintenance, which also needs to be ensured using corresponding means. This routine will give solutions to these problems. Related
SPL practice: implement real-time write, second-level count of daily 10 billion time series data on a single node
Problem description In order to monitor the operation status of power generation equipment, sensors (DCS) will be placed in the equipment to collect data. A power monitoring and statistical system designed by a company needs to collect the data measured on multiple sensors (referred to as monitoring point below) in real time at a fixed frequency and then store, and provide the query and statistical functionalities for the data of specified monitoring point within any time interval. The statistics include maximum, minimum, mean, variance, median, etc. Data structure and data scale Field name Field meaning Field type Sample data id
SPL practice: improve concurrency through route calculation
Problem description System architecture The frontend application is a BI system that is mainly to implement query based on some filter conditions, and the backend is a certain MPP database. The frontend generates the SQL suitable for wide table ‘orders’ (order table) based on user’s selection and operation on the page, and then submit the SQL to the backend, and finally display the result returned from backend. The generated SQL is roughly as follows: In the where condition, the date is a required field (it means to only query one day’s data each time); other fields and filter conditions can
SPL practice: multi-index calculation in real time
Problem description Data structure and data scale Deposit details table (‘deposit’ for short): Field name Field type Field meaning Sample data dt Date Date 2023-11-23 curr String Currency type 1: RMB, 0: others cust_no String Customer number 2889412 dept Number Department number 1-500 code11 Number Code 11 1, 0 code12 Number Code 12 1, 0 amt Number Amount 32499.90 … This table contains a dozen fields, and stores 20 million records per day, totaling 600 million records for 30 days. Customer table (‘cust’ for short): Field name Field type Field meaning Sample data dt Date Date 2023-11-23 cust_no String Customer







