Category: 4 SPL Big Data Computing
User Behavior Analysis in Practice 4: Using Column-wise Storage
Target task: We have a user events table T. Below is its structure and part of its data: Time UserID EventTypeID EventType Product Quantity Unit Price … 2022/6/1 10:20 1072755 3 Search Apple 5 Pound 5.5 … 2022/6/1 12:12 1078030 2 Browse Tissue 4 Packs 16 … 2022/6/1 12:36 1005093 5 Submit Beef 3 Pound 35 … 2022/6/1 13:21 1048655 1 Login … 2022/6/1 14:46 1037824 6 Logout … 2022/6/1 15:19 1049626 4 AddtoCart Wine 4 Bottles 120 … 2022/6/1 16:00 1009296 5 Submit Pork 6 Pound 25 … 2022/6/1 16:39 1070713 2 Browse Bread 3 Packs 10 … 2022/6/1
User Behavior Analysis in Practice 3: Order-based Filtering Using Binary Search
Target task: We have a user events table T. Below is its structure and part of its data: Time UserID EventTypeID EventType 2022/6/1 10:20 1072755 3 Search 2022/6/1 12:12 1078030 2 Browse 2022/6/1 12:36 1005093 5 Submit 2022/6/1 13:21 1048655 1 Login 2022/6/1 14:46 1037824 6 Logout 2022/6/1 15:19 1049626 4 AddtoCart 2022/6/1 16:00 1009296 5 Submit 2022/6/1 16:39 1070713 2 Browse 2022/6/1 17:40 1090884 3 Search Fields in table T: Field name Data type Description Time Datetime Time stamp of an event, accurate to milliseconds UserID Integer User ID EventTypeID Integer Event type ID EventType String Event type name
User Behavior Analysis in Practice 2: Redundant Grouping Key Field
Target task: We have a user events table T. Below is its structure and part of its data: Time UserID EventTypeID EventType 2022/6/1 10:20 1072755 3 Search 2022/6/1 12:12 1078030 2 Browse 2022/6/1 12:36 1005093 5 Submit 2022/6/1 13:21 1048655 1 Login 2022/6/1 14:46 1037824 6 Logout 2022/6/1 15:19 1049626 4 AddtoCart 2022/6/1 16:00 1009296 5 Submit 2022/6/1 16:39 1070713 2 Browse 2022/6/1 17:40 1090884 3 Search Fields in table T: Field name Data type Description Time Datetime Time stamp of an event, accurate to milliseconds UserID Integer User ID EventTypeID Integer Event type ID EventType String Event type name







