Category: 1 SPL&Code
Calculate prime number table within 10000 using screening method
The screening method is an ancient way of finding all prime numbers that do not exceed the natural number N (N>1). Arrange natural numbers 1 to N. The smallest prime number is 2, so first cross out 1. Starting from 2, leave 2, and cross out all numbers after 2 that can be divided by 2. Then the first number after 2 that is not crossed out is 3, leave 3 behind, and then cross out all numbers after 3 that can be divided by 3. The first number after 3 that is not crossed out is 5. Leave 5
This Is Probably the Most Lightweight Columnar Storage Technology
Columnar storage is an important technique to improve data processing & analysis. If a data table has a large number of columns but the processing/analysis only involves a very small number of them, columnar storage lets you read the target columns only. This helps reduce disk accesses and boost performance. One column generally contains data of the same type. Sometimes it even has values that are close to each other. By storing such data in order, it can be compressed more efficiently. But setting up a column-oriented data warehouse for use in real-world applications is too complicated. All the commonly
A Universal File Format N Times Faster Than CSV
Using csv files to store data is very common. Similar formats include text files, tsv files, etc. These files all have a plain structure—no hierarchy—and store data as text characters. Some xls files also have a similar plain structure. CSV files have characteristics such as simplicity and strong versatility, but they also have significant drawbacks. Text characters cannot directly participate in calculations, and parsing them to in-memory data types such as dates and real numbers is a very time-consuming process, especially when checking for invalid cases, which makes it even more complex. Therefore, the performance of csv files is consistently
Using SQL for data analysis seems to be a false proposition
Using SQL for data anlysis is actually quite basic SQL is widely used for data analysis and often considered a default skill of data analysts. Indeed, being able to write SQL in a database environment is very convenient; it might seem as if you can solve any query by just writing a single SQL statement. For example, when you want to find sales grouped by user, the SQL code written out is as simple as English: However, data analysis tasks are not always that simple. For example, to calculate the next-day retention rate of new users, the SQL code will
This is probably the most user-friendly programming for Excel data analysts
The puzzles facing Excel data analysts In the world of data, data analysts who work with Excel are like a group of “knights holding the ‘Excalibur’”. Relying on Excel’s powerful interactive ability and ease of use, Excel knights cut through obstacles easily and perform data analysis efficiently. However, potential challenges are surging continuously in the said world. As the scale of data continues to grow and the requirements for data analysis are increasingly complex, relying solely on the ‘Excalibur’ seems insufficient to cope with the ever-emerging challenges. Excel is good, but it seems powerless when faced with complex ‘data enemies.’
SSC (Scudata SPL Cloud) Global Variable and Lock
Global variable In SSC environment, sometimes information needs to be shared between computing tasks (QVM scripts) on multiple QVMs. In this case, SSC global variable (simply written as global variable in the following part) is useful. Managed by QVS, the global variable is available to every script run on each QVM applied through QVS. Qenv function is used to assign value to the global variable: Qenv(“gv1”,1) #gv1 is the variable name Qenv(“gv1”) The function is also used to obtain the global variable: Qenv(“gv1”) Unlike an ordinary SPL global variable, related data will be transmitted between QVS and QVM once the







