?? ibm數(shù)據(jù)生成器所生成文件說明.txt
字號(hào):
(1) Associations and Sequential Patterns:
Code:
assoc.gen.tar.Z (26,286 bytes)
Downloading and Compiling Tips
Usage:
gen lit|tax|seq [options]
gen lit|tax|seq -help For more detailed list of options
lit: large (frequent) itemsets without taxonomies
tax: large (frequent) itemsets with taxonomies
seq: sequential patterns
Output Format:
There are two posssible output formats for the data file, based on whether or not the "-ascii" option is specified.
Binary
Consists of <CustID, TransID, NumItems, List-Of-Items.> Each of these is a 4-byte integer.
Ascii
Each line contains a CustID, TransID, and Item. Each of these take up 10 bytes, for a total of 33 bytes per line.
Apart from the data file, this program also generates a pattern file. The pattern file has three parts:
A description of the data.
A list of items with high weights. (Recall that the weight corresponds to the probability that item will appear in an itemset.) Each line has the item number, followed by the weight.
A list of the itemsets/sequential patterns with high weight. (Recall that the weight corresponds to the probability that the itemset will appear in a transaction.) Each line has the weight, the expected confidence for rules generated from this itemset, and the itemset.
?? 快捷鍵說明
復(fù)制代碼
Ctrl + C
搜索代碼
Ctrl + F
全屏模式
F11
切換主題
Ctrl + Shift + D
顯示快捷鍵
?
增大字號(hào)
Ctrl + =
減小字號(hào)
Ctrl + -