*** *** *** *** *** *** *****
** Two wire/I2C Bus READ/WRITE SAMple Routines of Microchip s
** 24Cxx / 85Cxx serial CMOS EEPROM interfacing to a
** PIC16C54 8-bit CMOS single chip microcomputer
** Revsied Version 2.0 (4/2/92).
**
** Part use = PIC16C54-XT/JW
** Note: 1) All timings are based on a reference crystal frequency of 2MHz
** which is equivalent to an instruction cycle time of 2 usec.
** 2) Address and literal values are read in octal unless otherwise
** specified.
We address the problem of predicting a word from previous words in a SAMple of text. In particular,
we discuss n-gram models based on classes of words. We also discuss several statistical algorithms
for assigning words to classes based on the frequency of their co-occurrence with other words. We
find that we are able to extract classes that have the flavor of either syntactically based groupings
or semantically based groupings, depending on the nature of the underlying statistics.
決策樹算法C4.5源碼,
The directory Data contains some SAMple datasets. Many interesting data
collections are maintained by the University of California, Irvine, in a
data repository