.\" Process this file with
.\"    groff -man -Tascii infomap-build.1
.TH INFOMAP-BUILD 1 "February 2004" "Infomap Project" "Infomap NLP Manual"
.SH NAME
.TP
infomap-build \- build an Infomap WordSpace model
.SH SYNOPSIS
.B infomap-build
.RB [ "-w " working_dir]
.RB [ "-p " param_file]
.RB [ "-D " "var_1=val_1 ... " "-D " "var_N=val_N]"
.RB ( "-s " "single_corpus_file | " "-m " multi_file_list)
<model_tag>
.br
.B infomap-build
.BR -s \ <single_corpus_file>
<model_tag>
.br
.B infomap-build
.BR -m \ <file_list_file>
<model_tag>
.SH DESCRIPTION
.B infomap-build
builds an Infomap WordSpace model from a properly formatted input
corpus.  It is the main driver program of the Infomap NLP software.
.B infomap-build
is a wrapper around
.BR make (1),
which in turn builds a model by invoking various other Infomap NLP
tools.
.PP
In its simplest form, shown in the last two lines in the above
synopsis,
.B infomap-build
is passed a corpus and a model tag.  The corpus is either a single
file (specified as an argument to the
.B -s
option), or is stored in multiple files, one file per corpus document.
For multi-file corpora, a file listing the names of all the files making up
the corpus is given as an argument to the
.B -m
option.  The model tag will be used to refer to the resulting model.
.PP
.B infomap-build
creates a directory whose name is the model tag.  The files generated
during model building are written to this directory, which is a
subdirectory of the default working directory.  The default working
directory is the value of the
.B INFOMAP_WORKING_DIR
environment variable if it is set; otherwise it is
.IR /tmp/$USERNAME/infomap_working_dir .
.SH OPTIONS
.TP
.BI -D \ var=val
This option defines a variable whose value will be passed through
to
.BR make .
It can be used to set parameters that control the building of the
model, such as the size of word vectors.
Values set using
.B -D
override both the defaults (from
.IR @pkgdatadir@/default-params )
and the values specified using
.BR -p ,
if any.
Useful variables that can be set using -D are described
in the
.B MODEL PARAMETERS
section below.
.TP
.BI -m \ file_list_file
For multi-file corpora (hence the "m"), a file listing all of the
files that make up the corpus, one per line.  Each file must consist
of exactly one corpus document.  This option and the
.B -s
option are mutually exclusive.
.TP
.BI -p \ param_file
A file containing parameters to control the building of the model.
These parameters should be specified in variable=value format, one per line.
The values in this file override the defaults given in
.IR @pkgdatadir@/default-params .
Values passed to
.B -D
override the values in this file.
See the
.B MODEL PARAMETERS
section below.
.TP
.BI -s \ single_corpus_file
For single-file corpora (hence the "s"), the file containing the
corpus.  Within this file, documents should be marked by <DOC> and
</DOC> tags; within each document, the text that is actually to be
processed should be within a <TEXT> tag and a </TEXT> tag.  This
option and the
.B -m
option are mutually exclusive.
.TP
.BI -w \ working_dir
The working directory in which to build the model.  Model files will
be written to a directory named
.I model_tag
that is a subdirectory of this directory.  This option overrides both
the
.B INFOMAP_WORKING_DIR
environment variable and the system default
.RI ( /tmp/$USERNAME/infomap_working_dir ).
.SH MODEL PARAMETERS
The following parameters control the building of models.  These parameters
can be specified by listing them in a file in
.B VAR=VALUE
form and passing that file as an argument to the
.B -p
option.  They can also be specified on the command line using the
.B -D
option.
Values given on the command line override those given in
a file.
.PP
While default values for these parameters are listed below for
convenience, the true defaults are obtained from the file
.I $pkgdatadir/default-params
at runtime, and should be trusted over the values given here
in case of conflict.
.PP
.B ROWS
.RS
The number of words for which to learn word vectors.  Called
.B ROWS
because it is the number of rows in the matrix of co-occurrence counts
produced by
.BR count_wordvec (1).
Default is 20,000.
.RE
.PP
.B COLUMNS
.RS
The number of content-bearing words to use as features in the process
of computing word vectors.  Called
.B COLUMNS
because it is the number of columns in the matrix of co-occurrence
counts produced by
.BR count_wordvec (1).
Each word vector is reduced from
.B COLUMNS
dimensions to
.B SINGVALS
dimensions by
.BR svdinterface (1).
Default is 1000.
.RE
.PP
.B SINGVALS
.RS
The number of dimensions that the word vectors ultimately produced
will have.  Called
.B SINGVALS
because the original co-occurrence vectors are reduced to this many
elements by Singular Value Decomposition (SVD) (see
.BR svdinterface (1)).
Default is 100.
.RE
.PP
.B SVD_ITER
.RS
The number of iterations to be used by the SVD algorithm. Default is
100.
See
.BR svdinterface (1).
.RE
.PP
.B PRE_CONTEXT_SIZE
.RS
This parameter and
.B POST_CONTEXT_SIZE
control the size of the context window used by
.BR count_wordvec (1)
in computing its co-occurrence counts.
Any word occurring in the
.B PRE_CONTEXT_SIZE
words immediately preceding a target word
.B w
will be considered to have appeared in the context of
that occurrence of
.BR w .
(Note that context windows can also be truncated by
document boundaries.)
Default is 15.
.RE
.PP
.B POST_CONTEXT_SIZE
.RS
This parameter and
.B PRE_CONTEXT_SIZE
control the size of the context window used by
.BR count_wordvec (1)
in computing its co-occurrence counts.
Any word occurring in the
.B POST_CONTEXT_SIZE
words immediately following a target word
.B w
will be considered to have appeared in the context of
that occurrence of
.BR w .
(Note that context windows can also be truncated by document
boundaries.)
Default is 15.
.RE
.PP
.B WRITE_MATLAB_FORMAT
.RS
This parameter is a binary flag.  If it is set to 1,
.BR count_wordvec (1)
will write the co-occurrence matrix in MATLAB's input format,
as well as in the format used by
.BR svdinterface (1).
If it is set to 0, no such additional output will be
written.
Default is 0.
.RE
.PP
.B VALID_CHARS_FILE
.RS
The valid characters file contains the valid word characters. These
characters are the ones your words will eventually be composed of. All
other characters are treated by the tokenizer as breaking characters and
are skipped. The characters in the valid characters file are
given as one continuous string without delimiters.
The default valid characters file is
.BR $pkgdatadir/valid_chars.en ,
which is for the English language and specifies [a-z][A-Z], '_' and '~'
as valid word characters.
If you want to use infomap for languages
using a different character set (say ISO-8859-2 for Central European)
or wish to use other breaking characters, you have to prepare your own
valid chars file.
Watch out for newlines: if the file ends with one, the newline will be
treated as a legitimate word character, which may not be what you want.
See
.BR prepare_corpus (1)
for more on the hard-wired features of the tokenization method.
.RE
.PP
.B STOPLIST_FILE
.RS
The stoplist file should contain a list of words, one word
per line, that are to be treated as stopwords and ignored during
processing (i.e., they will not be selected as content-bearing words,
and word vectors will not be computed for them).  The default
is $pkgdatadir/stop.list, which is a reasonable choice for the English
language. If you want to use infomap for languages other than English, you
have to prepare your own list of stopwords, or at least disable the English
list by specifying an empty stoplist file.
.RE
.PP
.B COL_LABELS_FROM_FILE
.RS
If equal to 1, this Boolean variable indicates that the column labels
of the word-word co-occurrence matrix should be read from the file
.BR COL_LABEL_FILE .
If set to 0,
.BR count_wordvec (1)
will choose column labels automatically.
Default is 0.
.RE
.PP
.B COL_LABEL_FILE
.RS
If
.B COL_LABELS_FROM_FILE
equals 1,
then this is the name of the file containing a set of user-specified
content-bearing words which
.BR count_wordvec (1)
will use as column labels of the co-occurrence matrix.
.RE
.\" .SH EXAMPLES
.SH FILES
.I @pkgdatadir@/Makefile.data
.RS
Describes dependencies between generated model files.
.B infomap-build
invokes
.BR make (1)
with this as the Makefile.
.RE
.PP
.I @pkgdatadir@/default-params
.RS
This file contains default values for model-building parameters, such
as the size of word vectors, the number of words for which to learn
vectors, and the number of content-bearing words.
These values can be overridden by specifying a different parameter file
using the
.B -p
option and/or by setting individual parameters using
.BR -D .
.RE
.SH ENVIRONMENT VARIABLES
.B INFOMAP_WORKING_DIR
.RS
The working directory in which to build the model; model files
will be created in a subdirectory named
.I model_tag
in this directory, which will be created if necessary.
This variable overrides the systemwide default
(/tmp/$USERNAME/infomap_working_dir), and can be overridden by the
.B -w
option.
.RE
.SH SEE ALSO
.BR associate (1),
.BR infomap_build (1),
.BR prepare_corpus (1),
.BR count_wordvec (1),
.BR svdinterface (1),
.BR encode_wordvec (1),
.BR count_artvec (1),
.BR write_text_params (1).
.SH DIAGNOSTICS
Returns 0 to indicate success; returns a nonzero value to indicate an error.
.SH BUGS
Please report bugs to
.BR infomap-nlp-users@lists.sourceforge.net .
.SH CREDITS
The Infomap NLP software was written by Stefan Kaufmann, Hinrich
Schuetze, Dominic Widdows, Beate Dorow, and Scott Cederberg.  The
Infomap algorithm was originally developed by Hinrich Schuetze.
The
.B infomap-build
script was written by Scott Cederberg.
.SH AUTHOR
This manual page was written by Scott Cederberg.  Please direct
inquiries and bug reports to
.BR infomap-nlp-users@lists.sourceforge.net .
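
The override order described under OPTIONS and MODEL PARAMETERS (defaults, then the -p file, then -D definitions) can be illustrated with a small Python sketch. This is not how infomap-build itself resolves parameters (it delegates to make); the function names resolve_params and parse_param_lines are hypothetical, the default values are the ones listed in this page rather than read from default-params, and skipping lines without an "=" sign is an assumption.

```python
# Illustrative sketch only: mimics the documented precedence
# defaults < -p param_file < -D var=val. Names here are hypothetical.

def parse_param_lines(lines):
    """Parse VAR=VALUE entries, one per line.

    Assumption: blank lines and lines without '=' are skipped.
    """
    params = {}
    for line in lines:
        line = line.strip()
        if not line or "=" not in line:
            continue
        name, value = line.split("=", 1)
        params[name.strip()] = value.strip()
    return params


def resolve_params(defaults, param_file_lines=(), cli_defines=()):
    """Merge the three sources; later sources override earlier ones."""
    merged = dict(defaults)
    merged.update(parse_param_lines(param_file_lines))  # -p overrides defaults
    merged.update(parse_param_lines(cli_defines))       # -D overrides -p
    return merged


# Defaults as listed in this manual page (the true defaults come from
# the default-params file at runtime).
defaults = {"ROWS": "20000", "COLUMNS": "1000", "SINGVALS": "100"}
param_file = ["COLUMNS=2000", "SINGVALS=200"]  # contents of a -p file
cli = ["SINGVALS=50"]                          # from -D SINGVALS=50

resolved = resolve_params(defaults, param_file, cli)
print(resolved)
```

Here ROWS keeps its default, COLUMNS comes from the parameter file, and SINGVALS comes from the command line, matching the rule that -D values override both other sources.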

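The single-file corpus layout required by the -s option (each document inside <DOC> tags, with the text to process inside <TEXT> tags) can be produced with a few lines of Python. This is a minimal sketch: format_corpus is a hypothetical helper, and the exact placement of newlines around the tags is an assumption, since this page only specifies the tag nesting.

```python
# Illustrative sketch only: builds a single-file corpus in the
# <DOC>/<TEXT> layout described for the -s option.
# format_corpus is a hypothetical helper, not part of Infomap.

def format_corpus(documents):
    """Wrap each document's text in <DOC> and <TEXT> tags.

    Assumption: one tag per line; the manual page does not say how
    whitespace around the tags is treated.
    """
    parts = []
    for text in documents:
        parts.append("<DOC>\n<TEXT>\n" + text.strip() + "\n</TEXT>\n</DOC>\n")
    return "".join(parts)


corpus = format_corpus(["First document.", "Second document."])
print(corpus)
```

Written to a file (for example corpus.txt, a placeholder name), the result could then be passed to infomap-build with -s followed by the file name and a model tag, as in the second form of the synopsis.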