常見例句In 1998, funded by national 863 program, we built a read speech corpus based on linguistic/phonetic rules, which have helped the ASR a lot. 在863項(xiàng)目資金的資助下,在1998年建立了一個(gè)基于語言學(xué)/語音學(xué)規(guī)則的朗讀文本語料庫(kù)。Corpus based similarity word extraction can be used in various kind of fields such as lexicon compiling and intelligent search engine. 在此基礎(chǔ)上定義的詞匯相似關(guān)系或聚類關(guān)系可以應(yīng)用于詞典編繤、智慧搜索引擎的開發(fā)等許多領(lǐng)域。All word carry on frequency of the word count legal corpus base secondly, divide according to five frequentness law grade. 其次是將法律語料庫(kù)中所有的詞進(jìn)行詞頻統(tǒng)計(jì),并根據(jù)五級(jí)頻度劃分法進(jìn)行分級(jí)。It is mainly a corpus base of the legal written language to set up legal corpus base at first, and mark automatic participle of the machine and morphological feature to it. 首先是建立法律語料庫(kù),主要是法律書面語的語料庫(kù),并對(duì)其進(jìn)行機(jī)器自動(dòng)分詞和詞性標(biāo)注。The corpus base about web page is also analyzed. Finally, this paper synthesized the above research results and designed and realized a systematic model of automatic summarization of a Chinese web page, which has been tested by the real web page. 最后本文綜合以上研究成果設(shè)計(jì)并實(shí)現(xiàn)了一個(gè)中文web網(wǎng)頁(yè)自動(dòng)摘要系統(tǒng)模型,并對(duì)實(shí)際的網(wǎng)頁(yè)進(jìn)行了測(cè)試。Classifying Text Corpus Based on Information Gain Weight of Feature 基于特征信息增益權(quán)重的文本分類算法 返回 Corpus-based