1) Chinese Lexical Analysis Algorithm
中文分词算法解析
2) Chinese morpheme analysis
中文词法分析
1.
For Chinese language, Chinese morpheme analysis is the basic and key technology for Chinese information processing because it directly relates to the sentence analysis and semantic understanding of the next step, and finally affects the actual application system.
对于中文来说,中文词法分析是中文信息处理技术的基础和关键,它直接关系到后续的句法分析和语义理解,并最终影响到实际的应用系统。
3) Overview on Chinese Segmentation Algorithm
中文分词算法概述
4) file parsing algorithm
文档解析算法
5) Chinese words segmentation
中文分词
1.
For Chinese words segmentation,a module which is based on word library and uses the positive direction maximum matching algorithm was presented.
在该系统模型中,针对中文分词实现了基于词库的采用正向最大匹配算法的中文分词模块;针对多种格式文档的处理采用接口实现的方式和动态实例化的方法,实现了可以有效地处理txt、xml、html、pdf、doc和rtf等常见格式文档。
6) chinese participle
中文分词
1.
Chinese WEB documents classification involves to the documents automatic capture, the information processing and the extraction, the automatic sorting and so on, this article realizes a open style Chinese WEB documents automatic sorting system, and has applied several improvement algorithms in the system module, in the main solution present information retrieval involves when Chinese participle .
中文WEB文档的分类涉及到文档的自动抓取、信息加工和提取、自动分类等,本文实现一个开放式的中文WEB文档自动分类系统,并在系统模块中应用了几个改进算法,主要解决目前信息检索中涉及中文分词搜索时所遇到的一些问题。
2.
The computer may very easily understand English word, but Chinese sentence which is composed by the word, which can be understood through Chinese participle technology.
计算机可以很容易地理解英文单词,而对由词组成的中文句子,必须通过中文分词技术才得以理解。
3.
It returned to the first results of the Chinese participle and fully tap its semantic information use CC4 neural networks to judge the he rele-vant web page to re-sort the results,and a good solution integrated search engine does not prevail in the search results accurate information stagnant.
它使用神经网络对检索结果进行优化排序,它先对返回结果进行中文分词,在充分挖掘其语义信息的基础上,利用CC4神经网络对网页的相关性进行判断,对返回结果重新排序,很好地解决了综合性搜索引擎中普遍存在搜索结果不准确、信息滞后等问题。
补充资料:中文
1.中国语言文字或中国语言文学的省称。特指汉语言文字或汉语言文学。
说明:补充资料仅用于学习参考,请勿用于其它任何用途。
参考词条