Search resource list
SQLET_split
- Another Chinese word segmentation program. It uses a human-readable dictionary, so the dictionary can be inspected and analyzed.
SharpICTCLAS
- The SharpICTCLAS word segmentation system: a detailed analysis of the ICTCLAS segmentation system, with the code ported to C#.
kalmanfiler11
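- The Kalman filter is an "optimal recursive data processing algorithm". For a large class of problems it is the optimal, most efficient, and even the most useful approach. It has been in wide use for more than 30 years, in robot navigation, control, sensor data fusion, and military applications such as radar systems and missile tracking. More recently it has also been applied to computer image processing, for example face recognition, image segmentation, and image edge detection.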
ShootSearch_path_for_bate_1.0
- A word segmentation algorithm for Chinese information classification. It implements the segmentation step of information filtering, laying the groundwork for classifying the filtered information.
worldSeg
- A simple C#-based word segmentation program, written with reference to the segmentation program by Zhan Weidong (詹卫东).
segment10_31
- Chinese word segmentation code that segments TXT files and organizes the segmentation results.
AutoSpliter
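- An algorithm for Chinese word segmentation, including reverse segmentation (the original description lists both 逆向 and 反向, which both mean "reverse"; forward segmentation may have been intended for one of them).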
WordSegmentation.rar
- A maximum-probability word segmentation program written long ago; the corpus is fairly large.
PatTermExtraction
- A term extraction system based on unsupervised machine learning, with preprocessing, word segmentation, and term extraction functions.
SW_I_WordSegment
- The SW-I Chinese word segmentation algorithm, an MFC program debugged and tested in Visual Studio 2008. The default lexicon is in mdb format; because of its size it is not included with the source files, so please download an mdb-format dictionary yourself.
CIPP_JSsetup
- Provides automatic word segmentation and supports automatic indexing; a good tool for processing Chinese natural language.
Source_Code
- A Chinese word segmentation analysis library. It segments Chinese text encoded in UTF-8 and supports external lexicons and manually controlled weighting (via callbacks). It can serve as the base segmenter when developing a search engine.
random_walker_matlab_code
- The random walker algorithm was introduced in the paper: Leo Grady and Gareth Funka-Lea, "Multi-Label Image Segmentation for Medical Applications Based on Graph-Theoretic Electrical Potentials", in Proceedings of the 8th ECCV04, Workshop on Compute
text
- A GBK word and sentence segmentation program written in Python. It can segment using the lexicons of the Sogou or Google input methods.
SimpleSplit
- A simple word segmentation program I wrote myself. It recognizes Chinese and English text, punctuation, digits, and so on. It is not very fast, but the ideas in it may be a useful reference.
IR_Lib
- XPDF: a library that converts PDF documents to text documents; for Chinese support, download the Chinese language pack from the official website. HTM2TXT: a library that converts HTML files to text files. ICTCLAS: a library that segments Chinese strings into words. PS2TXT: source code that converts PostScript files to text files.
Separate_word_program
- A Chinese word segmentation program that can segment Chinese content in a short time. split.txt holds the content to be segmented and word.TXT is the word list; the more complete the word list, the more accurate the final result.
77433632mmseg-v0.1
- Implements automatic Chinese word segmentation using the forward maximum matching algorithm; a valuable reference for beginners.
C99
- An algorithm for domain independent linear text segmentation. This is the Windows version of the C99 algorithm that was presented in my NAACL00 paper. [Directories] bin contains executables, JAR file and test files classes compiled code as
MyAnalyzer
- A simple Chinese word segmentation and sensitive-word search tool. Sensitive words can be added freely, and segmentation can use any one of the forward maximum matching, reverse maximum matching, or bidirectional maximum matching algorithms.
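
Several entries in this list mention forward, reverse, and bidirectional maximum matching. As a rough illustration of how these dictionary-based methods work, here is a minimal Python sketch. It is not taken from any of the listed packages: the function names, the `max_len` parameter, the toy lexicon, and the tie-breaking rule in the bidirectional variant (fewer tokens, then fewer single-character tokens, then the reverse result) are all assumptions chosen for the example.

```python
def forward_max_match(text, lexicon, max_len=4):
    """Scan left to right, always taking the longest lexicon word that fits."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first; fall back to a single character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


def backward_max_match(text, lexicon, max_len=4):
    """Scan right to left, always taking the longest lexicon word that fits."""
    tokens = []
    j = len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                j -= size
                break
    tokens.reverse()  # tokens were collected from the end of the text
    return tokens


def bidirectional_max_match(text, lexicon, max_len=4):
    """Run both scans and pick one result with a simple heuristic (assumed)."""
    fwd = forward_max_match(text, lexicon, max_len)
    bwd = backward_max_match(text, lexicon, max_len)
    if len(fwd) != len(bwd):
        # Prefer the segmentation with fewer tokens.
        return fwd if len(fwd) < len(bwd) else bwd

    def singles(tokens):
        return sum(1 for t in tokens if len(t) == 1)

    # On a tie, prefer fewer single-character tokens, then the reverse scan.
    return fwd if singles(fwd) < singles(bwd) else bwd


# Toy lexicon, made up for this example.
LEXICON = {"研究", "研究生", "生命", "起源"}

if __name__ == "__main__":
    sentence = "研究生命的起源"
    print(forward_max_match(sentence, LEXICON))
    print(backward_max_match(sentence, LEXICON))
    print(bidirectional_max_match(sentence, LEXICON))
```

On the sample sentence the forward scan greedily takes 研究生 and strands 命 as a single character, while the reverse scan recovers 研究 / 生命; the bidirectional variant exists to arbitrate between such disagreements.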