jiebaR包安装与使用 r语言分词

2014-12-17 15:12阅读：

http://blog.sina.cn/dpool/blog/u/3227204473

开源介绍文档---最早出现Python

#安装经验：反正就是需要devtools包和rstool工具，其实根本原因是，jiebar包是二进制编码的包，需要这两个工具协助的。
第一步：### install.packages(“devtools”)
library(devtools)
第二步 install_github('qinwf/jiebaR')###这个过程会自动把rstool安装上并配好坏境，我个人很讨厌配坏境变量。所以自动安装比较好。
第三步 install.packages(“jiebaR”)

#使用方法，下面的使用时来自，帮组文档

Examples
### Note: Can not display Chinese character on Windows here.
######简单的分词
words = 'hello world'
test1 = worker()#设置分词模式
test1
test1 <= words
####文件分词
## Not run:
test <= './temp.txt'
#######保留标点符号分词symbol = T
engine2 = worker('mix',symbol = T)
engine2 <= './temp.txt'
engine2
engine2$symbol = T
engine2
engine2 <= words
###不知道目的是干嘛
engine3 = worker(type = 'mix', dict = 'dict_path',symbol = T)

新浪博客

jiebaR包安装与使用 r语言分词

分享

我的更多文章

下载客户端阅读体验更佳

疯狂捕鱼