tag:blogger.com,1999:blog-486244449122404425.post1697186781166238610..comments2024-03-08T11:56:03.801+08:00Comments on 数据科学中的R和Python: 红楼梦文本折腾纪要写长城的诗http://www.blogger.com/profile/00652199274036685555noreply@blogger.comBlogger10125tag:blogger.com,1999:blog-486244449122404425.post-250374899976707692015-11-05T05:38:41.956+08:002015-11-05T05:38:41.956+08:00您实在是太过分了!哈哈您实在是太过分了!哈哈Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-43639975580020211122014-08-12T02:58:15.356+08:002014-08-12T02:58:15.356+08:00结果可喜可贺, 不过有几点还要请教。首先,置换检验是仅用于第一部分还是随后所有显著的检验。其次,林黛...结果可喜可贺, 不过有几点还要请教。首先,置换检验是仅用于第一部分还是随后所有显著的检验。其次,林黛玉名字的出现问题可能是情节设置,不知放入考虑因素是否妥当。最后,我个人以为关于章节中句子数量受情节影响较大,作为考量因素未必合适。此外,如果考虑每短句(逗号之间的句子)的长度可能更能表现作者的写作风格。(因为和音律有关)还有,听说有人探求红龙梦中stopwords(罢,亦,了等等)的使用模式从而判定前后两部分显著区分与否。我以为此方法相对较为可靠,毕竟越细节的部分是越难模仿的。Anonymoushttps://www.blogger.com/profile/11372669704302295818noreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-25079912493582058392014-08-12T02:51:46.634+08:002014-08-12T02:51:46.634+08:00此评论已被作者删除。Anonymoushttps://www.blogger.com/profile/11372669704302295818noreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-75161680500883075712013-08-25T15:48:26.446+08:002013-08-25T15:48:26.446+08:00大師作品大師作品Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-5318557340544234002013-04-02T08:03:12.635+08:002013-04-02T08:03:12.635+08:00终于解惑了,thank you终于解惑了,thank you写长城的诗https://www.blogger.com/profile/00652199274036685555noreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-4070088742742147802013-04-01T11:50:25.956+08:002013-04-01T11:50:25.956+08:00对于词频矩阵中获得三个字以上词的矩阵,可以使用
list(wordLengths=c(2, lnf)...对于词频矩阵中获得三个字以上词的矩阵,可以使用<br />list(wordLengths=c(2, lnf))就可以生成两个字的词了。<br />呵呵,其实因为英文中的单词大多都是三个字母以上的组合(除了a、 I 、am这些),估计就默认设置为三个了。Beibei KONGhttps://www.blogger.com/profile/06841158034712436219noreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-27716873668355499092012-07-04T05:36:31.226+08:002012-07-04T05:36:31.226+08:00楼上说的是,我后来也想到这问题,人类总喜欢寻找可能不存在的模式,比如看一片云。楼上说的是,我后来也想到这问题,人类总喜欢寻找可能不存在的模式,比如看一片云。写长城的诗https://www.blogger.com/profile/00652199274036685555noreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-83340473247899435362012-07-03T23:11:29.941+08:002012-07-03T23:11:29.941+08:00> wilcox.test(paragraph[1:70],paragraph[71:120]...> wilcox.test(paragraph[1:70],paragraph[71:120])<br /><br /> Wilcoxon rank sum test with continuity correction<br /><br />data: paragraph[1:70] and paragraph[71:120] <br />W = 2158.5, p-value = 0.02962<br />alternative hypothesis: true location shift is not equal to 0 <br /><br />> wilcox.test(paragraph[1:90],paragraph[91:120])<br /><br /> Wilcoxon rank sum test with continuity correction<br /><br />data: paragraph[1:90] and paragraph[91:120] <br />W = 1921.5, p-value = 0.0005283<br />alternative hypothesis: true location shift is not equal to 0Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-89353048200458100412012-06-11T20:42:37.028+08:002012-06-11T20:42:37.028+08:00这个可以去去吓吓那些红学家,呵呵这个可以去去吓吓那些红学家,呵呵Anonymoushttps://www.blogger.com/profile/16395179426117054598noreply@blogger.comtag:blogger.com,1999:blog-486244449122404425.post-52071780516697827942012-06-06T02:03:05.598+08:002012-06-06T02:03:05.598+08:00強大! 但有些地方没看懂 :'( 可能本科水平不夠高強大! 但有些地方没看懂 :'( 可能本科水平不夠高Anonymousnoreply@blogger.com