|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑
/ E+ F4 E+ n5 s$ x0 ]8 X& x' L8 n煮酒正熟 发表于 2013-12-20 12:05 3 W' Y# W7 h+ l9 E [; \
基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ... ) [7 B ^0 o( n4 Q1 v+ w
' P; I7 W; Q% x4 H) t这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。 6 @3 k: r! ^8 x
0 O0 L1 c: `- L/ x0 R1 f结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)
$ g* ?* l9 ]% [
" t9 Y, e' k! {6 D8 uR example:
, [9 N% ]# k' j( i; s+ c# }# n
* P: V9 [, K8 X R/ D) \> M<-as.table(rbind(c(1668,5173),c(287,930)))
M( Z8 e4 a& M> chisq.test(M)/ m4 `# r/ p8 m& [/ u) @2 y4 ?3 \
5 g+ I. n* B8 Q5 p4 f8 G* M Pearson's Chi-squared test with Yates' continuity correction; g7 G5 B. W6 U) `6 r6 r5 t- _
% W8 \6 G) g ?$ q w c7 b
data: M( q4 u, {9 k9 V
X-squared = 0.3175, df = 1, p-value = 0.5731
- v0 i8 E. k; G" X
; t4 \! @$ `6 q5 t. ?5 J7 RPython example:
+ R T! X: ?- r& u$ X6 }, G1 M
5 I7 o2 C& q, \3 W5 q. Q5 E>>> from scipy import stats
- |5 s& k! u; P; F/ M>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]])
p, k4 @7 U3 x! a# M& j J, i(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],
$ U0 \$ z" x1 L( I" q/ ^ [ 295.26371308, 921.73628692]])) |
|