|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑 5 q8 e6 T% T) X& s6 d+ O4 R: p
煮酒正熟 发表于 2013-12-20 12:05 ![]()
_( ]) P- A* ]* y+ M6 K) e基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ...
& A" [, L% S% @, U9 X& L2 T" E6 x' n* d& K
这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。 $ I* z; E# }; d) t
9 A. x6 A; T$ @8 B$ a/ l- _9 d
结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)! W3 z6 V, C9 w
, k, f' w8 M& vR example:- |: r: O' o' t r3 m3 ]9 m0 C
$ H. f3 Q: h, w2 G8 e
> M<-as.table(rbind(c(1668,5173),c(287,930))), z e- H l+ f5 M) }$ R6 j
> chisq.test(M): ]. r* c7 z: z9 v* a! W* i0 J9 |
; \3 ~ H1 w e; K
Pearson's Chi-squared test with Yates' continuity correction/ `" F! ]/ ~8 c7 l" V2 E
. t8 H" R2 Z5 |6 J( g0 ~
data: M
2 L4 {' e- ?' t9 VX-squared = 0.3175, df = 1, p-value = 0.57312 V$ p+ E! G4 e: ?% Q; N
5 D5 l, l; S# U' ?$ NPython example:
- H. ^& G' p4 u- K' d! V8 y" |* Z1 H- C( ?: H. m: y
>>> from scipy import stats5 D. ?% }/ h4 `) a, ?8 N- P( B/ D
>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]]): a! p' a8 p9 o
(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],0 k+ u! X0 f" Y
[ 295.26371308, 921.73628692]])) |
|