|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑 ( D( d3 H6 k3 N1 N9 p2 d, @
煮酒正熟 发表于 2013-12-20 12:05 5 x% }3 y8 f3 I
基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ... ( e! j8 a1 s; B' p5 r% b
2 l) b' I; Y" A& T! d5 h) o1 f" j
这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。 ! ]. _# Y& n' k
8 Y! U. x& f7 H# b% J$ ~
结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)& s& `- n) z6 `" e e, }
8 g; k2 _: R$ ?' |3 e0 o4 I: A
R example:
9 ^5 D3 l: N( }; j4 \; J2 L8 a
3 s1 [* X* |# E4 ~* @> M<-as.table(rbind(c(1668,5173),c(287,930)))
7 Q# u' t4 V- M- n> chisq.test(M)% d/ ]! _4 N: o2 r3 ?* a9 k
: F# E; X7 g9 _9 ` Pearson's Chi-squared test with Yates' continuity correction
5 N; C4 v2 U9 X2 M8 I7 ?6 t7 `. d. ~2 a- l3 j# c3 n. {6 l
data: M
( `# V# }+ m2 d. T3 k- qX-squared = 0.3175, df = 1, p-value = 0.57318 W2 g0 q: ~% d0 W D+ y
( \/ a- u+ J8 g/ c% M$ `
Python example:
7 |1 h! G; K4 @0 [ w! q2 \ L o9 B2 o
5 L2 P( M8 m4 t" p: I>>> from scipy import stats
- ]2 M9 T5 k G: s' O6 O- C. w>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]])
$ R1 B: A8 X) N4 }" o' A( ?2 S2 A(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],
4 u; ?: d" }) v% S) O [ 295.26371308, 921.73628692]])) |
|