2. S 2 ɛ 3. ˆβ S 2 ɛ (n p 1)S 2 ɛ χ 2 n p 1 Z N(0, 1) S 2 χ 2 n T = Z/ S 2 /n n t- Z T = S2 /n t- n ( ) (n+1)/2 Γ((n + 1)/2) f(t) = 1 + t2 nπγ(n/2) n

Similar documents
.3 ˆβ1 = S, S ˆβ0 = ȳ ˆβ1 S = (β0 + β1i i) β0 β1 S = (i β0 β1i) = 0 β0 S = (i β0 β1i)i = 0 β1 β0, β1 ȳ β0 β1 = 0, (i ȳ β1(i ))i = 0 {(i ȳ)(i ) β1(i ))

(lm) lm AIC 2 / 1

DAA09

1 15 R Part : website:

k2 ( :35 ) ( k2) (GLM) web web 1 :

講義のーと : データ解析のための統計モデリング. 第3回

201711grade2.pdf

waseda2010a-jukaiki1-main.dvi

第11回:線形回帰モデルのOLS推定

untitled

インターネットを活用した経済分析 - フリーソフト Rを使おう

一般化線形 (混合) モデル (2) - ロジスティック回帰と GLMM

1 kawaguchi p.1/81

R John Fox R R R Console library(rcmdr) Rcmdr R GUI Windows R R SDI *1 R Console R 1 2 Windows XP Windows * 2 R R Console R ˆ R

28

R R 16 ( 3 )

kubostat2017c p (c) Poisson regression, a generalized linear model (GLM) : :

Use R

10


tnbp59-17_Web:プO1/ky079888509610003201

.. ( )T p T = p p = T () T x T N P (X < x T ) N = ( T ) N (2) ) N ( P (X x T ) N = T (3) T N P T N P 0

> usdata01 と打ち込んでエンター キーを押すと V1 V2 V : : : : のように表示され 読み込まれていることがわかる ここで V1, V2, V3 は R が列のデータに自 動的につけた変数名である ( variable

BMIdata.txt DT DT <- read.table("bmidata.txt") DT head(dt) names(dt) str(dt)


Part () () Γ Part ,

Copyright (c) 2004,2005 Hidetoshi Shimodaira :43:33 shimo X = x x 1p x n1... x np } {{ } p n = x (1) x (n) = [x 1,..

最小2乗法

A B P (A B) = P (A)P (B) (3) A B A B P (B A) A B A B P (A B) = P (B A)P (A) (4) P (B A) = P (A B) P (A) (5) P (A B) P (B A) P (A B) A B P

回帰分析 単回帰

²¾ÁÛ¾õ¶·É¾²ÁË¡¤Î¤¿¤á¤Î¥Ñ¥Ã¥±¡¼¥¸DCchoice ¡Ê»ÃÄêÈÇ¡Ë

L P y P y + ɛ, ɛ y P y I P y,, y P y + I P y, 3 ŷ β 0 β y β 0 β y β β 0, β y x x, x,, x, y y, y,, y x x y y x x, y y, x x y y {}}{,,, / / L P / / y, P

N cos s s cos ψ e e e e 3 3 e e 3 e 3 e

q( ) 2: R 2 R R R R C:nProgram FilesnRnrw1030) [File] [Change Dir] c:ndatadir OK 2

J1順位と得点者数の関係分析

.. F x) = x ft)dt ), fx) : PDF : probbility density function) F x) : CDF : cumultive distribution function F x) x.2 ) T = µ p), T : ) p : x p p = F x

201711grade1ouyou.pdf

k3 ( :07 ) 2 (A) k = 1 (B) k = 7 y x x 1 (k2)?? x y (A) GLM (k

Rikon. # x y Rikon Zouka. Ninzu -.3 Kaku. Tomo -1.7 Tandoku.7 X5Sai -1. Kfufu -.5 Ktan.13 Konin.7 # x Zouka Ninzu

tokei01.dvi

放射線専門医認定試験(2009・20回)/HOHS‐01(基礎一次)

R Console >R ˆ 2 ˆ 2 ˆ Graphics Device 1 Rcmdr R Console R R Rcmdr Rcmdr Fox, 2007 Fox and Carvalho, 2012 R R 2

y = x 4 y = x 8 3 y = x 4 y = x 3. 4 f(x) = x y = f(x) 4 x =,, 3, 4, 5 5 f(x) f() = f() = 3 f(3) = 3 4 f(4) = 4 *3 S S = f() + f() + f(3) + f(4) () *4

ii 3.,. 4. F. ( ), ,,. 8.,. 1. (75% ) (25% ) =7 24, =7 25, =7 26 (. ). 1.,, ( ). 3.,...,.,.,.,.,. ( ) (1 2 )., ( ), 0., 1., 0,.

151021slide.dvi

σ t σ t σt nikkei HP nikkei4csv H R nikkei4<-readcsv("h:=y=ynikkei4csv",header=t) (1) nikkei header=t nikkei4csv 4 4 nikkei nikkei4<-dataframe(n

1 R ID 1. ID Before After 1 X 1 Y 1 2 X 2 Y n 1 X n 1 Y n 1 n X n Y n. ID Group Measure. 1 1 Y 1... n 1 1 Y n1 n Y n n 0 Y n 1 E

日本内科学会雑誌第98巻第4号

日本内科学会雑誌第97巻第7号

he T N/ N/

知能科学:ニューラルネットワーク

知能科学:ニューラルネットワーク

untitled

ii 3.,. 4. F. (), ,,. 8.,. 1. (75%) (25%) =7 20, =7 21 (. ). 1.,, (). 3.,. 1. ().,.,.,.,.,. () (12 )., (), 0. 2., 1., 0,.

Z: Q: R: C: sin 6 5 ζ a, b

ii

2 / 39

6.1 (P (P (P (P (P (P (, P (, P.

1 variation 1.1 imension unit L m M kg T s Q C QT 1 A = C s 1 MKSA F = ma N N = kg m s 1.1 J E = 1 mv W = F x J = kg m s 1 = N m 1.

untitled

卓球の試合への興味度に関する確率論的分析

統計学のポイント整理

lec03

数学Ⅱ演習(足助・09夏)

R = Ar l B r l. A, B A, B.. r 2 R r = r2 [lar r l B r l2 ]=larl l B r l.2 r 2 R = [lar l l Br ] r r r = ll Ar l ll B = ll R rl.3 sin θ Θ = ll.4 Θsinθ

JMP V4 による生存時間分析

AR(1) y t = φy t 1 + ɛ t, ɛ t N(0, σ 2 ) 1. Mean of y t given y t 1, y t 2, E(y t y t 1, y t 2, ) = φy t 1 2. Variance of y t given y t 1, y t

chap9.dvi

9 5 ( α+ ) = (α + ) α (log ) = α d = α C d = log + C C 5. () d = 4 d = C = C = 3 + C 3 () d = d = C = C = 3 + C 3 =

kubostat2015e p.2 how to specify Poisson regression model, a GLM GLM how to specify model, a GLM GLM logistic probability distribution Poisson distrib

85 4

keisoku01.dvi

3.300 m m m m m m 0 m m m 0 m 0 m m m he m T m 1.50 m N/ N

untitled

6.1 (P (P (P (P (P (P (, P (, P.101

ii 3.,. 4. F. (), ,,. 8.,. 1. (75% ) (25% ) =9 7, =9 8 (. ). 1.,, (). 3.,. 1. ( ).,.,.,.,.,. ( ) (1 2 )., ( ), 0. 2., 1., 0,.

untitled

limit&derivative

1.2 R R Windows, Macintosh, Linux(Unix) Windows Mac R Linux redhat, debian, vinelinux ( ) RjpWiki ( RjpWiki Wiki

untitled

1. 1 A : l l : (1) l m (m 3) (2) m (3) n (n 3) (4) A α, β γ α β + γ = 2 m l lm n nα nα = lm. α = lm n. m lm 2β 2β = lm β = lm 2. γ l 2. 3

1. A0 A B A0 A : A1,...,A5 B : B1,...,B

2 1 1 α = a + bi(a, b R) α (conjugate) α = a bi α (absolute value) α = a 2 + b 2 α (norm) N(α) = a 2 + b 2 = αα = α 2 α (spure) (trace) 1 1. a R aα =

1. z dr er r sinθ dϕ eϕ r dθ eθ dr θ dr dθ r x 0 ϕ r sinθ dϕ r sinθ dϕ y dr dr er r dθ eθ r sinθ dϕ eϕ 2. (r, θ, φ) 2 dr 1 h r dr 1 e r h θ dθ 1 e θ h

(1) 3 A B E e AE = e AB OE = OA + e AB = (1 35 e ) e OE z 1 1 e E xy e = 0 e = 5 OE = ( 2 0 0) E ( 2 0 0) (2) 3 E P Q k EQ = k EP E y 0

1 Tokyo Daily Rainfall (mm) Days (mm)

卒業論文

基礎数学I

10:30 12:00 P.G. vs vs vs 2

I A A441 : April 15, 2013 Version : 1.1 I Kawahira, Tomoki TA (Shigehiro, Yoshida )

renshumondai-kaito.dvi

stat2_slides-13.key

chap10.dvi

r 1 m A r/m i) t ii) m i) t B(t; m) ( B(t; m) = A 1 + r ) mt m ii) B(t; m) ( B(t; m) = A 1 + r ) mt m { ( = A 1 + r ) m } rt r m n = m r m n B

ohpmain.dvi

mosaic Daniel Kaplan * 1 Nicholas J. Horton * 2 Randall Pruim * 3 Macalester College Amherst College Calvin College St. Paul, MN Amherst, MA Grand Rap

x y 1 x 1 y 1 2 x 2 y 2 3 x 3 y 3... x ( ) 2

kubostat2017e p.1 I 2017 (e) GLM logistic regression : : :02 1 N y count data or

<4D F736F F F696E74202D BD95CF97CA89F090CD F6489F18B4195AA90CD816A>

power.tex

Transcription:

Copright (c) 2004,2005 Hidetoshi Shimodaira 1. t- 2. 3. F 4. 1 t- 1.1 = β0 + β11 + + βpp + ɛ = Xβ + ɛ 2005-04-13 17:37:08 shimo S 2 ɛ ˆV ( ˆβi) = S 2 ɛ aii ŝe( ˆβi) = t-p- ti = ˆβi ŝe( ˆβi), ˆV ( ˆβi) = Sɛ aii pi = 2 PrT > ti T n p 1 t- βi = 0 ti t- βi = 0 βi 0 α α = 0.05 0.01 pi < α βi 0 pi α βi = 0 βi = 0 pi [0, 1] pi < α βi 0 α β ˆβ = (X X) 1 X ɛ Nn(0, In) ˆβ Np+1(β, (X X) 1 ) (p + 1) (p + 1) A A = (aij) = (X X) 1, i, j = 0,..., p ˆβi N(βi, aii) V ( ˆβi) = aii ˆ ɛ Sɛ 2 1 n = e 2 i = e 2 n p 1 n p 1 β0, β1,..., βp p+1 n (p+1) 1 α α βi = 0 βi 0 βi 0 βi = 0 t- n p 1 t- # run0073.r # t- # fit <- lm(~.,data.frame(,)) # be <- coef(summar(fit)) # cat("# t-p- \n"); print(be) tval <- be[,"t value"] # t- m <- ma(abs(tval))+0.5; 0 <- seq(-m,m,len=300) # plot(0,dt(0,df.residual(fit)),tpe="l", lab="t-statistic",lab="densit") # t- rug(tval); tet(tval,0.01,names(tval),srt=90,adj=0) dev.cop2eps(file="run0073-d.eps") pv0 <- 2*pt(abs(0),df.residual(fit),lower.tail=F) # p- plot(0,pv0,tpe="l",log="",lab="t-statistic",lab="p-value") 2 abline(h=c(0.01,0.05),lt=3) rug(tval); tet(tval,pv0[1]*1.1,names(tval),srt=90,adj=0) dev.cop2eps(file="run0073-t.eps") > dat <- read.table("dat0002.tt") # (47 10 ) > <- dat[,-10]; <- dat[,10] > source("run0073.r") # t-p- Estimate Std. Error t value Pr(> t ) (Intercept) 15.89979396 6.030963886 2.6363603 0.012175936 Zouka 1.35299378 0.536658497 2.5211448 0.016136162 Ninzu -3.26224134 1.066976341-3.0574636 0.004131840 Kaku -0.02794050 0.036376197-0.7680984 0.447303442 Tomo -0.01586652 0.009506475-1.6690220 0.103554493 Tandoku -0.11120432 0.052370545-2.1234134 0.040470167 X65Sai 0.03597994 0.035336821 1.0181996 0.315195078 Kfufu -0.20127472 0.058708226-3.4283904 0.001504385 Ktan 0.10625525 0.053686459 1.9791816 0.055273358 Konin -0.08330936 0.113092312-0.7366492 0.465981104 n Z1,..., Zn N(0, 1) n n Zi 2 χ 2 n n ɛ 2 i σ 2 χ2 n ei N(0, (1 hii)) ei/σ N(0, 1) n n p 1 e 2 i χ2 n e = (In H)ɛ e 2 = ɛ (In H) 2 ɛ = ɛ (In H)ɛ = ɛ 2 ɛ Hɛ H Span(0,..., p) In H n n U = (u1, u2,..., un) H = (u1,..., up+1)(u1,..., up+1) densit 0.0 0.1 0.2 0.3 0.4 Kfufu Ninzu Tandoku Tomo Kaku Konin 4 2 0 2 4 X65Sai Ktan (Intercept) Zouka p value 5e 04 5e 03 5e 02 5e 01 Kfufu Ninzu Tandoku Tomo Kaku Konin 4 2 0 2 4 X65Sai Ktan (Intercept) Zouka In H = (up+2,..., un)(up+2,..., un) zi = u iɛ/σ, i = 1,..., n N(0, 1) ɛ Hɛ/ = z 2 1 + + z 2 p+1 χ 2 p+1 e 2 / = z 2 p+2 + + z 2 n χ 2 n p 1 ˆβ z1,..., zp+1 zp+2,..., zn ˆβ S 2 ɛ = e 2 /(n p 1) t statistic run0073-d t statistic run0073-t 1. ˆβ 1.2 t- ˆβ Np+1(β, (X X) 1 ) (p + 1) (p + 1) A n p 1 n e 2 i σ 2 χ2 n p 1 3 A = (aij) = (X X) 1, i, j = 0,..., p ˆβi N(βi, aii) 4

2. S 2 ɛ 3. ˆβ S 2 ɛ (n p 1)S 2 ɛ χ 2 n p 1 Z N(0, 1) S 2 χ 2 n T = Z/ S 2 /n n t- Z T = S2 /n t- n ( ) (n+1)/2 Γ((n + 1)/2) f(t) = 1 + t2 nπγ(n/2) n ˆβi βi σ aii R t- βi = 0 ˆβi βi σ N(0, 1) aii (n p 1)S 2 ɛ χ 2 n p 1 / Sɛ 2 σ = ˆβi βi t- 2 n p 1 aiisɛ ˆβi βi ŝe( ˆβi) t- n p 1 ti = ˆβi ŝe( ˆβi) ti t- n p 1 βi > 0 t- ti βi < 0 t- ti 1.3 # run0074.r # source("run0044.r") # drawhist 5 #n <- 11 # #be <- c(0,1) # (beta0,beta1) # <- seq(0,1,len=n) # #filename <- "run0074-" p <- length(be)-1 # p+1 X <- cbind(1,) # colnames(x) <- names(be) <- c("beta0","beta1") A <- solve(t(x) %*% X) # A=(X X)^-1 B <- A %*% t(x) # B = (X X)^-1 X sqra <- sqrt(diag(a)) # A func0074 <- function() be <- B %*% # pred <- X %*% be # resid <- -pred # se2 <- sum(resid^2)/(n-p-1) # sigma^2 se <- sqrt(se2) # sigma tval <- be/(se*sqra) # t- pval <- 2*pt(abs(tval),n-p-1,lower.tail=F) # p- list(be=be,se2=se2,tval=tval,pval=pval) 0 <- X %*% be # ) sigma0 <- 0.3 # cat("# start simulation: ",date(),"\n") b <- 10000 # sim <- matri(0,n,b) # simbe <- matri(0,length(be),b) # simse2 <- rep(0,b) # se2 simtval <- matri(0,length(be),b) # t simpval <- matri(0,length(be),b) # p- for(i in 1:b) sim[,i] <- 0 + rnorm(n,mean=0,sd=sigma0) fit <- func0074(sim[,i]) simbe[,i] <- fit$be; simse2[i] <- fit$se2 simtval[,i] <- fit$tval; simpval[,i] <- fit$pval cat("# end simulation: ",date(),"\n") cat("# \n"); print(func0074(sim)) cat("# pval0 < 0.05 = ",sum(simpval[1,]<0.05),"\n") cat("# pval1 < 0.05 = ",sum(simpval[2,]<0.05),"\n") if(!is.null(filename)) plot(,0); abline(be) dev.cop2eps(file=paste(filename,"s0.eps",sep="")) 6 plot(,sim); abline(simbe) dev.cop2eps(file=paste(filename,"s1.eps",sep="")) plot(simbe[1,],simbe[2,],pch=".",lab="beta0",lab="beta1") dev.cop2eps(file=paste(filename,"th1.eps",sep="")) plot(simse2,simbe[2,],pch=".",lab="se2",lab="beta1") dev.cop2eps(file=paste(filename,"th2.eps",sep="")) for(i in 1:2) drawhist(simtval[i,],30,paste("tval",i-1,sep="")) t0 <- seq(min(simtval[i,]),ma(simtval[i,]),len=300) lines(t0,dt(t0,n-p-1),col=4,lt=2) dev.cop2eps(file=paste(filename,"tval",i-1,".eps",sep="")) drawhist(simpval[i,],20,paste("pval",i-1,sep=""),filename) 0 run0074-s0 sim[, 1] run0074-s1 > n <- 11 # > be <- c(0,1) # (beta0,beta1) > <- seq(0,1,len=n) # > filename <- "run0074-" > source("run0074.r") # start simulation: Wed Oct 6 11:45:50 2004 # end simulation: Wed Oct 6 11:45:52 2004 # $be beta0-0.01592933 beta1 0.98142543 $se2 [1] 0.05716638 $tval beta0-0.1181108 beta1 4.3051007 $pval beta0 0.908573939 beta1 0.001975822 # pval0 < 0.05 = 504 # pval1 < 0.05 = 8748 beta1 0.0 0.5 1.0 1.5 2.0 0.0 0.1 0.2 0.3 0.4 0.6 0.4 0.2 0.0 0.2 0.4 0.6 beta0 run0074-th1 tval0 5 0 5 beta1 0.0 0.5 1.0 1.5 2.0 0.00 0.25 0.30 0.00 0.25 0.30 0.35 se2 run0074-th2 tval1 0 5 10 15 mean= 0.0082902, sd= 1.1296 run0074-tval0 mean= 3.83, sd= 1.5606 run0074-tval1 7 8

pval0 mean= 0.50308, sd= 0.28884 run0074-pval0 0 5 10 15 pval1 mean= 0.024496, sd= 0.058195 run0074-pval1 b = 10000 2 Pentium 1G vmware linu ) 2 2.1 v = w0β0 + w1β1 + + wpβp ˆv = w0 ˆβ0 + w1 ˆβ1 + + wp ˆβp w = (w0, w1,..., wp) ˆv wj = 1, wk = 0, k j ˆv = ˆβj w0 = i0,... wp = ip ˆv = ŷi v = w β, ˆv = w ˆβ ˆβ Np+1(β, (X X) 1 ) ˆv N(w β, w (X X) 1 w) 2.2 S 2 ɛ v = w β, ˆv N(v, v) v = w (X X) 1 w ˆσ v 2 = Sɛ 2 w (X X) 1 w = Sɛ 2 σv 2 σ = e 2 / 2 n p 1 σ2 v (n p 1) ˆσ2 v χ 2 σv 2 n p 1 ˆv v σv / ˆσ v 2 t- σv 2 n p 1 ˆv v t- n p 1 R n p 1 t- T t pt(t) PrT t = pt(t,n-p-1) PrT > t = 1-pt(t,n-p-1) = pt(t,n-p-1,lower.tail=f) t- βi = 0 v = v0 v v0 p- ( ( )) p- (v0) = Pr T > ˆv v0 ˆv v0 = 2 1 pt v = v0 p- (v0) p- (v0) < α v = v0 v v0 v ˆv v σv / ˆσ v 2 t- σv 2 n p 1 p- (v) [0, 1] Prp- (v) < α = α 9 10 2.3 v = v0 p- (v0) < α v = v0 v v0 v0 (1 α) = v0 p- (v0) α 1 α α = 0.05 0.95 v 1 α Prv (1 α) = 1 α v (1 α)p- (v) α p- (v) [0,1] 1 α a = pt(b) b = qt(a) n p 1 t- T t a t qt(a) R PrT qt(a) = a qt(a) = qt(a,n-p-1) 0 < a < 1 ( ( )) ˆv v0 2 1 pt α ( ) ˆv v0 pt 1 α/2 ˆv v0 qt(1 α/2) ˆv qt(1 α/2) v0 ˆv + qt(1 α/2) (1 α) = [ˆv qt(1 α/2), 11 ˆv + qt(1 α/2)] 2.4 βj w wj = 1, wk = 0, k j ˆv = ˆβj V (ˆv) ˆσ v 2 = Sɛ 2 w (X X) 1 w = Sɛ 2 ajj βj (1 α) = [ ˆβj Sɛ ajjqt(1 α/2), ˆβj + Sɛ ajjqt(1 α/2)] 2.5 = (1, 1,..., p) ŷ = ˆβ = β v = w = V (ˆv) ˆσ v 2 = Sɛ 2 w (X X) 1 w = Sɛ 2 (X X) 1 (1 α) = [ ˆβ Sɛ (X X) 1 qt(1 α/2), ˆβ+Sɛ (X X) 1 qt(1 α/2)] # run0075.r # # = = p= # a <- func0075a(,p); b <- func0075b(,0.05,a) pow <- function(a,p) a^(0:p) # c(a^0,a^1,...,a^p) calcx <- function(,p) # X X <- matri(0,length(),p+1) for(i in 1:length()) X[i,] <- pow([i],p) X calcq0 <- function(n,p,alpha) qt(1-alpha/2,n-p-1) # calcq1 <- function(n,p,alpha) sqrt((p+1)*qf(1-alpha,p+1,n-p-1)) # func0075a <- function(,p) # X <- calcx(,p) # colnames(x) <- paste("beta",0:p,sep="") A <- solve(t(x) %*% X) # A=(X X)^-1 B <- A %*% t(x) # B = (X X)^-1 X sqra <- sqrt(diag(a)) # A 0 <- seq(min(),ma(),len=300) # 300 12

X0 <- calcx(0,p) # 0 sqrxax <- appl(x0,1,function() sqrt(t() %*% A %*% )) # sqrt( A) list(x=x,a=a,b=b,sqra=sqra,0=0,x0=x0,sqrxax=sqrxax) func0075b <- function(,alpha,a,calcq=calcq0) # n <- nrow(a$x); p <- ncol(a$x)-1 q0 <- calcq(n,p,alpha) be <- a$b %*% # pred <- a$x %*% be # resid <- -pred # rss <- sum(resid^2) # se2 <- rss/(n-p-1) # sigma^2 se <- sqrt(se2) # sigma bese <- se*a$sqra # rsq <- 1-rss/sum((-mean())^2) # tval <- be/(se*a$sqra) # t- pval <- 2*pt(abs(tval),n-p-1,lower.tail=F) # p- beconf <- cbind(be-q0*se*a$sqra,be+q0*se*a$sqra) # pred0 <- a$x0 %*% be # (0) pred0conf <- cbind(pred0-q0*se*a$sqrxax,pred0+q0*se*a$sqrxax) list(be=be,bese=bese,se=se,rss=rss,rsq=rsq,tval=tval,pval=pval, beconf=beconf,pred0=pred0,pred0conf=pred0conf) func0075c <- function(,,a,b,col=2,lt=2,add=f) if(!add) plot(,) lines(a$0,b$pred0) lines(a$0,b$pred0conf,col=col,lt=lt) lines(a$0,b$pred0conf[,2],col=col,lt=lt) coef <- cbind(b$be,b$bese,b$tval,b$pval,b$beconf) colnames(coef) <- c("estimate","stderr", "t-value","p-value","lower","upper") invisible(list(coef=coef,se=b$se,rss=b$rss,rsq=b$rsq)) # run0076.r # p # = = p= source("run0075.r") for(i in 0:p) cat("# =",i,"\n") a <- func0075a(,i) c1 <- func0075c(,,a,func0075b(,0.05,a)) c2 <- func0075c(,,a,func0075b(,0.01,a),col=3,add=t) colnames(c1$coef)[5:6] <- c("lo05","up05") colnames(c2$coef)[5:6] <- c("lo01","up01") coef <- cbind(c1$coef,c2$coef[,5:6,drop=f]) cat("rss=",c1$rss,", RSQ=",c1$rsq,"\n") print(round(coef,3)) dev.cop2eps(file=paste("run0076-s",i,".eps",sep="")) > dat <- read.table("dat0001.tt") # (47 2 ) > <- dat/100; <- dat[,2] > p <- 3 > source("run0076.r") # = 0 RSS= 0.815383, RSQ= 0 Estimate StdErr t-value p-value Lo05 Up05 Lo01 Up01 beta0 1.473 0.019 75.848 0 1.434 1.512 1.421 1.525 # = 1 RSS= 0.3812667, RSQ= 0.5324078 Estimate StdErr t-value p-value Lo05 Up05 Lo01 Up01 beta0 1.742 0.040 43.592 0 1.662 1.823 1.635 1.850 beta1-2.825 0.395-7.158 0-3.620-2.030-3.886-1.763 # = 2 RSS= 0.3786836, RSQ= 0.5355758 Estimate StdErr t-value p-value Lo05 Up05 Lo01 Up01 beta0 1.681 0.120 14.043 0.000 1.440 1.922 1.359 2.003 beta1-1.672 2.141-0.781 0.439-5.987 2.642-7.436 4.091 beta2-4.698 8.576-0.548 0.587-21.983 12.586-27.788 18.391 # = 3 RSS= 0.3747561, RSQ= 0.5403926 Estimate StdErr t-value p-value Lo05 Up05 Lo01 Up01 beta0 1.462 0.347 4.212 0.000 0.762 2.162 0.527 2.398 beta1 4.415 9.320 0.474 0.638-14.381 23.211-20.704 29.534 beta2-56.680 77.913-0.727 0.471-213.807 100.447-266.664 153.304 beta3 135.499 201.844 0.671 0.506-271.559 542.557-408.492 679.491 13 14 3. p = 2 β0 = 0 β0 0 β1 = 0 0.439 β1 = 0 β2 = 0 p = 0 4. p = 3 β0 0 β1 = β2 = β3 = 0 p = 0 t- p = 1 p = 2 β2 = 0 or β2 0 3 F run0076-s0 run0076-s1 3.1 p = β0 + β11 + + βpp + ɛ = Xβ + ɛ k p k = β0 + β11 + + βkk + ɛ = X (k) β (k) + ɛ run0076-s2 run0076-s3 X (k) = (0, 1,..., k), β (k) = (β0, β1,..., βk) p = 0, 1, 2, 3 α = 0.05 α = 0.01 RSS = e 2 R 2 p = 1 t 1. p = 0 β0 = 0 β0 0 p 0 X ( k) = (k+1, k+2,..., p), β ( k) = (βk+1, βk+2,..., βp) [ ] X = (X (k), X ( k) ), β = β (k) β ( k) β (k) ˆβ (k) = (X (k) X (k) ) 1 X (k) i = 0,..., k ˆβ (k) i ˆβ i 2. p = 1 β0 = 0 β0 0 β1 0 p 1 15 16

p βk+1 = βk+2 = = βp = 0 k k k Span(0,..., k) Span(0,..., p) 3.2 F (k) (k) (k) H (k) = X (k) (X (k) X (k) ) 1 X (k) ŷ (k) = H (k) e (k) = ŷ (k) = (In H (k) ) RSS (k) = e (k) 2 RSS (k) χ 2 n k 1 RSS (k) RSS (p) = e(k) 2 e(p) 2 χ 2 p k RSS (p) χ 2 n p 1 (k) RSS(k) σ 2 (k) F = (RSS(k) RSS (p) )/(p k) RSS (p) /(n p 1) p k n p 1 p k, n p 1 F F Fp k,n p 1 17 F R pf X m, n F PrX = pf(, m, n) PrX > = 1 pf(, m, n) = pf(, m, n, lower.tail = F) pf qf PrX qf(a, m, n) = a # run0077.r # fkentei <- function(fitk,fitp) rssk <- sum(resid(fitk)^2) # RSS^(k) rssp <- sum(resid(fitp)^2) # RSS^(p) dfk <- df.residual(fitk) # n-k-1 dfp <- df.residual(fitp) # n-p-1 bunshi <- (rssk - rssp)/(dfk-dfp) # (RSS^(k)-RSS^(p))/(p-k) bunbo <- rssp/dfp # RSS^(p) / (n-p-1) fvalue <- bunshi/bunbo # F pvalue <- pf(fvalue,dfk-dfp,dfp,lower.tail=f) list(fvalue=fvalue,df1=dfk-dfp,df2=dfp,pvalue=pvalue) > source("run0077.r") > # > dat <- read.table("dat0001.tt") # (47 2 ) > <- dat/100; <- dat[,2] > fit0 <- lm( ~ 1, data.frame(,)) # k=0 > fit1 <- lm( ~, data.frame(,)) # k=1 > fit2 <- lm( ~ + I(^2), data.frame(,)) # k=2 > summar(fit2) # k=2 Call: lm(formula = ~ + I(^2), data = data.frame(, )) Residuals: Min 1Q Median 3Q Ma -0.29411-0.04875-0.00797 0.04600 0.32282 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 1.6807 0.1197 14.043 <2e-16 *** 18-1.6725 2.1408-0.781 0.439 I(^2) -4.6985 8.5762-0.548 0.587 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Residual standard error: 0.09277 on 44 degrees of freedom Multiple R-Squared: 0.5356,Adjusted R-squared: 0.5145 F-statistic: 25.37 on 2 and 44 DF, p-value: 4.7e-08 > unlist(fkentei(fit0,fit2)) # F! fvalue df1 df2 pvalue 2.537048e+01 2.000000e+00 4.400000e+01 4.700319e-08 > unlist(fkentei(fit1,fit2)) # t (^2 )! fvalue df1 df2 pvalue 0.3001376 1.0000000 44.0000000 0.5865649 > sqrt(0.3001376) # t (^2 ) [1] 0.5478482 F (k) (p) 1. k = 0 = (p) F p-value < α p 2. k = p 1 p βp = 0 or βp 0 βp t- ( ) t 2 = F F t- 3.3 F F (k) (p) l1(θ1) l2(θ2) θ1 θ2 p1 = dim θ1,p2 = dim θ2 θ1 = (β0, β1,..., βk, σ) θ2 = (β0, β1,..., βp, σ) p1 = k + 2 p2 = p + 2 19 ˆθ1, ˆθ2 l1(ˆθ1) l2(ˆθ2) l1(ˆθ1) = l( ˆβ0, ˆβ1,..., ˆβk, ˆσ) = n 2 l2(ˆθ1) = l( ˆβ0, ˆβ1,..., ˆβp, ˆσ) = n 2 1 + log(2π RSS(k) ) n 1 + log(2π RSS(p) ) n 1 p2 p1 n # run0081.r # 2 (l2(ˆθ1) l1(ˆθ2)) χ 2 p2 p1 2 (l2(ˆθ1) l1(ˆθ2)) = n log RSS (k) n log RSS (p) χ 2 p k udohikentei <- function(fitk,fitp) rssk <- sum(resid(fitk)^2) # RSS^(k) rssp <- sum(resid(fitp)^2) # RSS^(p) dfk <- df.residual(fitk) # n-k-1 dfp <- df.residual(fitp) # n-p-1 n <- length(resid(fitk)) # n chisqvalue <- n*(log(rssk)-log(rssp)) # pvalue <- pchisq(chisqvalue,dfk-dfp,lower.tail=f) list(chisqvalue=chisqvalue,df=dfk-dfp,n=n,pvalue=pvalue) > dat <- read.table("dat0001.tt") # (47 2 ) > <- dat/100; <- dat[,2] > fit0 <- lm( ~ 1, data.frame(,)) # k=0 > fit1 <- lm( ~, data.frame(,)) # k=1 > fit2 <- lm( ~ + I(^2), data.frame(,)) # k=2 > source("run0081.r") > unlist(fkentei(fit0,fit2)) # F fvalue df1 df2 pvalue 2.537048e+01 2.000000e+00 4.400000e+01 4.700319e-08 > unlist(udohikentei(fit0,fit2)) # chisqvalue df n pvalue 3.604697e+01 2.000000e+00 4.700000e+01 1.487646e-08 > unlist(fkentei(fit1,fit2)) # F 20

fvalue df1 df2 pvalue 0.3001376 1.0000000 44.0000000 0.5865649 > unlist(udohikentei(fit1,fit2)) # chisqvalue df n pvalue 0.3195130 1.0000000 47.0000000 0.5719005 3.4 QR QR X = QR X n (p + 1) Q n (p + 1) R (p + 1) (p + 1) Q Q Q = Ip+1 R X X = R Q QR = R R (X X) 1 = (R R) 1 = R 1 R 1 QR ˆβ ˆβ = (X X) 1 X = R 1 R 1 R Q = R 1 Q Q ˆβ = R 1 (Q ) R 1 R Q R ˆβ R 1 > X <- matri(c(1^(1:5),2^(1:5),3^(1:5)),5,3) > X [,2] [,3] [1,] 1 2 3 [2,] 1 4 9 [3,] 1 8 27 [4,] 1 16 81 [5,] 1 32 243 > QR <- qr(x) > R <- qr.r(qr) > Q <- qr.q(qr) > R [,2] [,3] [1,] -2.236068-27.72724-162.33854 [2,] 0.000000 24.39672 197.92824 [3,] 0.000000 0.00000 29.99355 > Q [,2] [,3] 21 [1,] -0.4472136-0.4262868 0.4925791 [2,] -0.4472136-0.3443086 0.1516455 [3,] -0.4472136-0.1803521-0.3301785 [4,] -0.4472136 0.1475608-0.6936976 [5,] -0.4472136 0.8033866 0.3796515 > Q %*% R [,2] [,3] [1,] 1 2 3 [2,] 1 4 9 [3,] 1 8 27 [4,] 1 16 81 [5,] 1 32 243 > t(q) %*% Q [,2] [,3] [1,] 1.000000e-00-8.180305e-17 1.924459e-17 [2,] -8.180305e-17 1.000000e-00-1.428436e-17 [3,] 1.924459e-17-1.428436e-17 1.000000e-00 3.5 F U = (u1, u2,..., un) H (k) = (u1,..., uk+1)(u1,..., uk+1), k = 0,..., p 0, 1,..., p QR X = QR ui = q i, i = 1,..., p + 1 up+2,..., un (p) = Xβ + ɛ ɛ Nn(0, In) = X (k) β (k) + X ( k) β ( k) + ɛ (k) e (k) = (In H (k) ) = (In H (k) )(X ( k) β ( k) + ɛ) β ( k) = 0 e (k) = (In H (k) ) = (In H (k) )ɛ = (uk+2,..., un)(uk+2,..., un) ɛ 22 zi = u ɛ/σ, i = 1,..., n (k) e (k) 2 = z 2 k+2 + + z 2 n RSS (k) = e(k) 2 = z 2 k+2 + + zn 2 χ 2 n k 1 RSS (p) = e(p) 2 = z 2 p+2 + + zn 2 χ 2 n p 1 RSS (k) RSS (p) = e(k) 2 e(p) 2 = z 2 k+2 + + zp+1 2 χ 2 p k RSS (p) RSS (k) RSS (p) zi 4 4.1 1. ˆβ ˆβ Np+1(β, (X X) 1 ) F = z 2 /(p + 1) Sɛ 2 / Fp+1,n p 1 p + 1, n p 1 F z 2 = X( ˆβ β) 2 / F = X( ˆβ β) 2 (p + 1)S 2 ɛ ˆβ or 1 α (1 α) = qf(a) = qf(a, p + 1, n p 1) PrF qf(1 α) = 1 α β : (β ˆβ) (X X)(β ˆβ) (p + 1)Sɛ 2 qf(1 α) ˆβ Prβ (1 α) = 1 α 2. S 2 ɛ (n p 1)S 2 ɛ ˆβ (p + 1) (p + 1) R X X = R R χ 2 n p 1 QR X = QR R (X X) 1 = (R R) 1 = R 1 R 1 γ = Rβ, ˆγ = R ˆβ γ β = R 1 γ γ γ : γ ˆγ 2 (p + 1)Sɛ 2 qf(1 α) ˆγ Sɛ (p + 1)qf(1 α) p + 1 z = (z1, z2,..., zp+1) z = 1 σ R( ˆβ β) z E(z) = 0 V (z) = 1 σ Rσ2 (X X) 1 R 1 σ = RR 1 R 1 R = Ip+1 z Np+1(0, Ip+1) z1, z2,..., zp+1 N(0, 1) 23 # run0078.r # # = = source("run0075.r") p <- 1 # =1 n <- length() # alpha <- 0.05 # =1-alpha a <- func0075a(,p) # a$qr <- qr(a$x); a$r <- qr.r(a$qr) # QR a$ir <- solve(a$r) # R^-1 b0 <- func0075b(,alpha,a) # 24

th <- seq(0,2*pi,len=300) # 0..2*pi 300 ga <- b0$se*calcq1(n,p,alpha)*rbind(cos(th),sin(th)) # gamma be <- as.vector(b0$be) + a$ir %*% ga # beta plot(t(be),tpe="l") # points(t(b0$be)) # abline(v=b0$beconf[1,],lt=2,col=2) # beta0 abline(h=b0$beconf[2,],lt=2,col=2) # beta1 b1 <- func0075b(,alpha,a,calcq1) # calcq1 abline(v=b1$beconf[1,],lt=3,col=4) # beta0 abline(h=b1$beconf[2,],lt=3,col=4) # beta1 dev.cop2eps(file="run0078-b.eps") coef <- cbind(b0$be,b0$beconf,b1$beconf) colnames(coef) <- c("estimate","lo","up","jointlo","jointup") print(coef) > dat <- read.table("dat0001.tt") # (47 2 ) > <- dat/100; <- dat[,2] > source("run0078.r") Estimate Lo Up JointLo JointUp beta0 1.742483 1.661974 1.822993 1.641291 1.843676 beta1-2.824869-3.619719-2.030019-3.823917-1.825821 Prβ1 1 = 1 α Prβ0 0, β1 1 < 1 α Prβ0 0, β1 1 1 α β 4.2 v = w β (1 α) = [ˆv qt(1 α/2), ˆv + qt(1 α/2)] = Sɛ w (X X) 1 w beta1 3.5 3.0 2.5 2.0 qt(a) = qt(a,n-p-1) Prv (1 α) = 1 α w W Prw β w (1 α), w W 1 α 1.65 1.70 1.75 1.80 1.85 beta0 run0078-b β = (β0, β1) α = 0.05 Prβ = 1 α βi, i = 0, 1 t- ( ) β0 β1 W Prβ0 0, β1 1 1 α Prβ0 0 = 1 α 25 26 w (1 α) = [ˆv (p + 1)qf(1 α), ˆv + (p + 1)qf(1 α)] qf(a) = qf(a, p + 1, n p 1) # run0079.r # # = = p=, alpha source("run0075.r") func0079 <- function(,,a,alpha,output=t) b0 <- func0075b(,alpha,a) b1 <- func0075b(,alpha,a,calcq1) coef <- cbind(b0$be,b0$bese,b0$tval,b0$pval,b0$beconf,b1$beconf) colnames(coef) <- c("estimate","stderr","t-value","p-value", "Lo","Up","JointLo","JointUp") if(output) func0075c(,,a,b0); func0075c(,,a,b1,col=4,lt=3,add=t) cat("rss=",b0$rss,", RSQ=",b0$rsq,"\n") print(round(coef,3)) list(b0=b0,b1=b1,coef=coef) func0079b <- function(,,p,alpha,filename="run0079-") for(i in 0:p) cat("# =",i,"\n") a <- func0075a(,i) func0079(,,a,alpha) if(!is.null(filename)) dev.cop2eps(file=paste(filename,"s",i,".eps",sep="")) beta0 1.473 0.019 75.848 0 1.434 1.512 1.434 1.512 # = 1 RSS= 0.3812667, RSQ= 0.5324078 beta0 1.742 0.040 43.592 0 1.662 1.823 1.641 1.844 beta1-2.825 0.395-7.158 0-3.620-2.030-3.824-1.826 # = 2 RSS= 0.3786836, RSQ= 0.5355758 beta0 1.681 0.120 14.043 0.000 1.440 1.922 1.333 2.029 beta1-1.672 2.141-0.781 0.439-5.987 2.642-7.895 4.550 beta2-4.698 8.576-0.548 0.587-21.983 12.586-29.628 20.231 # = 3 RSS= 0.3747561, RSQ= 0.5403926 beta0 1.462 0.347 4.212 0.000 0.762 2.162 0.345 2.579 beta1 4.415 9.320 0.474 0.638-14.381 23.211-25.578 34.407 beta2-56.680 77.913-0.727 0.471-213.807 100.447-307.403 194.042 beta3 135.499 201.844 0.671 0.506-271.559 542.557-514.031 785.029 run0079-s0 run0079-s1 > source("run0079.r") > dat <- read.table("dat0001.tt") # (47 2 ) > <- dat/100; <- dat[,2] > p <- 3; alpha <- 0.05 > func0079b(,,p,alpha) # = 0 RSS= 0.815383, RSQ= 0 27 28

4.4 run0079-s2 α = 0.05 95% (Up,Lo) (JointLo,JointUp) 4.3 v = w β, w W run0079-s3 γ = Rβ, a = R 1 w v = w β = a γ (Schwartz) a ˆγ γ ˆv v = a (ˆγ γ) a ˆγ γ γ γ : γ ˆγ Sɛ (p + 1)qf(1 α) γ ˆv v a Sɛ (p + 1)qf(1 α) γ 1 α Pr ˆv v a Sɛ (p + 1)qf(1 α) 1 α a = w R 1 R 1 w = w (X X) 1 w w = v : ˆv v Sɛ w (X X) 1 w (p + 1)qf(1 α) # run0080.r # # run0074.r # run0075.r,run0079.r source("run0079.r") func0080a <- function(v) # check if v1 in [v2,v3] (v[1] >= v[2]) && (v[1] <= v[3]) func0080b <- function(be,00,d) coef <- cbind(be,d$coef) be0 <- appl(coef[,c("be","lo","up")],1,func0080a) be1 <- appl(coef[,c("be","jointlo","jointup")],1,func0080a) pred0 <- all(appl(cbind(00,d$b0$pred0conf),1,func0080a)) pred1 <- all(appl(cbind(00,d$b1$pred0conf),1,func0080a)) list(be0=be0,be1=be1,pred0=pred0,pred1=pred1) a <- func0075a(,p) # 00 <- a$x0 %*% be # 300 cat("# \n") d <- func0079(,sim,a,alpha) # abline(be,col=3,lt=4) # cat("# \n") esno1 <- unlist(func0080b(be,00,d)); print(esno1) dev.cop2eps(file="run0080-s1.eps") cat("# \n") esno <- matri(0,length(esno1),b) # rownames(esno) <- names(esno1) for(i in 1:b) d <- func0079(,sim[,i],a,alpha,output=f) esno[,i] <- unlist(func0080b(be,00,d)) print(esno[,1:5]) cat("# \n") print(appl(esno,1,sum)) > n <- 11 # > be <- c(0,1) # (beta0,beta1) > <- seq(0,1,len=n) # > filename <- NULL > source("run0074.r") 29 30 # start simulation: Wed Oct 6 11:46:43 2004 # end simulation: Wed Oct 6 11:46:45 2004 # $be beta0-0.1491011 beta1 1.3627077 $se2 [1] 0.04218202 $tval beta0-1.287003 beta1 6.958817 $pval beta0 2.302066e-01 beta1 6.619553e-05 # pval0 < 0.05 = 508 # pval1 < 0.05 = 8751 > alpha <- 0.05 > source("run0080.r") # RSS= 0.5859556, RSQ= 0.6594943 beta0-0.208 0.144-1.447 0.182-0.534 0.117-0.628 0.212 beta1 1.016 0.243 4.175 0.002 0.465 1.566 0.306 1.726 # be0.beta0 be0.beta1 be1.beta0 be1.beta1 pred0 pred1 TRUE TRUE TRUE TRUE FALSE TRUE # [,2] [,3] [,4] [,5] be0.beta0 1 1 1 1 1 be0.beta1 1 1 1 0 1 be1.beta0 1 1 1 1 1 be1.beta1 1 1 1 1 1 pred0 0 1 1 0 1 pred1 1 1 1 1 1 # be0.beta0 be0.beta1 be1.beta0 be1.beta1 pred0 pred1 9498 9453 9835 9825 8736 9519 31 > sum(esno[1,] & esno[2,]) # joint0 [1] 9235 > sum(esno[3,] & esno[4,]) # joint1 [1] 9750 0.2 0.0 0.2 0.4 0.6 0.8 run0080-s1 β0 = 0, β1 = 1 10000 =95%9500 #β0 0 = 9498 #β1 1 = 9453 β0 β1 95% #β0 0, β1 1 = 9235 92% #β0 0, β1 1 = 9750 98% #β0 + β1 (), = 9500, #β0 + β1 (), = 8736 87%95% 32

#β0 + β1 (), = 9519 95% 5 5.1 7-1 X2000 p = 1, 2, 3 t p- p, 95% 95% 5.2 7-2 run0075.r calcq0 <- function(n,p,alpha) qt(1-alpha/2,n-p-1) # calcq1 <- function(n,p,alpha) sqrt((p+1)*qf(1-alpha,p+1,n-p-1)) # n = 30, α = 0.05 calcq0(n,p,alpha) calcq1(n,p,alpha) p = 0, 1,..., 10 =p= ) ( ) 33