๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

Statistics/Bayesian Statistics

์‚ฌํ›„ํ‰๊ท (Posterior mean)๊ณผ ESS(Effective Sample Size)

๋ฌธ์ œ๋ฅผ ํ†ตํ•ด ์‚ฌํ›„ํ‰๊ท (Posterior mean)๊ณผ ESS(Effective Sample Size)์— ๋Œ€ํ•ด ์•Œ์•„๋ณด์ž.

 

๋ฌธ์ œ 1)

 

์‚ฌ์ „๋ถ„ํฌ๊ฐ€ ๋ฒ ํƒ€๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅด๊ณ  ๊ฐ€๋Šฅ๋„ ํ•จ์ˆ˜๊ฐ€ ๋ฒ ๋ฅด๋ˆ„์ด ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅผ ๋•Œ, ์‚ฌํ›„๋ถ„ํฌ์˜ ํ‰๊ท ๊ณผ ESS๋ฅผ ๊ตฌํ•˜์—ฌ๋ผ.

 

ํ’€์ด)

 

 

โ–ท ์‚ฌํ›„ํ‰๊ท ์€ ์‚ฌ์ „๋ถ„ํฌ์˜ ํ‰๊ท ๊ณผ ๋ฐ์ดํ„ฐ ํ‰๊ท ์˜ ๊ฐ€์ค‘ํ‰๊ท (Weighted average)์œผ๋กœ ๋‚˜ํƒ€๋‚ผ ์ˆ˜ ์žˆ๋‹ค. ๋ฐ์ดํ„ฐ ๊ฐ€์ค‘์น˜์˜ ๋ถ„์ž๋Š” ํ‘œ๋ณธํฌ๊ธฐ, ์‚ฌ์ „๋ถ„ํฌ ๊ฐ€์ค‘์น˜์˜ ๋ถ„์ž๋Š” alpha์™€ beta์˜ ํ•ฉ์ด๋‹ค. ์ด๋•Œ, ESS๋Š” ์‚ฌ์ „ํ‰๊ท  ๊ฐ€์ค‘์น˜์˜ ๋ถ„์ž์ธ alpha์™€ beta์˜ ํ•ฉ์ด๋‹ค. ์ฆ‰, ESS๋ž€ ์‚ฌ์ „ํ‰๊ท ์ด ์‚ฌํ›„ํ‰๊ท ์— ๋ฐ˜์˜๋˜๋Š” ๋น„์ค‘์„ ์ƒ˜ํ”Œ ๊ฐœ์ˆ˜๋กœ ๋‚˜ํƒ€๋‚ธ ๊ฒƒ์ด๋‹ค.

 

โ–ถ ESS๊ฐ€ ์ปค์ง€๋ฉด ์‚ฌํ›„ํ‰๊ท ์—์„œ ์‚ฌ์ „ํ‰๊ท ์˜ ๋น„์ค‘์ด ์ปค์ง€๊ณ  ๋ฐ์ดํ„ฐ ํ‰๊ท ์˜ ๋น„์ค‘์ด ์ค„์–ด๋“ ๋‹ค. ์ฆ‰, ์‚ฌ์ „์ •๋ณด๊ฐ€ ์‚ฌํ›„๋ถ„ํฌ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์ด ๋” ์ปค์ง„๋‹ค๊ณ  ๋ณผ ์ˆ˜ ์žˆ๋‹ค.

 

๋ฌธ์ œ 2)

 

์น™์ด‰ ๊ณผ์ž 1๊ฐœ์— ๋ฐ•ํ˜€ ์žˆ๋Š” ์ดˆ์ฝœ๋ › ๊ฐœ์ˆ˜๊ฐ€ ํฌ์•„์†ก ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅด๊ณ , ํฌ์•„์†ก ๋ถ„ํฌ์˜ ๋ชจ์ˆ˜๊ฐ€ ๊ฐ๋งˆ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅธ๋‹ค๊ณ  ํ•˜์ž. n๊ฐœ์˜ ์น™์ด‰ ๊ณผ์ž๋กœ๋ถ€ํ„ฐ ์ดˆ์ฝœ๋ › ๊ฐœ์ˆ˜์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ ์–ป์—ˆ๋‹ค. ์ด๋•Œ, ์น™์ด‰ ๊ณผ์ž 1๊ฐœ์— ๋ฐ•ํ˜€ ์žˆ๋Š” ์ดˆ์ฝœ๋ › ๊ฐœ์ˆ˜์˜ ์‚ฌํ›„ํ‰๊ท ์„ ๊ตฌํ•˜์—ฌ๋ผ.

 

ํ’€์ด)

 

 

โ–ท ์ด์ „ ๋ฌธ์ œ์˜ ๊ฒฐ๊ณผ์™€ ๊ฐ™์ด ์‚ฌํ›„ํ‰๊ท ์€ ์‚ฌ์ „๋ถ„ํฌ ํ‰๊ท ๊ณผ ๋ฐ์ดํ„ฐ ํ‰๊ท ์˜ ๊ฐ€์ค‘ํ‰๊ท ์ธ ๊ฒƒ์„ ์•Œ ์ˆ˜ ์žˆ๋‹ค.

 

โ–ท ESS๋Š” beta์™€ ๊ฐ™๋‹ค.

 

โ–ถ ์‚ฌ์ „๋ถ„ํฌ์— ๋ชจ์ˆ˜๋ฅผ ์–ด๋–ป๊ฒŒ ์„ค์ •ํ•˜๋Š๋ƒ์— ๋”ฐ๋ผ ESS๊ฐ€ ๊ฒฐ์ •๋˜๊ฒŒ ๋œ๋‹ค.

 

๋ฌธ์ œ 2-1)

 

[๋ฌธ์ œ 2]์—์„œ ์‚ฌ์ „๋ถ„ํฌ์˜ ๋ชจ์ˆ˜๋ฅผ ์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๋Œ€ํ•ด ๋…ผํ•˜์—ฌ๋ผ.

 

ํ’€์ด)

 

 

โ–ท ์ฒซ ๋ฒˆ์งธ ๋ฐฉ๋ฒ•์€ ์‚ฌ์ „ํ‰๊ท ์„ ๊ฐ€์ •ํ•˜์—ฌ ๋ชจ์ˆ˜๋ฅผ ์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์ด๋‹ค. ์ด๋•Œ, (a)๋Š” ์‚ฌ์ „๋ถ„์‚ฐ์„ ์ถ”๊ฐ€๋กœ ๊ฐ€์ •ํ•จ์œผ๋กœ์จ, (b)๋Š” ESS๋ฅผ ์ถ”๊ฐ€๋กœ ๊ฐ€์ •ํ•จ์œผ๋กœ์จ ๋‘ ๋ชจ์ˆ˜ alpha์™€ beta๋ฅผ ๊ตฌํ•  ์ˆ˜ ์žˆ๋‹ค.

 

โ–ท ๋‘ ๋ฒˆ์งธ ๋ฐฉ๋ฒ•์€ ์—„์ฒญ ์ž‘์€ epsilon์„ ๋ชจ์ˆ˜๋กœ ์ •ํ•˜์—ฌ ์‚ฌํ›„๋ถ„ํฌ์— ์˜ํ–ฅ์„ ๊ฑฐ์˜ ์ฃผ์ง€ ์•Š๋Š” ๋ชจํ˜ธ ์‚ฌ์ „๋ถ„ํฌ(Vague prior distribution)๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด๋‹ค. ์ด ์‚ฌ์ „๋ถ„ํฌ๋Š” ๋ถ„์‚ฐ์ด ์—„์ฒญ ํฌ๋ฏ€๋กœ ๋„“๊ฒŒ ํผ์ง„ ํ˜•ํƒœ๋ฅผ ๋„๊ณ ์žˆ์œผ๋ฉฐ, ์‚ฌ์ „ํ‰๊ท ์€ 1์ด์ง€๋งŒ ESS๊ฐ€ ์—„์ฒญ ์ž‘์•„ ์‚ฌํ›„ํ‰๊ท ์— ๊ฑฐ์˜ ์˜ํ–ฅ์„ ์ฃผ์ง€ ๋ชปํ•œ๋‹ค.

 


Reference:

"Bayesian Statistics: From Concept to Data Analysis," Coursera, https://www.coursera.org/learn/bayesian-statistics/.