프로빗 모형

Probit Model

이항 (binary) 또는 ordered 결과 변수의 확률을 표준정규 누적분포 (CDF) 로 모형화하는 이산선택 모형 의 한 갈래. logit 과 함께 응용 미시계량의 표준 도구이고, error 가 정규분포라는 가정 덕에 selection-bias 보정 (heckman-selection-model) · 잠재변수 (latent variable) 표현 · 다변량 확장 (bivariate / multinomial probit) 으로 자연스럽게 일반화된다. 정책효과 평가·기업 행동 결정요인·기술 채택 결정 분석에서 광범위하게 쓰인다.

유형: 모수, 확률모형, latent variable 기반
핵심 가정: latent variable 의 normal error, exogeneity (regressor 와 error 의 독립), 함수형 specification 의 정확성
주요 변형: binary probit, ordered probit, multinomial probit (시뮬레이션 필요), bivariate probit, heckman-selection-model, dynamic probit, panel probit (random / fixed effects)

개요

probit 의 통계적 기원은 Bliss (1934) 의 toxicology 응용 (“probability unit”) 이고, 회귀 framework 으로의 통합은 Goldberger (1964) 의 Econometric Theory 와 Tobin (1958) 의 tobit 모형을 거쳐 1970 년대 응용 미시계량의 standard 로 자리잡았다. Heckman (1976, 1979) 의 selection model 이 probit 의 latent variable interpretation 을 활용해 두 단계 selection 보정을 가능하게 한 게 결정적이었다. ordered probit (McKelvey-Zavoina 1975) 은 ordinal outcome (예: 신용등급, 정책 등급) 으로, bivariate probit 은 두 이항 결정 (예: R&D 와 특허, 채택과 사용) 의 공동 결정으로 확장된다. logit 과 비교하면 (i) error 분포가 정규라 다변량 확장이 깔끔하지만 (ii) closed-form 이 아니라 simulation/maximum likelihood 가 필요하고, (iii) 실용적 marginal effect 는 logit 과 거의 같은 결과를 준다. 그래서 어떤 ratio 가 필요한가 (예: WTP) 에 따라 둘 중 선택이 결정된다.

핵심 식·정의

latent variable $y^* = X\beta + \epsilon$ , $\epsilon \sim N(0,1)$ 일 때, 관측되는 이항 결과 $y = \mathbb{1}\{y^* > 0\}$ 의 확률은 다음과 같다.

\Pr(y = 1 \mid X) = \Phi(X\beta)

여기서 $\Phi(\cdot)$ 은 표준정규 누적분포함수다. ordered probit 은 잠재변수를 threshold $\kappa_1 < \kappa_2 < \cdots < \kappa_{J-1}$ 로 분할해 categorical 결과 $j \in \{1, \ldots, J\}$ 의 확률을 모형화한다.

\Pr(y = j \mid X) = \Phi(\kappa_j - X\beta) - \Phi(\kappa_{j-1} - X\beta)

Heckman selection 은 두 식을 결합한다 — selection equation $z^* = W\gamma + u$ (probit) 와 outcome equation $y = X\beta + \epsilon$ 의 error 가 상관 ( $\rho$ ) 일 때, inverse Mills ratio $\lambda = \phi(W\gamma)/\Phi(W\gamma)$ 를 outcome equation 에 통제변수로 넣어 selection bias 를 보정한다. bivariate probit 은 두 binary outcome 의 잠재변수 error 가 bivariate normal 일 때 likelihood 를 동시 최대화한다.

TEMEP 라인

황준석 · Inha Oh 의 정책효과 평가 라인이 probit 의 중심 적용이다. Evaluation of credit guarantee policy using propensity score matching 가 신용보증 사업 참여 결정 자체의 selection 을 probit 으로 모형화하고 그 결과를 heckman-selection-model 로 outcome 보정에 활용한다. Comparison of Effects from Different Institutions: Public Credit Guarantee in Korea 은 그 selection equation 의 추정 결과를 성향점수매칭 의 성향점수로 재사용한다. Economic Impact Assessment of the Government-led Venture Firm Certification Policy 도 정부 R&D 사업 참여 결정의 probit · 후속 outcome 효과 평가의 결합 구조다.

Taewon Kang 라인은 기업 성장·지식 축적 결정요인의 probit 적용이다. R&D activities for becoming a high-growth firm through large jumps: evidence from Korean manufacturing 가 high-growth firm 진입 결정의 binary probit 추정으로 기업 활동 (innovation·export·M&A) 의 marginal contribution 을 식별한다. Effects of knowledge accumulation strategies through experience and experimentation on firm growth 도 기업 R&D 결정의 probit 추정을 포함한다. Role of production in fostering innovation 의 product-technology space 진입 결정, A Study on Factors Affecting Patent Citation Counts and Patent Licensing in Chemistry Field 와 Technology strategy for enhancing the public-to-private technology transfer: evidence from the duration of patent 의 특허 산출·기술 전략 결정요인 분석도 probit 의 응용이다. 이 라인 전반에서 probit 은 binary 결정 의 표준 1 단계 모형이고, 그 결과는 성향점수매칭 · 이중차분 · heckman-selection-model 의 input 으로 재사용된다.

인접 그래프

1-hop 이웃 31개

인물 3
방법론 15
주제 2
분류 2
논문 9

휠 = 확대/축소 · 드래그 = 이동 · hover = 강조 · 클릭 = 페이지 이동

프로빗 모형

개요

핵심 식·정의

TEMEP 라인

See also

인접 그래프

이 문서를 가리키는 페이지

논문 (9)

방법론 (13)

주제 (2)