Sunday, May 9, 2010

The ultimate dataset to life, universe and everything

Pippa Noris has the ultimate dataset to life, universe and everything. The 40 MB (!) dataset contains basically ALL data EVER used to run cross-country regressions (Polity IV, Freedom House, PWT, Maddison, Fractionalization, Legal Origin, WB etc. pp.). It contains over 600 variables for 191 countries from 1971 to 2007. If you have ever done some data mining and got terribly pissed off with matching all the different datasets that use different coding schemes for countries (GMY, DEU, GER for Germany etc.), well, here you have it all neatly in one file. Massive respect!

Codebook is here and data set (Stata) here. 

No comments:

Post a Comment