Marriage is good for your income

For those of you who are into machine learning, here you can find a cool collection of databases to play around with your favorite algorithm. I choose one out of the available 200 and fit a logistic regression model. The idea is to see what kind of properties are common for those who earn above 50K a year. Our data is such that the “y” variable is binary. A value of 1 is given if the individual earns above 50K and 0 if below. We know many things about the individual. Level of education in years, age, is she married, where from, which sector is she working in, how many working hours per week, race, and more. We can fit logistic regression, which is quite standard for a binary dependent variable, and see which variables are important.

Continue reading

U.S. prison population

I recently finished reading: [asa mytpl]0312343493[/asa]The author writes that 6.6% of U.S. American residents will find themselves at some point in their life incarcerated, about 20 million people. A big number on anyone’s scale. You can also find disturbing figures in Wikipedia: figure. Are these facts misleading? we need to account for population growth. The prison population should naturally rise even if the proportion of crime in the general population is constant, since the population itself is growing. Here I show that these facts are NOT misleading, and that the system is indeed not fulfilling its purpose.

Continue reading

Flash Crash

In his book, “A demon of our design”: [asa mytpl]0471227277[/asa] Richard Bookstaber talks about the concept of coupled systems. These are systems where, once launched, are impossible to shut down. One such process is a plain take off. Once started, the pilot has no way back, he cannot stop after getting off the ground, so the only way is up. Well, in financial markets, up is generally considered good,

Continue reading