LIBNAME mydata "/courses/d1406ae5ba27fe300 " access=readonly;
2) These are the frequency tables: PDF linkDATA new; set mydata.gapminder;LABEL incomeperperson="Income per person"suicideper100TH="Suicide rate among 100,000 inhabitants";PROC SORT; by COUNTRY;PROC FREQ; TABLES incomeperperson suicideper100TH;RUN;
3) Results are a little mixed. Even though some rich countries have high suicide rates, there are poor countries with high suicide rates as well.
But since the purpose of this exercise was to analyze the frequency distribution of variables, for now it is sufficient to say that about half of the countries has a suicide rate of 7 or less (per 100,000 inhabitants), while the other half has a suicide rate of 8 to up to 35, so there are clear differences in this variable among countries.
Income per person also varies, with half of the countries with an income per person of 2,000 or less, and the other half with incomes of up to 105,147.
There is a total of 213 countries in the data set. Results from 23 countries are missing the in the income per person variable, and for 22 countries in the suicide rate. (But they are not the same countries, though - most of them have at least one of these two variables).
I chose to work with 2 variables, instead of 3, so that's why there is no third variable in the program.
Next steps: I'll try to subset the data in order to get only countries from specific regions of the world - something like "Europe X Africa", so it would be easier to work with the data, and results might pop out more clearly.
No comments:
Post a Comment