Don't wanna be here? Send us removal request.
Text
Visualizing Data
Objective
The overall ADDHEALTH dataset has 6504 observations. Since I am interested in the effect of adolescent employment on pursuit of higher education, I focused only on the adolescents who claim to have worked in the past 4 weeks. This cut my dataset down to 3687 observations representing only the teens that have worked. The variables of interest to me were: H1EE1, H1EE2, H1EE3, H1EE4, H1EE5, H1EE6, H1EE7, H1EE12, H1EE13, H1EE14. The Frequency Table results have these variables conveniently labeled.
SAS CODE
LIBNAME mydata "/courses/d1406ae5ba27fe300" access=readonly; data new; set mydata.addhealth_pds; LABEL H1EE1='Scale of 1-5, Want to go to College?' H1EE2='Scale of 1-5, How Likely is College for you?' H1EE3='Past 4 weeks, Work for Pay?' H1EE4='How many hours of work, non-summer week? (Group Responses 1-140)' H1EE5='Total income, non-summer week? (Group Responses 1-900)' H1EE6='How many hours of work, summer week? (Group Responses 1-99)' H1EE7='Total income, summer week? (Group Responses 1-900)' H1EE12='Live to Age 35?' H1EE13='Married by Age 25?' H1EE14='Killed by Age 21?'; /* My subset of the ADDHEALTH dataset, containing variables of interest. The "if" statement selects only the 3687 participants in the study who worked for pay in the last 4 weeks*/ if H1EE1 >=6 then H1EE1=.; if H1EE2 >=6 then H1EE2=.;
if H1EE3 =1; /* Only those who worked past four weeks */
if H1EE4 >=996 then H1EE4=.; if H1EE5 >=996 then H1EE5=.; if H1EE6 >=996 then H1EE6=.; if H1EE7 >=996 then H1EE7=.; if H1EE12 >=6 then H1EE12=.; if H1EE13 >=6 then H1EE13=.; if H1EE14 >=6 then H1EE14=.; /* If statements above remove "refused", "legitimate skip", "don't know", and "not applicable" values. */
if H1EE4 <= 20 then WORKHOURS=1; /* Part-Time */ else if H1EE4 <= 40 then WORKHOURS=2; /* Full-Time*/ else WORKHOURS =3; /* Overtime */
if H1EE5 <= 20 then INCOMELEVEL=1; /* $20 or Less */ else if H1EE5 <= 50 then INCOMELEVEL=2; /* $21-$50 */ else if H1EE5 <= 100 then INCOMELEVEL=3; /* $51-$100 */ else if H1EE5 <= 150 then INCOMELEVEL=4; /* $101-$150 */ else if H1EE5 <= 200 then INCOMELEVEL=5; /* $151-$200 */ else INCOMELEVEL=6; /* Above $200 */
if H1EE6 <= 20 then WORKHOURS_SUM=1; else if H1EE6 <= 40 then WORKHOURS_SUM=2; else WORKHOURS_SUM =3;
if H1EE7 <= 20 then INCOMELEVEL_SUM=1; else if H1EE7 <= 50 then INCOMELEVEL_SUM=2; else if H1EE7 <= 100 then INCOMELEVEL_SUM=3; else if H1EE7 <= 150 then INCOMELEVEL_SUM=4; else if H1EE7 <= 200 then INCOMELEVEL_SUM=5; else INCOMELEVEL_SUM=6; proc sort; by AID;
proc freq; TABLES H1EE1 H1EE2 H1EE3 H1EE4 WORKHOURS H1EE5 INCOMELEVEL H1EE6 WORKHOURS_SUM H1EE7 INCOMELEVEL_SUM H1EE12 H1EE13 H1EE14;
proc gchart; vbar H1EE2/ type=mean sumvar=H1EE1; Title 'Likelihood of College vs Desire for College'; Title2 'H1EE2 vs H1EE1';
proc gchart; vbar H1EE12/ type=mean sumvar=H1EE1; Title 'Life Expectancy vs Desire for College'; Title2 'H1EE12 vs H1EE1';
proc gchart; vbar H1EE13/ type=mean sumvar=H1EE1; Title 'Outlook on Marriage vs Desire for College'; Title2 'H1EE13 vs H1EE1';
proc gchart; vbar H1EE14/ type=mean sumvar=H1EE1; Title 'Extreme Life Circumstances vs Desire for College'; Title2 'H1EE14 vs H1EE1';
proc univariate; var H1EE4 H1EE5 H1EE6 H1EE7;
proc gchart; vbar WORKHOURS/discrete type=mean sumvar=H1EE1; Title 'Non-Summer Work Hours vs Desire for College'; Title2 'H1EE4 vs H1EE1';
proc gchart; vbar INCOMELEVEL/discrete type=mean sumvar=H1EE1; Title 'Non-Summer Income vs Desire for College'; Title2 'H1EE5 vs H1EE1';
proc gchart; vbar WORKHOURS_SUM/discrete type=mean sumvar=H1EE1; Title 'Summer Work Hours vs Desire for College'; Title2 'H1EE6 vs H1EE1';
proc gchart; vbar INCOMELEVEL_SUM/discrete type=mean sumvar=H1EE1; Title 'Summer Income vs Desire for COllege'; Title2 'H1EE7 vs H1EE1';
Results
All Frequency Tables, Graphs and relevant Summary Statistics are included in this section. Please follow the link below...
https://docs.google.com/document/d/1u7NJBatZ9cL3B8Z318Yc84dyt4LOTNHcQHfgqIPcnIE/edit?usp=sharing
Summary
Before making judgements it is important to understand the nature of the data we are working with.
Here are some key characteristics of the employed teens in this sample show the following:
1) Employed Teens have a relatively strong desire to go to college (70.32%)
2) Employed Teens have a high expectancy of going to college (56.36%)
3) Outside of the summertime, most employed teens only work part-time (79.98%)
4) Outside of the summertime, most employed teens only earn at most $100 a week (81.80%)
5) In the summertime, most employed teens work either part-time (51.69%) or full-time (39.30%)
6) In the summertime, most employed teens only earn at most $100 a week (58.18%), though a considerable amount earn between$101 and $200 a week (28.80%)
7) Most employed teens expect to live to age 35 (56.36%)
8) Most employed teens either believe there is a 50-50 chance of getting married by age 25 (35.27%), or it is pretty likely (30.98%)
9) Most employed teens either believe it is unlikely that they will be killed by age 21 (50.49%), or that it is probable but pretty unlikely (33.82%)
Our interest is in examining potential underlying factors that influence employed teens to want to go to college. For this reason, all variables, whether quantitative or qualitative, were compared to the variable H1EE1 (Desire for College).
The first comparison is the “Likelihood of College vs. Desire for College” of employed teens. We know already that in this sample, 56.36% of employed teens have a strong desire to go to college (see bullet 2 above). Our data seems to suggest that though the vast majority want to go to college, it is not a given that these teens will be able to attend a university. The corresponding graph for this relationship shows a positive association between likelihood of attendance and the desire to go to college. Meaning, those with high likelihood responses often want to go to college. However, the opposite is also true... Those with low likelihood responses often don’t want to go. Why is that?
Perhaps, those teens with low likelihood responses feel that there is no hope for them to attend. Maybe their parents cannot afford it, or they are limited in their dreams of higher learning because of the extreme circumstances of the realities of where they live. There can be a multitude of reasons. Since these are employed teens under consideration who don’t have large salaries, it is unlikely that this group of individuals is prone to delinquency. They probably just feel hopeless. It should be a normative goal of the government and society as a whole to provide a hopeful outlook to these working children. An investment in scholarships and grants towards these young adults may improve their desire to obtain a degree, which nowadays is a standard for achievement in this generation.
The second comparison is “Life Expectancy vs Desire for College” of employed teens. We know that 56.36% of employed teens expect to live beyond the age of 35 (see bullet 7 above). It turns out that there was no obvious relationship between life expectancy responses and whether or not the youth wanted to go to college. Life expectancy may not be an important factor in the decision to go to college for employed teens.
The third comparison is “Outlook on Marriage vs Desire for College” of employed teens. Most employed teens have a 50-50 outlook on being married by age 25. This variable was initially of interest because marriage requires planning and money for both parties. I figured that those with a plan to get married at an earlier age might also be more mature individuals, with their minds set on higher learning and achievement in order to obtain financial stability. This data suggest that the out look on marriage by age 25 is not a determining factor for desire to go to college.
The fourth comparison is “Extreme Life Circumstances vs Desire for College” of employed teens. The teens in this sample generally believe it is unlikely that they will be killed by age 21 (50.49%, see bullet 9). The data suggests that whether they believe that they will likely die by age 21 or not is of no consequence in determining their desire to go to college.
The fifth comparison is “Non-Summer Work Hours vs Desire for College” of employed teens. The Non-summer Work Hours variable was quantitative and thus broken up into three categories of Part-Time=1, Full-Time=2, and Overtime=3. We know that outside of the summertime, most employed teens work only Part-Time (79.98%, see bullet 3). The data suggests that the more hours teens work outside of the summer time, their desire for college slightly decreases. There may be a slightly negative relationship between these two variables.
The sixth comparison is “Non-Summer Income vs Desire for College” of employed teens. We know that outside of the summertime, most employed teens only earn at most $100 a week (81.80%, see bullet 4). The data suggests generally that the more money employed teens make per week, the lower their desire for college becomes. There may be a slightly negative relationship between these two variables.
The seventh comparison is “Summer Work Hours vs Desire for College” of employed teens. We know that In the summertime, most employed teens work either part-time (51.69%) or full-time (39.30%, see bullet 5). The data suggests that the more hours teens work during the summertime, their desire for college slightly decreases. There may be a slightly negative relationship between these two variables.
The eight and final comparison is “Summer Income vs Desire for College” of employed teens. We know that In the summertime, most employed teens only earn at most $100 a week (58.18%), though a considerable amount earn between$101 and $200 a week (28.80%, see bullet 6). The data suggest that there is relatively no association between adolescents summer income and their desire to go to college.
0 notes
Text
Managing Data
The overall ADDHEALTH dataset has 6504 observations. Since I am interested in the effect of adolescent employment on pursuit of higher education, I focused only on the adolescents who claim to have worked in the past 4 weeks. This cut my dataset down to 3687 observations representing only the teens that have worked. The variables of interest to me were: H1EE1, H1EE2, H1EE3, H1EE4, H1EE5, H1EE6, H1EE7, H1EE12, H1EE13, H1EE14. The Frequency Table results have these variables conveniently labeled.
My previous post, “First Program in SAS”, had output that did not account for missing data values or group responses into reasonable categories. This is a revised code for the project.
SAS CODE
LIBNAME mydata "/courses/d1406ae5ba27fe300" access=readonly; data new; set mydata.addhealth_pds; LABEL H1EE1='Scale of 1-5, Want to go to College?' H1EE2='Scale of 1-5, How Likely is College for you?' H1EE3='Past 4 weeks, Work for Pay?' H1EE4='How many hours of work, non-summer week? (Group Responses 1-140)' H1EE5='Total income, non-summer week? (Group Responses 1-900)' H1EE6='How many hours of work, summer week? (Group Responses 1-99)' H1EE7='Total income, summer week? (Group Responses 1-900)' H1EE12='Live to Age 35?' H1EE13='Married by Age 25?' H1EE14='Killed by Age 21?'; /* My subset of the ADDHEALTH dataset, containing variables of interest. The "if" statement selects only the 3687 participants in the study who worked for pay in the last 4 weeks*/ if H1EE1 >=6 then H1EE1=.; if H1EE2 >=6 then H1EE2=.;
if H1EE3 =1; /* Only those who worked past four weeks */
if H1EE4 >=996 then H1EE4=.; if H1EE5 >=996 then H1EE5=.; if H1EE6 >=996 then H1EE6=.; if H1EE7 >=996 then H1EE7=.; if H1EE12 >=6 then H1EE12=.; if H1EE13 >=6 then H1EE13=.; if H1EE14 >=6 then H1EE14=.; /* If statements above remove "refused", "legitimate skip", "don't know", and "not applicable" values. */
if H1EE4 <= 20 then WORKHOURS=1; /* Part-Time */ else if H1EE4 <= 40 then WORKHOURS=2; /* Full-Time*/ else WORKHOURS =3; /* Overtime */
if H1EE5 <= 20 then INCOMELEVEL=1; /* $20 or Less */ else if H1EE5 <= 50 then INCOMELEVEL=2; /* $21-$50 */ else if H1EE5 <= 100 then INCOMELEVEL=3; /* $51-$100 */ else if H1EE5 <= 150 then INCOMELEVEL=4; /* $101-$150 */ else if H1EE5 <= 200 then INCOMELEVEL=5; /* $151-$200 */ else INCOMELEVEL=6; /* Above $200 */
if H1EE6 <= 20 then WORKHOURS_SUM=1; else if H1EE6 <= 40 then WORKHOURS_SUM=2; else WORKHOURS_SUM =3;
if H1EE7 <= 20 then INCOMELEVEL_SUM=1; else if H1EE7 <= 50 then INCOMELEVEL_SUM=2; else if H1EE7 <= 100 then INCOMELEVEL_SUM=3; else if H1EE7 <= 150 then INCOMELEVEL_SUM=4; else if H1EE7 <= 200 then INCOMELEVEL_SUM=5; else INCOMELEVEL_SUM=6; proc sort; by AID; proc freq; TABLES H1EE1 H1EE2 H1EE3 H1EE4 WORKHOURS H1EE5 INCOMELEVEL H1EE6 WORKHOURS_SUM H1EE7 INCOMELEVEL_SUM H1EE12 H1EE13 H1EE14;
Frequency Tables
For the resulting output, follow the link below...
file:///Users/macintosh/Downloads/Results_%20Coursera%20Addhealth%20Program%201.sas.html
Results Interpretation
According to the frequency tables, here are some interesting findings about employed teens within this sample…
When asked to rate on a scale of 1 to 5 how much they wanted to go to college, 70.32% of the employed teens responded with a strong desire to go to college.
When asked how likely college is for them, 56.36% of employed teens responded with a high likelihood to go to college.
When asked on a scale of 0-140 how many hours they worked in a non-summer work week, 79.98% worked part-time (1-20 hours), 18.58% worked full-time (21-40 hours), and 1.44% work overtime (more than 40 hours). This is reasonable considering that non-summer weeks for adolescents are typically preoccupied with school.
When asked on a scale of $0-$900 how much was the total income in a non-summer week, 41.38% claimed to earn $20 or less, 18.69% earned between $21 and $50, 21.75% earned between $51 and $100, 9.60% earned between $101 and $150, 4.67% earned between $151 and $200, and 3.93% earned more than $200. From these results we can see that the majority (81.8%) of working adolescents from this sample earned $100 or less per week, 14.27% earned between $101-$200 per week, and about 4% earned more than $200.
When asked on a scale of 0-99 how many hours they worked in a summer week, 51.69% worked part-time (1-20 hours), 39.30% worked full-time (21-40 hours), and 8.81% worked overtime (more than 40 hours). From this data we observe that in the summertime a considerable amount of adolescents work both part-time and full-time jobs.
When asked on a scale of $0-$900 how much was the total income in a summer week, 24.90% of working adolescents earned $20 or less, 15.16% earned between $21 and $50, 18.12% earned between $51 and $100, 15.81% earned between $101 and $150, 12.99% earned between $151 and $200, and 13.02% earned more than $200. From these results we see that the majority (58.18%) of working adolescents earned $100 or less, 28.8% earned between $101 and $200, and 13.02% earned more than $200 per week.
When asked to rate on a scale of 1 to 5 their belief that they will live to the age of 35, 56.36% of the employed teens responded with 5, suggesting that it is very likely and 30.72% responded with a 4. Overall, this appears to be an optimistic group of employed teens in terms of life expectancy.
When asked on a scale of 1 to 5 what are the chances of being married by age 25, 35.27% claimed there is a 50-50 chance, while 30.98% claimed its pretty likely.
When asked on a scale of 1 to 5 how likely it is that they will be killed by age 21, 50.49% claimed there is relatively no chance of that happening and 33.82% claimed most likely not.
Summary
The characteristics of the employed teens in this sample show the following:
1) Employed Teens have a relatively strong desire to go to college (70.32%)
2) Employed Teens have a high expectancy of going to college (56.36%)
3) Outside of the summertime, most employed teens only work part-time (79.98%)
4) Outside of the summertime, most employed teens only earn at most $100 a week (81.80%)
5) In the summertime, most employed teens work either part-time (51.69%) or full-time (39.30%)
6) In the summertime, most employed teens only earn at most $100 a week (58.18%), though a considerable amount earn between$101 and $200 a week (28.80%)
7) Most employed teens expect to live to age 35 (56.36%)
8) Most employed teens either believe there is a 50-50 chance of getting married by age 25 (35.27%), or it is pretty likely (30.98%)
9) Most employed teens either believe it is unlikely that they will be killed by age 21 (50.49%), or that it is probable but pretty unlikely (33.82%)
0 notes
Link
Another interesting read on the subject of Employment and Higher Education, among other factors...
0 notes
Text
First Program in SAS
The overall ADDHEALTH dataset has 6504 observations. Since I am interested in the effect of adolescent employment on pursuit of higher education, I focused only on the adolescents who claim to have worked in the past 4 weeks. This cut my dataset down to 3687 observations representing only the teens that have worked. The variables of interest to me were: H1EE1, H1EE2, H1EE3, H1EE4, H1EE5, H1EE6, H1EE7, H1EE12, H1EE13, H1EE14. The Frequency Table results have these variables conveniently labeled.
SAS CODE
Here is the code for my dataset taken from ADDHEALTH...
LIBNAME mydata "/courses/d1406ae5ba27fe300" access=readonly; data new; set mydata.addhealth_pds; LABEL H1EE1='Scale of 1-5, Want to go to College?' H1EE2='Scale of 1-5, How Likely is College for you?' H1EE3='Past 4 weeks, Work for Pay?' H1EE4='How many hours of work, non-summer week? (Group Responses 1-140)' H1EE5='Total income, non-summer week? (Group Responses 1-900)' H1EE6='How many hours of work, summer week? (Group Responses 1-99)' H1EE7='Total income, summer week? (Group Responses 1-900)' H1EE12='Live to Age 35?' H1EE13='Married by Age 25?' H1EE14='Killed by Age 21?'; /* My subset of the ADDHEALTH dataset, containing variables of interest. The "if" statement selects only the 3687 participants in the study who worked for pay in the last 4 weeks*/ if H1EE3=1; proc sort; by AID; proc freq; TABLES H1EE1 H1EE2 H1EE3 H1EE4 H1EE5 H1EE6 H1EE7 H1EE12 H1EE13 H1EE14;
Frequency Tables
Follow the link below (copy and paste onto a separate webpage) to see the Frequency Table Results for the data...
file:///Users/macintosh/Downloads/Coursera%20Project%20Frequency%20Tables.htm
Results Interpretation
According to the frequency tables, here are some interesting findings about employed teens within this sample...
When asked to rate on a scale of 1 to 5 how much they wanted to go to college, 70.25% of the employed teens responded with a strong desire to go to college.
When asked how likely college is for them, 56.25% of employed teens responded with a high likelihood to go to college.
When asked on a scale of 0-140 how many hours they worked in a non-summer work week, 79.01% worked between 1-140 hours. Further, 58.99% of those who worked were only working part-time (1-20 hours) with a considerable amount (18.58%) working full-time. This is reasonable considering that non-summer weeks for adolescents are typically preoccupied with school.
When asked on a scale of $0-$900 how much was the total income in a non-summer week, 77.21% claimed to have earned between $1-$900 per week. A closer look at those who acquired some income shows that 59.01% earned between $1-$100 per week, while 14.27% earned between $101-$200 per week.
When asked on a scale of 0-99 how many hours they worked in a summer week, 88.06% worked between 1-99 hours. Of those who worked, about 39.93% worked part-time (1-20 hours) and about 39.31% worked full-time (21-40) hours. It is noted that there was one missing value.
When asked on a scale of $0-$900 how much was the total income in a summer week, 86.38% earned between $1-$900 per week. Of those who acquired some money, about 44.54% earned between $1-$100 per week and about 28.82% earned between $101-$200 per week. It is noted that there was one missing value.
When asked to rate on a scale of 1 to 5 their belief that they will live to the age of 35, 56.33% of the employed teens responded with 5, suggesting that it is very likely and 30.70% responded with a 4. Overall, this appears to be an optimistic group of employed teens in terms of life expectancy.
When asked on a scale of 1 to 5 what are the chances of being married by age 25, 35.20% claimed there is a 50-50 chance, while 30.92% claimed its pretty likely.
When asked on a scale of 1 to 5 how likely it is that they will be killed by age 21, 50.37% claimed there is relatively no chance of that happening and 33.74% claimed most likely not.
Summary
The characteristics of the employed teens in this sample show the following:
1) Employed Teens have a relatively strong desire to go to college (70.25%)
2) Employed Teens have a high expectancy of going to college (56.25%)
3) Outside of the summertime, most employed teens only work part-time (58.99%)
4) Outside of the summertime, most employed teens only earn at most $100 a week (59.01%)
5) In the summertime, most employed teens work either part-time (39.93%) or full-time (39.31%)
6) In the summertime, most employed teens only earn at most $100 a week (44.54%), though a considerable amount earn at most $200 a week (28.82%)
7) Most employed teens expect to live to age 35 (56.33%)
8) Most employed teens either believe there is a 50-50 chance of getting married by age 25 (35.20%), or it is pretty likely (30.92%)
9) Most employed teens either believe it is unlikely that they will be killed by age 21 (50.37%), or that it is probable but pretty unlikely (33.74%)
0 notes
Link
A great article addressing the relationship of adolescent employment and higher learning, among other factors.
0 notes