Problem Set 1
1.Text Problems: 3.1, 3.6, 3.13, 3.14, 4.1, 4.2, 4.5
2 . The Demographics Journal published the 97-98 percentage
population changes
in states left (http://www.amcity.com/journals/demographics/report78/78-4.html)
Rank
1. Nevada +4.06%
2. Arizona +2.53%
3. Georgia +2.03%
3. Colorado +2.03%
5. Texas +1.93%
6. Utah +1.68%
7. Idaho +1.64%
8. Florida +1.63%
9. North Carolina +1.56%
10. California +1.51%
11. Washington +1.31%
12. South Carolina +1.26%
13. Oregon +1.19%
14. Delaware +1.15%
15. New Hampshire +1.10%
15. Tennessee +1.10%
17. Kansas +1.06%
18. Minnesota +0.81%
19. Virginia +0.80%
20. Maryland +0.78%
21. Oklahoma +0.76%
22. New Mexico +0.75%
22. Mississippi +0.75%
24. Alaska +0.71%
25. New Jersey +0.70%
26. Alabama +0.69%
27. Kentucky +0.67%
28. Arkansas +0.60%
29. Indiana +0.59%
30. Missouri +0.56%
31. Massachusetts +0.53%
32. Illinois +0.47%
33. Wisconsin +0.43%
34. Vermont +0.38%
34. Michigan +0.38%
36. Louisiana +0.35%
37. Nebraska +0.34%
38. Iowa +0.28%
39. Connecticut +0.21%
40. Montana +0.20%
41. Maine +0.19%
42. Wyoming +0.18%
43. New York +0.16%
44. Ohio +0.15%
45.Rhode Island+0.12%
46. Hawaii +0.08%
47. South Dakota +0.06%
48. Pennsylvania -0.08%
49. West Virginia -0.22%
50. North Dakota -0.42%
51. District of Columbia -1.28%
i. Construct a frequency distribution of the percentage population changes.
ii. What is the mean and the standard deviation of the percentages in the sample?
iii. Is the frequency distribution unimodal or multimodal?
iv. What possible explanation can you provide for the regional trends exhibited here?
Problem Set 2
In table 3, (see Problem set 2 for table) the data from table 1
have been converted into real terms by
dividing consumption and GDP by the price deflator and calculating a
real
interest rate by subtracting the rate of increase of the deflator
from the
interest rate. Table 4, which again you will use for estimating the
least
squares regressions, shows the sums of squares and cross -products
for the
real variables as well as their sums.
1. Calculate the simple regressions relating real consumption
first to the
real GDP and second to the real interest rate. Give your numerical
results
for these coefficients. Comment on any difference from the results
using
the nominal values from Tables 1 and 2 and using the real values of
Tables
3 and 4.
2. What dummy variable, if any, would be a logical addition for
the
equation in problem 3 relating real consumption to real GDP?
Supposing
there were an applicable dummy variable, how would you calculate
its
coefficient?
3. Derive the expression for the simple regression
coefficient, beginning with the normal equations. Explain each
mathematical
step in your own words.
Problem Set 3
1. Mirer 8.7, 13.13, Beals, 2.11, 3.1, 4.1, 4.5, 4.11
Problem Set 4
1. Problems in Text. Mirer: 9.9, 9.10,9.15, 9.20,9.23
2. The Beloit Corp., based in Beloit, Wisconsin, sells paper
manufacturing
equipment. Based on past experience, we know that in the summer
months it
is equally likely that it will sell 0,1,2,3, or 4 paper
manufacturing
machines in a day. (The firm has never sold more than 4 machines per
day.)
(a) Is the number of machines sold in a day a random variable?
(b) If so, what values can this random variable assume?
(c) If the number of machines sold in a day is a random variable,
construct
a table showing its probability distribution.
(d) Plot the probability distribution of the number of machines
sold in a
day in a line chart. What mathematical function can represent this
probability distribution?
(e) What is the probability that the number of machines sold on a
given day
will be less than 1? Less than 3? Less than 4? Less than 6?
3. An insurance company offers a 50-year-old woman a \$1,000
one-year term
insurance policy for an annual premium of \$18. If the annual number
of
deaths per 1,000 is six for women in this age group, what is the
expected
gain for the insurance company from a policy of this kind? What is
the
standard deviation of the gain for the insurance company from a
policy of
this kind?
Problem Set 5
1. Problems in Mirer: 10.1, 10.2, 10.4, 10.8, 10.9 Beals: 6.2, 6.4
2. Why do we use the standard normal distribution? How do we
calculate
parameters differently on samples and populations? Explain the
difference
between z-tests and t-tests.Explain the central limit theorem in your
own words.
3. LSAT scores
A. Among the applicants to one law school in 1976, the average LSAT
score
was around 600 with a standard deviation of 100. The LSAT scores
followed
the normal curve.
i. What percentage of the applicants scored above 700?
ii. Estimate the 90th percentile for the scores.
B. Incoming students at a certain law school have an average LSAT
score of
680 with a standard deviation of 25. What is your best guess for the
score
of a randomly chosen student: 680, 705, or 730? Why?
4. Write your own problem which relates to z-tests, normal curves,
or
sample distributions. Include a detailed solution.
Problem Set 6
1. Text problems: Mirer, 18.6, 18.7, 18.8, 18.9 Beals: 8.1a, 8.4,8.6,8.13
2. Bill has a hunch, which he decides to evaluate quantitatively.
Bill
believes that the numbers of people in Springfield (total population
1996
5000) relying on state supplimentary income in the form of welfare
are
''lower than they used to be.'' In order to evaluate this, he
randomly
samples 1000 people in the population. He calculates a sample mean of
3
with a sample standard deviation of 1.71. A worker plotted the data
and the
distribution appears to be normal. Construct a confidence interval at
the
95% level of confidence for the population mean.
3. In a famous medical study, not long after the aforementioned
tobacco
company investigated changing their brand variety, a researcher at
Sloan
Kettering Cancer institute began studying the effects of tobacco
smoke. Dr.
Wynder painted mice with a distillate of cigarette smoke, equivalent
to one
pack of cigarettes three times a week. Of a randomly selected sample
of 62
mice from Dr.Wynder's population, 27 mice developed cancerous tumors
after
one year (Kluger 161). Construct a confidence interval for the
population
proportion at the 95% level of confidence.
4. You are the fish buyer for a company that sells frozen seafood
to
supermarket chains. Part of your job is to bid on boatloads of shrimp
when
they arrive at the dock. The selling price of shrimp per pound
depends on
the number of shrimp in a pound, large shrimp being more valuable
than
shrimpy shrimp. You have to estimate the mean number of shrimp per
pound
for a boatload before you know how much to bid. You randomly select
50
pounds and count the number in each pound. The sample mean is 25 with
a
sample standard deviation of 6. Construct a confidence interval at
the 95%
level of confidence for the mean number of shrimp per pound in the
boatload.
5. According to a panel appointed by the Environmental Protection
Agency,
the quantity of a much-debated ingredient in gasoline, M.T.B.E., or
methyl
tertiary butyl ether, a possible carcinogen, should be ''reduced
substantially'' because it dissolves easily in water and turns up in
tap
water when gasoline has leaked or spilled. MTBE was introduced
into
gasoline, in order to meet 1990 federal clean air regulations;
experts
differ on how much the ingredient has done to control carbon monoxide
and
smog, but they agree that it has cut toxic components from gasoline
in the
air by about 30 percent (7/27/99 NYT Wald). A major gasoline
manufacturer
would like to demonstrate that M.T.B.E has significantly reduced the
parts
per million of carbon monoxide in the air. Average ppm of carbon
monoxide
in the air in Milwaukee, Wisconsin before MTBE was 346. Consumer
advocates
observe that on 16 days when air quality was measured during the year
after
the MTBE was introduced there was an average reading of 341 ppm with
a
sample standard deviation of 36. Test at the 5% significance level
the
hypothesis that the mean reading is equal to 346, assuming a
normal
distribution.
Problem Set 7
1. Text Problems: Mirer 11.10, 11.11, 11.16,11.18, 12.3, 12.5, Beals 10.3
Problem Set 8
1. Prove [ (R^2)/[k-1]/{(1-R^2)/(N-k)}=[{(ESS)/[k-1]}/{(RSS)/(N-k)}
2. Demonstrate: for the addition of one variable, the F
statistic=the
square of the t statistic.
3. Suppose that you wanted to study the behavior of cigarette
smokers over
a number of years, and the statisticians at the Department of Health
and
Human Services suggest two models for cigarette consumption:
Yt=B0+B1t and
Yt=a0+a1t+a2t2
where
Yt =consumption at time t, t=time measured in years.
a) What pattern of cigarette consumption does the first model
suggest? The
second?
b) How would you choose between the two models?
c) For what other products might the quadratic be applicable?
4. Suggest another macro model, and write out your null and
alternative
hypotheses for two different f-tests.
Problem Set 9
1. Text problems: Mirer 6.20, 6.22, 6.24, 6.26, Beals 12.1, 12.3,
b) Let Y be the price of a house and X be the living area. We can
expect
that the additional price a consumer will be willing to pay for
100
additional square feet of living area will be higher when X=1300 than
when
X=3200. This means that the marginal effect of SQFT on Price can
be
expected to decrease as SQFT increases.
i)What form would enable us to capture this relationship?
ii) Suggest what equation you would use to find the elasticity at
the mean
price of house.
2. Demand functions are frequently written as reciprocal transformations
Y= B0+B1(1/X).
a)What sign would we expect B1 to have?
b)What value does Y approach as X goes to infinite?
c)Graph this line for B1 <0 and B1
>0.
d)Suggest how you might calculate the elasticity of this demand
function.
3. Quadratic functions are frequently used to fit u-shaped cost
functions.
For the given quadratic function, of the form
Y=B0+B1X+B2X2+et
we have data on the per-unit cost of a manufacturing firm over a
20-year
period, an index of its output, and an index of its input costs. We
have
estimated a regression equation (with t-values in parentheses).
UnitCost= |
10.522 |
-.175 Output |
+.00895 Output2 |
+.0202 Impcost |
(14.3) |
(-9.7) |
(7.8) |
(14.454) |
|
R2=.978 |
d.f.=16 |
a) Test the significance of each of the variables.
b) Sketch what you think this estimated average cost function would look like.
c) For an input cost of 80, plot a few sample points using output
values of
60, 80, 100, and 120 (Ramanathan).
d) Find another function, in your Micro book which follows this
form, and
suggest likely variables.
4. Suppose that, instead of the linear demand for money function
in
Exercise 1, you hypothesize the log-log demand function
ln Mt=a0+a1lnYt+a2lnIt+et
a) Find the least squares estimates of
a0,a1, and a2.
b) Interpret the estimated coefficients.
c) Using the results you obtained in Exercise 1 and 2a, find 95%
interval
estimates for B1, B2,a1, and
a2.
d) Test each of the null hypotheses H0:
B1=B2 =0 and
H0:a1= a2=0.
e) Is there any evidence in the data to suggest that the linear or
the
log-linear model is more appropriate?
Problem Set 10
1. You are considering betting $100 on a horse named Bayes. If the
horse
wins, you will get your $100 back and an additional $900. If it
loses, you
will be out the $100. Your subjective probability of the horse's
winning is
0.2. You could buy a tip from Mr. Oracle before making the bet. Given
that
a horse will lose, the probability that Mr. Oracle will predict that
it
will win is 0.05. Given that a horse will win, the probability that
Mr.
Oracle will predict a win is 0.5. The probabilities of Mr.
Oracle's
prediction that Bayes will win or lose are 0.1 and 0.9 respectively.
Mr.
Oracle's prediction cost $100. Construct a tree diagram for this
problem
and decide on the best course of action. What is the expected payoff
of
this best choice?
2.An appliance firm must decide whether or not to offer Mr.Jones a
job. If
Mr.Jones is a success, the firm will increase its profits by 100,000;
if he
turns out not to be a success, the firm's profits will decrease by
80,000.
The firm feels the chances are 50-50 that he will be a success
(courtesy of
Mansfield 17.7).
A) How can the firm obtain information concerning whether
or not Whelan will be a success?
B) What is the expected value of Mr.Jones
to the company? Should they hire him?
3. In the game Rock, Scissors, paper, two players simultaneously
show hand
signals representing a rock, scissors, or paper. Rock beats
scissors,
scissors beats paper, and paper beats rock. Show the payoff matrix
using 1
to represent a win, -1 to represent a loss, and 0 to represent a tie.
Is
there a noncooperative equilibrium? What mixed strategy would you
recommend? (Smith 19.32,33)
4. One player plays rock 26 times, scissors 9 times, and paper 15
times.
This is your opponent; these empirical frequencies are the
probabilities
that this student's first move will be rock, scissors, or paper
(Smith
19.58). What strategy mazimizes the expected value of your
playoff?
5. Bobby M. drives to Wharton every day. He can either turn right,
from
the highway, into parking lot P where there is always plenty of space
or he
can take his chances, driving ten minutes farther, into the small
parking
lot behind the Business school (30 percent chance of finding a spot).
If he
does not find a parking space in the small parking lot, he can always
park
at the meter in front of the B School. If he's feeling punchy, he can
forgo
the daily four dollar meter fee, and instead chance a 15 dollar
ticket
(occurs one tenth of the time). It takes 10 minutes to walk to the B
school
from parking lot P, two minutes from the small parking lot, and 4
minutes
from the meter at front (Wonnacott and Wonnacott, 21.11). If his time
is
worth 12 per hour, what is his best strategy? How much does it
cost?
Suppose he faces an additional penalty for not paying the meter: Half
the
cars that get a ticket also get towed away at a cost of 40, on top of
the
15 ticket, plus an hour of time. Now, what should he do? At what
cost?
Problem Set 11
1. Suppose that a researcher is trying to estimate the
following
consumption function, where consumption is hypothesized to depend
on
current income (Yt) habit or past income
(Yt-1), and
expectations or change in income
deltaY=(Yt-Yt-1)(JJB):
Et=b0+ b1 Yt
+b2Yt-1+b3(deltaY)+et.
Show that without further information it is impossible to
estimate
b1,b2, and b3.
2. Assume that the following production function is to be
estimated from
annual data on a firm (JJB):
Q=B0 +
B1L1+B2L2+B3K
L1= acres of land
L2= dollars of labor
K= dollars of capital equipment
Suppose that the firm always budgets $30,000 a year for labor and capital
(i.e. L2+K=30,000)
a) Is there a multicolinearity problem?
b) Can the coefficients be estimated?
3. (Studenmund). Which of the following pairs of variables is
likely to
include a ''redundant'' variable?
A) the price of refridgerators and the price of washing machines
in a
durable-goods demand function
B) The number of acres harvested and the amount of seed used in
an
agricultural supply function
C) long-term interest rates and the money supply in an investment function
4. The following regression model is heteroscetastic: (1) Y=B0+B1X1+B2X2+U where Y=dollars spent by consumers
on food in a city in a week, X1= dollars of income
earned by consumers
in a city during a week, and X2=population of a city
(JJB). A
researcher uses OLS to estimate (1) by the following equation,
assuming
that E(U2)=sigma2 .
Y=b0+b1X1+b2X2+e
a) Is E(b0)=B0?Is
E(b1)=B1?Is E(b2)=B2?
E(S2)=sigma2?
b) Assuming that b0 ,b1 and b2 are
unbiased estimators of the
parameters in 1, can Sbi be used to test hypothesis?
5. Autocorr/heterosketasticity (JJB).
A) Which type of mispecified
disturbance term is more likely to occur in time-series data? In
cross
sectional data?
B)Is the variance of the disturbance term more likely to be
positively or
negatively correlated with the independent variables?
C)Is autocorrelation more likely to be positive or negative?