Statistical differences between models

For technical questions regarding estimation of single equations, systems, VARs, Factor analysis and State Space Models in EViews. General econometric questions and advice should go in the Econometric Discussions forum.

Moderators: EViews Gareth, EViews Moderator

startz
Non-normality and collinearity are NOT problems!
Posts: 3797
Joined: Wed Sep 17, 2008 2:25 pm

Re: Statistical differences between models

Postby startz » Thu Mar 26, 2009 3:47 pm

Code: Select all

ls y DUK DUS DEU DUK*X DUS*X DEU*X
Thanks for your suggestion. When I enter equation: y c DUK DUS DEU DUK*X1 DUS*X1 DEU*X1 and so on for al five variables I get the error near singular matrix, also without entering the constant. This is probably because all dummies are mutually exclusive.
Starting to get a bit desperate here as I do not seem to be able to solve this problem that shouldn't be so difficult.
When you include the constant, you should expect "near singular matrix." This is the dummy variable trap. I don't see why you get the error when the constant is omitted though.

Movid
Posts: 14
Joined: Fri Mar 13, 2009 7:44 am

Re: Statistical differences between models

Postby Movid » Fri Mar 27, 2009 6:43 am

That's more than what I could have asked for.
Basically what I've done is running 4 separate regressions for 4 datasets (entire dataset + its three subsets based on region), all consisting of the same variables. Equation: y c interest quantum low high under over

I'm trying to figure out whether there are significant differences between the coefficients of the variables when comparing each subset (America, UK, Cont. Europe) to one another. As Startz suggested I created dummy variables for each variable to indicate its region. He suggested equation
y (c) DUK DUSA DEUR DUK*X DUSA*X DEUR*X which in my case would lead to the following:

y ie ia iu ic interest*ie interest*ia interest*iu interest*ic qe qa qu qc quantum*qe quantum*qa quantum*qu quantum*qc le la lu lc low*le low*la low*lu low*lc he ha hu hc high*he high*ha high*hu high*hc ue ua uu uc under*ue under*ua under*uu under*uc oe oa ou oc over*oe over*oa over*ou over*oc Where -e is the Entire dataset, -a is America, -u is UK, and -c is Continental Europe.

This, however, leads to near singular matrix error, regardless of whether the constant is included. Any help on solving this issue or different approaches on my statistical differences problem is greatly appreciated! Dataset is attached.
Attachments
All2.0.xls
(171 KiB) Downloaded 586 times

startz
Non-normality and collinearity are NOT problems!
Posts: 3797
Joined: Wed Sep 17, 2008 2:25 pm

Re: Statistical differences between models

Postby startz » Fri Mar 27, 2009 7:04 am

That's more than what I could have asked for.
Basically what I've done is running 4 separate regressions for 4 datasets (entire dataset + its three subsets based on region), all consisting of the same variables. Equation: y c interest quantum low high under over

I'm trying to figure out whether there are significant differences between the coefficients of the variables when comparing each subset (America, UK, Cont. Europe) to one another. As Startz suggested I created dummy variables for each variable to indicate its region. He suggested equation
y (c) DUK DUSA DEUR DUK*X DUSA*X DEUR*X which in my case would lead to the following:

y ie ia iu ic interest*ie interest*ia interest*iu interest*ic qe qa qu qc quantum*qe quantum*qa quantum*qu quantum*qc le la lu lc low*le low*la low*lu low*lc he ha hu hc high*he high*ha high*hu high*hc ue ua uu uc under*ue under*ua under*uu under*uc oe oa ou oc over*oe over*oa over*ou over*oc Where -e is the Entire dataset, -a is America, -u is UK, and -c is Continental Europe.

This, however, leads to near singular matrix error, regardless of whether the constant is included. Any help on solving this issue or different approaches on my statistical differences problem is greatly appreciated! Dataset is attached.
You have a better shot of getting someone to look at this if you attach your EViews dataset.

Movid
Posts: 14
Joined: Fri Mar 13, 2009 7:44 am

Re: Statistical differences between models

Postby Movid » Fri Mar 27, 2009 7:46 am

You have a better shot of getting someone to look at this if you attach your EViews dataset.
That makes sense. I ran the regressions for the subsets on separate workfiles. The file attached combines all data.
Attachments
all2.0.wf1
(151.33 KiB) Downloaded 619 times

startz
Non-normality and collinearity are NOT problems!
Posts: 3797
Joined: Wed Sep 17, 2008 2:25 pm

Re: Statistical differences between models

Postby startz » Fri Mar 27, 2009 8:05 am

You have a better shot of getting someone to look at this if you attach your EViews dataset.
That makes sense. I ran the regressions for the subsets on separate workfiles. The file attached combines all data.
That clears it up very quickly. You have included two complete sets of dummies. That's another form of the dummy variable trap.

Movid
Posts: 14
Joined: Fri Mar 13, 2009 7:44 am

Re: Statistical differences between models

Postby Movid » Fri Mar 27, 2009 8:21 am

That clears it up very quickly. You have included two complete sets of dummies. That's another form of the dummy variable trap.
Two? I have four dummy variables (under/over/low/high) but they do not always have opposing values (e.g. under/over, low/high can both be 0).
What would you suggest me to do?

startz
Non-normality and collinearity are NOT problems!
Posts: 3797
Joined: Wed Sep 17, 2008 2:25 pm

Re: Statistical differences between models

Postby startz » Fri Mar 27, 2009 8:44 am

That clears it up very quickly. You have included two complete sets of dummies. That's another form of the dummy variable trap.
Two? I have four dummy variables (under/over/low/high) but they do not always have opposing values (e.g. under/over, low/high can both be 0).
What would you suggest me to do?
I may have gotten it wrong, but it appears that "ie ia iu ic" is one complete set of dummues and that "le la lu lc" is another.

Movid
Posts: 14
Joined: Fri Mar 13, 2009 7:44 am

Re: Statistical differences between models

Postby Movid » Fri Mar 27, 2009 10:20 am

I may have gotten it wrong, but it appears that "ie ia iu ic" is one complete set of dummues and that "le la lu lc" is another.
True, but they're not the only ones. I created those dummies to match the values of variable with a region as was suggested earlier. Originally I did not have those dummies, but ran equation y c interest quantum low high under over on 4 separate datasets. Now I merged all data into one file. To my knowledge creating just one set of dummies would only indicate differences in influence on the dependant variable, whereas I want to compare the differences of the coefficients each variable for each region. Hence, I created a set of region dummies for each variable, which leads to the following dataset:
Variable (Dummies)
Interest (ie, ia, iu, ic ) (e=Entire dataset, a=America, u=UK, c=Continental Europe)
Quantum (qe, qa, qu, qc)
Low (le, la, lu, lc)
High (he, ha, hu, hc)
Under (ue, ua, uu, uc)
Over (oe, oa, ou, oc)
Where Low, High, Under, and Over are dummies.

Running a regression on all these variables/dummies, with or without constant, leads to a near singular matrix error. Also when I multiply the variables with the dummies, as was proposed earlier.

startz
Non-normality and collinearity are NOT problems!
Posts: 3797
Joined: Wed Sep 17, 2008 2:25 pm

Re: Statistical differences between models

Postby startz » Fri Mar 27, 2009 10:41 am

I may have gotten it wrong, but it appears that "ie ia iu ic" is one complete set of dummues and that "le la lu lc" is another.
True, but they're not the only ones. I created those dummies to match the values of variable with a region as was suggested earlier. Originally I did not have those dummies, but ran equation y c interest quantum low high under over on 4 separate datasets. Now I merged all data into one file. To my knowledge creating just one set of dummies would only indicate differences in influence on the dependant variable, whereas I want to compare the differences of the coefficients each variable for each region. Hence, I created a set of region dummies for each variable, which leads to the following dataset:
Variable (Dummies)
Interest (ie, ia, iu, ic ) (e=Entire dataset, a=America, u=UK, c=Continental Europe)
Quantum (qe, qa, qu, qc)
Low (le, la, lu, lc)
High (he, ha, hu, hc)
Under (ue, ua, uu, uc)
Over (oe, oa, ou, oc)
Where Low, High, Under, and Over are dummies.

Running a regression on all these variables/dummies, with or without constant, leads to a near singular matrix error. Also when I multiply the variables with the dummies, as was proposed earlier.
This isn't really an EViews question. Perhaps if you posted in the Econometric Discussions section someone might be able to be more helpful.


Return to “Estimation”

Who is online

Users browsing this forum: No registered users and 2 guests