When you include the constant, you should expect "near singular matrix." This is the dummy variable trap. I don't see why you get the error when the constant is omitted though.Thanks for your suggestion. When I enter equation: y c DUK DUS DEU DUK*X1 DUS*X1 DEU*X1 and so on for al five variables I get the error near singular matrix, also without entering the constant. This is probably because all dummies are mutually exclusive.Code: Select all
ls y DUK DUS DEU DUK*X DUS*X DEU*X
Starting to get a bit desperate here as I do not seem to be able to solve this problem that shouldn't be so difficult.
Statistical differences between models
Moderators: EViews Gareth, EViews Moderator
-
startz
- Non-normality and collinearity are NOT problems!
- Posts: 3797
- Joined: Wed Sep 17, 2008 2:25 pm
Re: Statistical differences between models
Re: Statistical differences between models
That's more than what I could have asked for.
Basically what I've done is running 4 separate regressions for 4 datasets (entire dataset + its three subsets based on region), all consisting of the same variables. Equation: y c interest quantum low high under over
I'm trying to figure out whether there are significant differences between the coefficients of the variables when comparing each subset (America, UK, Cont. Europe) to one another. As Startz suggested I created dummy variables for each variable to indicate its region. He suggested equation
y (c) DUK DUSA DEUR DUK*X DUSA*X DEUR*X which in my case would lead to the following:
y ie ia iu ic interest*ie interest*ia interest*iu interest*ic qe qa qu qc quantum*qe quantum*qa quantum*qu quantum*qc le la lu lc low*le low*la low*lu low*lc he ha hu hc high*he high*ha high*hu high*hc ue ua uu uc under*ue under*ua under*uu under*uc oe oa ou oc over*oe over*oa over*ou over*oc Where -e is the Entire dataset, -a is America, -u is UK, and -c is Continental Europe.
This, however, leads to near singular matrix error, regardless of whether the constant is included. Any help on solving this issue or different approaches on my statistical differences problem is greatly appreciated! Dataset is attached.
Basically what I've done is running 4 separate regressions for 4 datasets (entire dataset + its three subsets based on region), all consisting of the same variables. Equation: y c interest quantum low high under over
I'm trying to figure out whether there are significant differences between the coefficients of the variables when comparing each subset (America, UK, Cont. Europe) to one another. As Startz suggested I created dummy variables for each variable to indicate its region. He suggested equation
y (c) DUK DUSA DEUR DUK*X DUSA*X DEUR*X which in my case would lead to the following:
y ie ia iu ic interest*ie interest*ia interest*iu interest*ic qe qa qu qc quantum*qe quantum*qa quantum*qu quantum*qc le la lu lc low*le low*la low*lu low*lc he ha hu hc high*he high*ha high*hu high*hc ue ua uu uc under*ue under*ua under*uu under*uc oe oa ou oc over*oe over*oa over*ou over*oc Where -e is the Entire dataset, -a is America, -u is UK, and -c is Continental Europe.
This, however, leads to near singular matrix error, regardless of whether the constant is included. Any help on solving this issue or different approaches on my statistical differences problem is greatly appreciated! Dataset is attached.
- Attachments
-
- All2.0.xls
- (171 KiB) Downloaded 586 times
-
startz
- Non-normality and collinearity are NOT problems!
- Posts: 3797
- Joined: Wed Sep 17, 2008 2:25 pm
Re: Statistical differences between models
You have a better shot of getting someone to look at this if you attach your EViews dataset.That's more than what I could have asked for.
Basically what I've done is running 4 separate regressions for 4 datasets (entire dataset + its three subsets based on region), all consisting of the same variables. Equation: y c interest quantum low high under over
I'm trying to figure out whether there are significant differences between the coefficients of the variables when comparing each subset (America, UK, Cont. Europe) to one another. As Startz suggested I created dummy variables for each variable to indicate its region. He suggested equation
y (c) DUK DUSA DEUR DUK*X DUSA*X DEUR*X which in my case would lead to the following:
y ie ia iu ic interest*ie interest*ia interest*iu interest*ic qe qa qu qc quantum*qe quantum*qa quantum*qu quantum*qc le la lu lc low*le low*la low*lu low*lc he ha hu hc high*he high*ha high*hu high*hc ue ua uu uc under*ue under*ua under*uu under*uc oe oa ou oc over*oe over*oa over*ou over*oc Where -e is the Entire dataset, -a is America, -u is UK, and -c is Continental Europe.
This, however, leads to near singular matrix error, regardless of whether the constant is included. Any help on solving this issue or different approaches on my statistical differences problem is greatly appreciated! Dataset is attached.
Re: Statistical differences between models
That makes sense. I ran the regressions for the subsets on separate workfiles. The file attached combines all data.You have a better shot of getting someone to look at this if you attach your EViews dataset.
- Attachments
-
- all2.0.wf1
- (151.33 KiB) Downloaded 619 times
-
startz
- Non-normality and collinearity are NOT problems!
- Posts: 3797
- Joined: Wed Sep 17, 2008 2:25 pm
Re: Statistical differences between models
That clears it up very quickly. You have included two complete sets of dummies. That's another form of the dummy variable trap.That makes sense. I ran the regressions for the subsets on separate workfiles. The file attached combines all data.You have a better shot of getting someone to look at this if you attach your EViews dataset.
Re: Statistical differences between models
Two? I have four dummy variables (under/over/low/high) but they do not always have opposing values (e.g. under/over, low/high can both be 0).That clears it up very quickly. You have included two complete sets of dummies. That's another form of the dummy variable trap.
What would you suggest me to do?
-
startz
- Non-normality and collinearity are NOT problems!
- Posts: 3797
- Joined: Wed Sep 17, 2008 2:25 pm
Re: Statistical differences between models
I may have gotten it wrong, but it appears that "ie ia iu ic" is one complete set of dummues and that "le la lu lc" is another.Two? I have four dummy variables (under/over/low/high) but they do not always have opposing values (e.g. under/over, low/high can both be 0).That clears it up very quickly. You have included two complete sets of dummies. That's another form of the dummy variable trap.
What would you suggest me to do?
Re: Statistical differences between models
True, but they're not the only ones. I created those dummies to match the values of variable with a region as was suggested earlier. Originally I did not have those dummies, but ran equation y c interest quantum low high under over on 4 separate datasets. Now I merged all data into one file. To my knowledge creating just one set of dummies would only indicate differences in influence on the dependant variable, whereas I want to compare the differences of the coefficients each variable for each region. Hence, I created a set of region dummies for each variable, which leads to the following dataset:I may have gotten it wrong, but it appears that "ie ia iu ic" is one complete set of dummues and that "le la lu lc" is another.
Variable (Dummies)
Interest (ie, ia, iu, ic ) (e=Entire dataset, a=America, u=UK, c=Continental Europe)
Quantum (qe, qa, qu, qc)
Low (le, la, lu, lc)
High (he, ha, hu, hc)
Under (ue, ua, uu, uc)
Over (oe, oa, ou, oc)
Where Low, High, Under, and Over are dummies.
Running a regression on all these variables/dummies, with or without constant, leads to a near singular matrix error. Also when I multiply the variables with the dummies, as was proposed earlier.
-
startz
- Non-normality and collinearity are NOT problems!
- Posts: 3797
- Joined: Wed Sep 17, 2008 2:25 pm
Re: Statistical differences between models
This isn't really an EViews question. Perhaps if you posted in the Econometric Discussions section someone might be able to be more helpful.True, but they're not the only ones. I created those dummies to match the values of variable with a region as was suggested earlier. Originally I did not have those dummies, but ran equation y c interest quantum low high under over on 4 separate datasets. Now I merged all data into one file. To my knowledge creating just one set of dummies would only indicate differences in influence on the dependant variable, whereas I want to compare the differences of the coefficients each variable for each region. Hence, I created a set of region dummies for each variable, which leads to the following dataset:I may have gotten it wrong, but it appears that "ie ia iu ic" is one complete set of dummues and that "le la lu lc" is another.
Variable (Dummies)
Interest (ie, ia, iu, ic ) (e=Entire dataset, a=America, u=UK, c=Continental Europe)
Quantum (qe, qa, qu, qc)
Low (le, la, lu, lc)
High (he, ha, hu, hc)
Under (ue, ua, uu, uc)
Over (oe, oa, ou, oc)
Where Low, High, Under, and Over are dummies.
Running a regression on all these variables/dummies, with or without constant, leads to a near singular matrix error. Also when I multiply the variables with the dummies, as was proposed earlier.
Who is online
Users browsing this forum: No registered users and 2 guests
