Senior Thesis help - serial correlation in time series
Posted: Wed Mar 03, 2010 2:09 am
Hello,
I have data on credit spreads for Goldman Sachs (2007-2009, daily). I want to be able to forecast what the spreads would have been had the recent financial crisis had not occurred using macroeconomic data and firm specific data. For example, I run the regression:
GoldmanSpreads = c(1)+c(2)ROE+c(3)PB+c(4)DE+c(5)GDP
where ROE=return on equity (firm specific)
PB= price to book value (firm specific)
DE= debt to equity ratio (firm specific)
GDP=GDP growth (macroeconomic variable)
The output of this regression is great - I get very high (~0.9 adjusted Rsquared) but the Durbin Watson stat is low (0.4). There is obvious serial correlation going on.
However, if I add an AR(1) into this equation:
GoldmanSpreads = c(1)+c(2)ROE+c(3)PB+c(4)DE+c(5)GDP+AR(1)
the coefficient on AR(1) is so large that it makes the other variables that were originally significant to become NOT significant (according to t-stats).
I know that ROE, PB, DE, GDP growth are utilized and makes sense in a model of credit spreads. But, I want to correct for the low Durbin Watson Stat WITHOUT including AR(1) or something of the sort that takes away the original predictive power of the variables like ROE, etc.
Does anyone have suggestions?!
I am a senior undergrad writing my econ thesis and ANY inputs would be wonderful.
thanks in advance.
I have data on credit spreads for Goldman Sachs (2007-2009, daily). I want to be able to forecast what the spreads would have been had the recent financial crisis had not occurred using macroeconomic data and firm specific data. For example, I run the regression:
GoldmanSpreads = c(1)+c(2)ROE+c(3)PB+c(4)DE+c(5)GDP
where ROE=return on equity (firm specific)
PB= price to book value (firm specific)
DE= debt to equity ratio (firm specific)
GDP=GDP growth (macroeconomic variable)
The output of this regression is great - I get very high (~0.9 adjusted Rsquared) but the Durbin Watson stat is low (0.4). There is obvious serial correlation going on.
However, if I add an AR(1) into this equation:
GoldmanSpreads = c(1)+c(2)ROE+c(3)PB+c(4)DE+c(5)GDP+AR(1)
the coefficient on AR(1) is so large that it makes the other variables that were originally significant to become NOT significant (according to t-stats).
I know that ROE, PB, DE, GDP growth are utilized and makes sense in a model of credit spreads. But, I want to correct for the low Durbin Watson Stat WITHOUT including AR(1) or something of the sort that takes away the original predictive power of the variables like ROE, etc.
Does anyone have suggestions?!
I am a senior undergrad writing my econ thesis and ANY inputs would be wonderful.
thanks in advance.