How to Correctly Structure a File for Import into E-Views
Posted: Wed May 26, 2010 8:43 pm
I am using E-Views 7. I have an unbalanced panel of data that consists of 20 cross-sectional units (markets). Each of these markets markets contains a different number of time-series (daily) observations. These range from 31 days for the shortest market to 48 days for the longest market.
I currently have the data in stacked form in an Excel 2007 file. The data are, somewhat arbitrarily, organized in this stacked format according to alphabetical order of the cross-section name. To be as clear as possible, please let me specify in more detail how the data is arranged in Excel:
Relative-Day Market (# of observations) Dependent Variable Independent Variables
Under the relevant headings, I have 43 observations for "Market A." I then have 41 observations for "Market B," and so on until "Market T" (the 20th and final market), which as 40 observations.
The missing data values can arguably be considered as randomly missing, so I am not concerned about any potential inferential problems associated with having an unbalanced panel. What I am concerned with is how to structure the data in Excel before importing it into E-Views.
Since the longest market has 48 observations, do I need to have 48 rows for each cross-section with NAs entered where the data is missing? In other words, do I need to "artifically balance" the data before importing it into E-Views? If not, then will I be fine leaving the data in stacked format, given an unequal number of days for each of the cross-sections?
In considering my question, please be advised that my analysis will involve the use of lagged values of the dependent variable. In other words, I will be conducting dynamic panel data analysis. As such, I need E-Views to recognize the panel structure of the data and not "lag into" the values for the preceding cross-section.
Finally, if I need to "artificially balance" the data prior to importing it into E-Views, then should I enter the NA values at the beginning or at the end of the respective markets? For instance, say that I am dealing with Market A, which has 43 observations. With the maximum number of observations at 48, I would need to enter 5 NA values. Should I do this as
NA
NA
NA
NA
NA
43 values
or as
43 values
NA
NA
NA
NA
NA
Thanks in advance for your help.
I currently have the data in stacked form in an Excel 2007 file. The data are, somewhat arbitrarily, organized in this stacked format according to alphabetical order of the cross-section name. To be as clear as possible, please let me specify in more detail how the data is arranged in Excel:
Relative-Day Market (# of observations) Dependent Variable Independent Variables
Under the relevant headings, I have 43 observations for "Market A." I then have 41 observations for "Market B," and so on until "Market T" (the 20th and final market), which as 40 observations.
The missing data values can arguably be considered as randomly missing, so I am not concerned about any potential inferential problems associated with having an unbalanced panel. What I am concerned with is how to structure the data in Excel before importing it into E-Views.
Since the longest market has 48 observations, do I need to have 48 rows for each cross-section with NAs entered where the data is missing? In other words, do I need to "artifically balance" the data before importing it into E-Views? If not, then will I be fine leaving the data in stacked format, given an unequal number of days for each of the cross-sections?
In considering my question, please be advised that my analysis will involve the use of lagged values of the dependent variable. In other words, I will be conducting dynamic panel data analysis. As such, I need E-Views to recognize the panel structure of the data and not "lag into" the values for the preceding cross-section.
Finally, if I need to "artificially balance" the data prior to importing it into E-Views, then should I enter the NA values at the beginning or at the end of the respective markets? For instance, say that I am dealing with Market A, which has 43 observations. With the maximum number of observations at 48, I would need to enter 5 NA values. Should I do this as
NA
NA
NA
NA
NA
43 values
or as
43 values
NA
NA
NA
NA
NA
Thanks in advance for your help.