Managing resources when running a lot of regression
Posted: Thu Sep 19, 2024 7:19 am
I have some questions about resource management when EViews workfiles become big, using EViews 14. I am using the <autoarma> function to automate the time series part of an overall automated model selection for combinations of exogenous variables.
As you can guess, this spits out a lot of regressions. For the set of variables I have now I need to run autoarma about 430k times. Autoarma produces a very useful spool and the original iteration of my program saved all the spools produced along with other information and saved a backup every 10k regressions. These files became quite large quickly and saving the files seemed to add significantly more time to the unsaved baseline runtime.
To save disk space--and I had hoped speed things up--I now save only a table with enough identifying information to rerun autoarma for the top N regressions selected to produce the final output for humans. What I am noticing is that when the number of regressions is small (10k-20k) the process takes about 20 minutes. To go from 390k->400k, however, it took about two and a half hours.
The reason I am here is because I am trying to get a handle on what is happening. Eviews is pretty consistently taking up up about 26% or27% of the CPU, I assume that is because it is only running on one core. I don’t believe this matters here?
Right now at >400k regressions Eviews is taking up about 400mb of RAM and the file size at 400k regressions is 201mb. None of this strikes me as exceptional. Anyway, I’m at a loss and looking for alternative ways of thinking about how to speed my program up. I expected this program to take a while, but I did not expect the exponential increase in runtimes. I assume, from what I said above, that it has to do with saving the file?
I have done all the things suggested about slow runtimes elsewhere in the forum. I am working off a server drive, but when I tested it on a local drive runtimes were not that different. I also turned off automatic snapshots.
A related auxiliary question: Is it possible to delete workfiles from within EViews? If so, does deleting files also slow the process down in the same way saving files does?
As you can guess, this spits out a lot of regressions. For the set of variables I have now I need to run autoarma about 430k times. Autoarma produces a very useful spool and the original iteration of my program saved all the spools produced along with other information and saved a backup every 10k regressions. These files became quite large quickly and saving the files seemed to add significantly more time to the unsaved baseline runtime.
To save disk space--and I had hoped speed things up--I now save only a table with enough identifying information to rerun autoarma for the top N regressions selected to produce the final output for humans. What I am noticing is that when the number of regressions is small (10k-20k) the process takes about 20 minutes. To go from 390k->400k, however, it took about two and a half hours.
The reason I am here is because I am trying to get a handle on what is happening. Eviews is pretty consistently taking up up about 26% or27% of the CPU, I assume that is because it is only running on one core. I don’t believe this matters here?
Right now at >400k regressions Eviews is taking up about 400mb of RAM and the file size at 400k regressions is 201mb. None of this strikes me as exceptional. Anyway, I’m at a loss and looking for alternative ways of thinking about how to speed my program up. I expected this program to take a while, but I did not expect the exponential increase in runtimes. I assume, from what I said above, that it has to do with saving the file?
I have done all the things suggested about slow runtimes elsewhere in the forum. I am working off a server drive, but when I tested it on a local drive runtimes were not that different. I also turned off automatic snapshots.
A related auxiliary question: Is it possible to delete workfiles from within EViews? If so, does deleting files also slow the process down in the same way saving files does?