i took a rather crude approach and just opened them each separately from windows explorer
. From my experiences: it might be optimal to open more instances than cores+virtual cores if your desire is to truly max out you CPU (I dont think a single instance will max out a single core, dependent on how your program is written), and that your instances wont complete at the same speed, which i think is due to how system processes are allocated across cores, meaning that some instances will share cores which are bogged down with running other background tasks. Naturally, you'll need to make sure you have enough ram for the task: I believe this is conditional on what your program writes to workfiles: e.g. if you're storing large matrices on each recursion or using high frequency data or creating a lot of auxiliary objects, you'll need more ram.
Please let me know if you have any further questions on this topic - very happy to help (if I have said anything incorrect in the above paragraph,which is likely, i'd be grateful for any corrections!)
Charlie