Hello,
There are some very impressive memory vs mpi process plots in the excellent mkl presentation:
https://cerfacs.fr/wp-content/uploads/2016/03/Kalinkin.pdf
but its a little confusing what the memory requirements are, is the original matrix needed on each node? Sounds like it is from :
https://software.intel.com/en-us/mkl-developer-reference-fortran-dss-distributed-symmetric-matrix-storage
"The algorithm ensures that the memory required to keep internal data on each MPI process is decreased when the number of MPI processes in a run increases. However, the solver requires that matrix A and some other internal arrays completely fit into the memory of each MPI process."
Any thoughts appreciated, thanks!
Don