Hi there!
I am a MSc student in HPC, and I am currently working with Quantum Espresso, in order to improve their performance in a cluster environment.
Recently I generated the call graph of the application through a machine with Ivy Bridge microarchitecture (Intel® Xeon® Processor E5-2670 v2), which uses AVX as extension of the instruction set, which supposedly does not support operations to gather and scatter with the MPI, but as you can see in the call graph excerpt from the link below, the application uses these operations by libmkl_avx.so library more precisely the following operations: mkl_dft_avx_gather_z_z and mkl_dft_avx_scatter_z_z.
Someone can help me understand why this occur?
Thanking you in advance, yours sincerely
Sérgio Caldas