The BLAS Level 2 routine cblas_?ger computes A := alpha*x*y'+ A. Is there a simpler routine that just calculates A := alpha*x*y'?
Setting A=0 offers the same results, but does it provide good performance too? i.e. am I wasting computation in doing the additions?