My goal is, to sum up, all elements of a vector. I used in the last post a single thread. In this post, I use multiple threads and therefore the full power of my PC. The addition will be done on a shared variable. What at first glance seems like a good idea is a very naive strategy. The synchronization overhead of the summation variable is higher than the performance benefit of my four or two cores.
Read more
Read more...