The article reports on the panel discussion on whether to focus on single-threaded or multi-threaded during the 13th Annual IEEE/ACM International Symposium on High-Performance Computer Architecture in February
The single-threaded CRT is no longer available. This topic discusses how to get the maximum performance from the multithreaded libraries. Maximizing performance The performance of the multithreaded libraries has been improved and is close to the performance of the now-eliminated single-threaded libraries...
There's a good deal of documentation about thread-safety (correctness), but not much about multithreaded performance. This is a bit different from#149, and I think we should do both. I'd like the page to help the reader develop a basic mental model for how to write programs that will ...
Multiple threads within a single process have considerably less overhead than a corresponding number of processes since they share address space and memory. With that in mind, let’s revisit our test case, but this time using Ruby’sThreadclass: ...
It is at this latter extreme where lock-free techniques become applicable. Conceptually, if the time a lock needs to be held can be shrunk to a single memory access, then it is possible to take advantage of the hardware to avoid some or all of the locking overhead. Unfortunat...
One of the most effective routes to high threading performance scale factors is to minimize the single thread performance. Also, the better your performance, on a platform with multiple cache, the more important it is to set appropriate affinity (e.g. KMP_AFFINITY=compact for ifor...
It was immediately clear that 80 Geant4-based processes, each with a footprint of more than a gigabyte, would never work, due to the large memory pressure on a single system bus to memory. A potentially sim- ple solution takes advantage of UNIX copy-on-write semantics to enhance the ...
Decompression is also sped up for larger files (many tens of megabytes or more); for smaller files, it's about the same as Java's built-in single-threaded GZipInputStream. Decompression of the aforementioned Wikipedia data was over 3x faster. ...
ingsingle-threadperformance. Weevaluateavarietyofheterogeneousarchitecturalde- signs, including processor cores that are themselves mul- tithreaded, an extension to the original architecture pro- posal [14]. Through this evaluation, we make the following ...
programs, thus each thread of a multi-threaded processor appeared to the operating system as a processor. As technology further evolved, it was possible to put multiple processors (each having an IPU) on a single semiconductor chip or die. These processors were referred to processor cores or ...