31 May 2013
Abstract:Modern computer architectures, with multicore CPUs and GPUs or other accelerators, make stronger demands than ever on writers of scientific code. As a rule of thumb, the fastest, most efficient program consists of labor- intensive code written by expert programmers for a certain application on a particular computer. This thesis deals with several algorithmic and technical approaches towards effectively satisfying the demand for high-performance parallel programming without incurring such a high cost in expert program- mer time. Effective programming is accomplished by writing performance- portable code where performance-critical functionality is provided either by external software or at least a balance between maintainability/generality and efficiency.
Available as PDF (1.25 MB)
Download BibTeX entry.