摘要:We present Template Numerical Library (TNL, www.tnl-project.org) with native supportof modern parallel architectures like multi–core CPUs and GPUs. The library offers an abstractlayer for accessing these architectures via unified interface tailored for easy and fast development ofhigh-performance algorithms and numerical solvers. The library is written in C++ and it benefitsfrom template meta–programming techniques. In this paper, we present the most important datastructures and algorithms in TNL together with scalability on multi–core CPUs and speed–up on GPUssupporting CUDA.