From da16218781e41993a93e98a88adf5ffb51de3489 Mon Sep 17 00:00:00 2001
From: Martin Reinecke <martin@mpa-garching.mpg.de>
Date: Tue, 15 Jan 2019 11:07:34 +0100
Subject: [PATCH] add README.md

---
 README.md | 35 +++++++++++++++++++++++++++++++++++
 1 file changed, 35 insertions(+)
 create mode 100644 README.md

diff --git a/README.md b/README.md
new file mode 100644
index 0000000..7620953
--- /dev/null
+++ b/README.md
@@ -0,0 +1,35 @@
+# Libsharp
+
+Library for efficient spherical harmonic transforms at arbitrary spins,
+supporting CPU vectorization, OpenMP and MPI.
+
+## Paper
+
+https://arxiv.org/abs/1303.4945
+
+## News
+
+### January 2019
+
+This update features significant speedups thanks to important algorithmic
+discoveries by Keiichi Ishioka
+(https://www.jstage.jst.go.jp/article/jmsj/96/2/96_2018-019/_article and
+personal communication).
+
+These improvements reduce the fraction of CPU time spent on evaluating the
+recurrences for Y_lm coefficients, which means that computing multiple
+simultaneous SHTs no longer has a big performance advantage compared to SHTs
+done one after the other.
+As a consequence, libsharp support for simultaneous SHTs was dropped, making
+its interface much simpler.
+
+With the proper compilers and flags (see the file COMPILE for details) libsharp
+is now built with support for SSE2, AVX, AVX2, FMA3, FMA4 and AVX512f and the
+appropriate implementation is selected dynamically at runtime. This should
+provide a very significant performance boost for everyone using pre-compiled
+portable binaries.
+
+### Compilation
+
+The library uses the standard `autotools` mechanism for configuration,
+compilation and installation. See the file `COMPILE` for configuration hints.
-- 
GitLab
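
The runtime selection described in the January 2019 news item can be illustrated with a minimal sketch. This is not libsharp's actual dispatch code: the names `sum_kernel`, `kernel_choice`, `select_kernel` and the `sum_*` functions are hypothetical, and the "vectorized" variants are stand-ins that forward to the scalar loop. The sketch assumes GCC or Clang on x86, where the `__builtin_cpu_supports` builtin is available, and it checks only a subset of the instruction sets listed in the README (FMA3 and FMA4 are left out for brevity).

```c
#include <stddef.h>

/* Hypothetical kernel type: sums n doubles. Not part of libsharp's API. */
typedef double (*sum_kernel)(const double *x, size_t n);

/* Pairs a human-readable name with the chosen implementation. */
typedef struct {
    const char *name;
    sum_kernel fn;
} kernel_choice;

/* Portable fallback that works on every CPU. */
static double sum_scalar(const double *x, size_t n) {
    double s = 0.0;
    for (size_t i = 0; i < n; ++i) s += x[i];
    return s;
}

#if (defined(__GNUC__) || defined(__clang__)) && \
    (defined(__x86_64__) || defined(__i386__))
#define X86_DISPATCH 1
/* Stand-ins for vectorized variants. In a real library each would live in
 * its own translation unit compiled with the matching -m flag; here they
 * simply forward to the scalar loop so the sketch builds anywhere. */
static double sum_sse2(const double *x, size_t n)    { return sum_scalar(x, n); }
static double sum_avx(const double *x, size_t n)     { return sum_scalar(x, n); }
static double sum_avx2(const double *x, size_t n)    { return sum_scalar(x, n); }
static double sum_avx512f(const double *x, size_t n) { return sum_scalar(x, n); }
#endif

/* Query the CPU once and return the most capable variant it supports. */
static kernel_choice select_kernel(void) {
#ifdef X86_DISPATCH
    __builtin_cpu_init(); /* initialize the compiler's CPU feature data */
    if (__builtin_cpu_supports("avx512f"))
        return (kernel_choice){"avx512f", sum_avx512f};
    if (__builtin_cpu_supports("avx2"))
        return (kernel_choice){"avx2", sum_avx2};
    if (__builtin_cpu_supports("avx"))
        return (kernel_choice){"avx", sum_avx};
    if (__builtin_cpu_supports("sse2"))
        return (kernel_choice){"sse2", sum_sse2};
#endif
    return (kernel_choice){"scalar", sum_scalar};
}
```

A caller would invoke `select_kernel()` once at startup and reuse the returned function pointer, so a single portable binary uses the widest instruction set of the machine it runs on without being rebuilt for it.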