- Feb 16, 2024
-
-
Andreas Klöckner authored
-
- Dec 03, 2023
-
-
- Oct 18, 2023
-
-
Isuru Fernando authored
* improve fallback for vec types * fix flake8
-
- Oct 13, 2023
-
-
Isuru Fernando authored
* Support M2L with FFT on ToyContext * flake8 fixes * Fix src_rscale, tgt_rscale ordering * More tests with FFT
-
- Oct 11, 2023
-
-
Isuru Fernando authored
* Use sumpy.toys in test_translations * flake8 fixes
-
- Oct 09, 2023
-
-
Isuru Fernando authored
* Fix enqueue_marker for NVIDIA CUDA * update comment * check for same queue * Fix bad merge
-
- Sep 25, 2023
-
-
- Sep 17, 2023
-
-
Andreas Klöckner authored
-
- Sep 09, 2023
-
-
- Aug 04, 2023
-
-
- Aug 03, 2023
-
-
- Jul 30, 2023
-
-
- Jul 28, 2023
-
-
- Jul 25, 2023
-
-
This avoids long-lived references to CL kernels held by loopy caches
-
- Jul 19, 2023
-
-
Andreas Klöckner authored
-
- Jun 02, 2023
-
-
- May 29, 2023
-
-
gives a 40% performance boost in CUDA
-
- May 25, 2023
-
-
- May 23, 2023
-
-
Isuru Fernando authored
-
-
- May 21, 2023
-
-
Isuru Fernando authored
-
- May 17, 2023
-
-
Isuru Fernando authored
* Use pytential branch * Refactor E2P * try new loopy branch * fix formatting * disable domains check * register only if not found * Move kernel_scaling to the outer kernel * Refactor P2E * Use loopy main * re-enable implemented domains check * Rename some loopy kernel handling functions --------- Co-authored-by:
Andreas Kloeckner <inform@tiker.net>
-
- May 02, 2023
-
-
- Apr 30, 2023
-
-
- Apr 29, 2023
-
-
- Apr 25, 2023
-
-
- Apr 22, 2023
-
-
Isuru Fernando authored
* Move derivative taker to a separate file * Add fold markers * fix docs
-
- Apr 06, 2023
-
-