@@ -23,8 +23,17 @@ Meshing improvements
2323 Hessian for r-adaptivity using discrete fields, and allows use of skewness
2424 and orientation based metrics.
2525
26- Improved GPU capabilities
27- -------------------------
26+ Performance improvements
27+ ------------------------
28+ - Added support for explicit vectorization in the high-performance templated
29+ code, which can now take advantage of specific intrinsics classes on the
30+ following architectures:
31+ - x86 (SSE/AVX/AVX2/AVX512),
32+ - Power8 & Power9 (VSX),
33+ - BG/Q (QPX).
34+ These are now enabled by default, and can be disabled with MFEM_USE_SIMD=NO.
35+ See the new file linalg/simd.hpp and the new directory linalg/simd.
36+
2837- Added support for Chebyshev accelerated polynomial smoother on GPU.
2938
3039Discretization improvements
@@ -57,10 +66,17 @@ Linear and nonlinear solvers
5766- Added initial support for h- and p-multigrid solvers and preconditioners for
5867 matrix-based and matrix-free discretizations with basic GPU capability.
5968
69+ - Added a new IterativeSolverMonitor class that allows to monitor the residual
70+ and solution during the solving process of an IterativeSolver after every
71+ iteration.
72+
6073- Block arrays of parallel matrices can now be merged into a single parallel
6174 matrix with the function HypreParMatrixFromBlocks. This could be useful for
6275 solving block systems with parallel direct solvers such as STRUMPACK.
6376
77+ - In SLISolver, changed the residual inner product from (Br,r) to (Br,Br) so the
78+ solver can work with non-SPD preconditioner B.
79+
6480New and updated examples and miniapps
6581-------------------------------------
6682- Added a new example, Example 25/25p, to demonstrate the use of a Perfectly
@@ -98,9 +114,6 @@ Improved testing
98114
99115Miscellaneous
100116-------------
101- - In SLISolver, changed the residual inner product from (Br,r) to (Br,Br) so the
102- solver can work with non-SPD preconditioner B.
103-
104117- Added support for ADIOS2 for parallel I/O with ParaView visualization. The
105118 classes adios2stream and ADIOS2DataCollection are introduced in mfem as the
106119 interfaces to generate ADIOS2 Binary Pack (BP4) directory datasets for the
@@ -110,9 +123,8 @@ Miscellaneous
110123- The integration order used in the ComputeLpError and ComputeElementLpError
111124 methods of class GridFunction has been increased.
112125
113- - Added a new IterativeSolverMonitor class that allows to monitor the residual
114- and solution during the solving process of an IterativeSolver after every
115- iteration.
126+ - Various other simplifications, extensions, and bugfixes in the code.
127+
116128
117129Version 4.1, released on March 10, 2020
118130=======================================
0 commit comments