MIPP
MIPP copied to clipboard
MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX, AVX-512 and SVE (length specific).
In MIPP\tests\src\bitwise_operations\xorb.cpp, I was trying to test this function, ``` template void test_msk_xorb() { constexpr int N = mipp::N(); bool inputs1[N], inputs2[N]; std::mt19937 g; std::uniform_int_distribution dis(0, 1); for (auto t...
The result of testz(m_not) is incorrect. It should be 1 instead of 0. The result is correct on AVX2. **Code:** ``` std::cout
* added simple cmake support - install just copies the header files * implemented interleave() for AVX; there are more variants, which are missing AVX implementations. i would have preferred...
trying to create a function with `constexpr mipp::Reg` return, the compiler refuses with following error message: > /home/ayguen/.local/include/mipp/mipp_object.hxx:23:7: note: ‘mipp::Reg’ is not literal because: 23 | class Reg | ^~~...
When compiling with MSVC NEON intrinsics, reg(=float32x4_t) and msk(=uintt32x4_t) are considered as the same type (__n128), causing some of the function overloading to appear like duplication. The current change is...
Hi, Unless I haven't looked in the right places, there doesn't seem to be any support lane-wise lzcnt or popcount. Is it foreseen in the future or out of scope...
I get a segmentation fault when I call `mipp::load(ptr)` when optimizations are turned off (my GCC flags `-O0 -march=skylake`). As far as I can tell, it happens because the body...
Why the function "mipp::N()" just returned 1 ?
To make it easier to capture misconfiguration or lacking support of certain instrinsics, I would like to suggest the following change: instead of throwing std::runtime_error when using unsupported intrinsics use...
for ARM: __ARM_FEATURE_FMA macro is not defined for SSE: fmadd could implement as: _mm_fmadd_ps Is there any reason ? thanks