pub fn _mm512_kunpackb(a: u16, b: u16) -> u16
stdarch_x86_avx512
Unpack the low 8 bits of masks a and b and concatenate them into a 16-bit mask: the low 8 bits of a form bits 15:8 of the result, and the low 8 bits of b form bits 7:0.
Intel’s documentation
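The bit placement can be checked without AVX-512 hardware using a portable scalar model. This is a minimal sketch, assuming the semantics in Intel's pseudocode for this intrinsic (result bits 7:0 come from b, bits 15:8 come from the low byte of a). The helper name `kunpackb_model` is hypothetical and is not part of `core::arch`.

```rust
/// Hypothetical scalar model of `_mm512_kunpackb`, for illustration only.
/// Assumes the low byte of `a` becomes the high byte of the result and the
/// low byte of `b` becomes the low byte of the result.
pub fn kunpackb_model(a: u16, b: u16) -> u16 {
    // Keep only the low 8 bits of each mask; the upper 8 bits are ignored.
    let hi = (a & 0x00ff) << 8;
    let lo = b & 0x00ff;
    hi | lo
}
```

Under this model, `kunpackb_model(0x12AB, 0x34CD)` yields `0xABCD`: the upper bytes `0x12` and `0x34` are discarded, and the two low bytes are placed side by side rather than bit-interleaved.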