Diazonium
Diazonium
MKVtoolnix has a number of input modules for demuxing various media containers. Could be worth a look.
*GEMV is often found as one of the bottlenecks of pivoting/rank-revealing QR/SVD algorithms. As such, it might be the most important primitive L2 operation to optimize/parallelize.
For the most part it is enough, the only thing that is still not clear is how the template files are expected to be structured. Having an example template should...