Jacob Hinkle issues

Results 15 issues of


                                            Jacob Hinkle

Enable cat for nvfuser >= 0.1.7

Benchmarks are neutral. Before: ``` --------------------------------------------------------------------------------------------------------------------- benchmark: 24 tests ---------------------------------------------------------------------------------------------------------------------- Name (time in us) Min Max Mean StdDev Median IQR Outliers OPS Rounds Iterations ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- test_nanogpt_sdpa_fwd[thunder] 85.9729 (1.0) 123.5039 (1.0)...

Demonstration of parallel atlas building

**Is your feature request related to a problem? Please describe.** This is not really a problem but would make for a nice demo. **Describe the solution you'd like** It would...

enhancement

good first issue

Sphinx documentation

**Is your feature request related to a problem? Please describe.** We currently have https://lagomorph.readthedocs.io/en/latest/ set up but there is no documentation yet. **Describe the solution you'd like** Add simple sphinx...

Unnecessary calls to `.contiguous()` hurt performance

**Describe the bug** There are many times in our python code where we force tensors to be contiguous. This is because before pytorch introduced `packed_accessor` it was pretty annoying to...

Support `local_rank` computation with MPI<3

**Is your feature request related to a problem? Please describe.** Trying to use mpirun less than version 3 with for example `lagomorph lddmm atlas` results in an error currently since...

Remove redundant PyTorch functionality

**Is your feature request related to a problem? Please describe.** Basically, some of the main work in lagomorph was already implemented by the pytorch team. I was unaware of some...

enhancement

Benchmarking suite

We need a simple way to run benchmarks for our low level functions, in addition to the tests we have which just ensure correctness. What I have in mind is...

enhancement

help wanted

good first issue

Jacob Hinkle

Enable cat for nvfuser >= 0.1.7

Demonstration of parallel atlas building

Sphinx documentation

Unnecessary calls to `.contiguous()` hurt performance

Support `local_rank` computation with MPI<3

Remove redundant PyTorch functionality

Benchmarking suite

Composition of affine with free-form deformations

CPU extension

Use __shfl_down for cuda affine interp backward