David Hou
There's no general policy on breaking changes in CONTRIBUTING.md; I think it'd be useful to have one.
WIP. Code/progress is not interesting at the moment; posting just for reference. Approach:
- _pool 4x4 with stride=2*stride
- permute kernel HW to the first 2 dims
- winograd by...
This is part 1/k for winograd (#1037). I will be splitting off orthogonal changes from that PR so that I can get some feedback / have less going...
this used to at least produce non-NaN losses. testing it...
these should all pass (or assert); not comprehensive!
Ensure occupancy. Optimize layout for group. Don't waste big memory. Probably really slow without beam.
There are many places in the codebase where fp16 is not adequate for some particular calculation. sum() is handled well; it selects the least upper dtype with float32. We might have...
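For reference, a rough sketch of the "least upper dtype with float32" idea for reductions (the reduce_acc_dtype helper is hypothetical, not tinygrad's API; numpy is used only for illustration):

```python
import numpy as np

# Hypothetical rule: reductions accumulate in the least upper bound of the
# input dtype and float32, so fp16 inputs get a float32 accumulator while
# float64 inputs keep float64.
def reduce_acc_dtype(dtype):
    return np.promote_types(dtype, np.float32)

print(reduce_acc_dtype(np.float16))  # float32
print(reduce_acc_dtype(np.float64))  # float64

x = np.full(4096, 0.1, dtype=np.float16)
acc = x.sum(dtype=reduce_acc_dtype(x.dtype))  # accumulate in float32
out = acc.astype(x.dtype)                     # cast back to the input dtype if desired
```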
when a tensor has a different dtype than default_float, backward will initialize the gradient to the wrong dtype (default_float instead of self.dtype)
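A toy sketch of the pattern (the Toy class is hypothetical, not tinygrad's Tensor): the seed gradient in backward should follow the tensor's own dtype rather than default_float.

```python
import numpy as np

DEFAULT_FLOAT = np.float32  # stand-in for default_float

class Toy:
    """Hypothetical tensor-ish class, only to show the seed-gradient dtype issue."""
    def __init__(self, data, dtype=DEFAULT_FLOAT):
        self.data = np.asarray(data, dtype=dtype)
        self.dtype = self.data.dtype
        self.grad = None

    def backward(self):
        # Buggy: np.ones_like(self.data, dtype=DEFAULT_FLOAT) would seed the
        # gradient with default_float even for fp16 tensors.
        # Correct: seed with the tensor's own dtype.
        self.grad = np.ones_like(self.data, dtype=self.dtype)

t = Toy([1.0, 2.0], dtype=np.float16)
t.backward()
print(t.grad.dtype)  # float16, matching the tensor instead of default_float
```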
a previously failing test and a quick fix. checking for unsafe pads can just be its own pass. need to think about this expand rule!
if you write t.expand(), the backward is sum(), but if t is fp16, then the sum will also be in fp16 and may lose precision or overflow. need to measure...
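A quick numpy illustration (not tinygrad code) of why this is risky: the backward of expand is a sum over the expanded axis, and fp16 cannot hold a large accumulated value.

```python
import numpy as np

# Forward: expanding a (1,)-shaped t to (100_000,) just broadcasts it.
# Backward: the gradient of expand is a sum over the expanded axis.
upstream_grad = np.ones(100_000, dtype=np.float16)

grad_fp16 = upstream_grad.sum(dtype=np.float16)  # accumulate in fp16
grad_fp32 = upstream_grad.sum(dtype=np.float32)  # accumulate in fp32

print(grad_fp16)  # inf or a badly truncated value: fp16 tops out at 65504
print(grad_fp32)  # 100000.0
```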