Results: 15 comments of Yoni Kremer

In my experience with TensorFlow, the time estimation is pretty accurate, so I think a best-effort estimate should give accurate results.

I'm pretty sure dataset size means the size of the raw text.

It looks like the problem is loading data/saved_embeddings/train/c6-whitened-256_4.parquet.gzip into a DataFrame.
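
A minimal reproduction sketch, assuming the file is read with pandas (the exact loading code isn't shown here, so this is an illustration only):

```python
import pandas as pd

# Hypothetical reproduction: try to load the embeddings file directly.
# Requires a parquet engine such as pyarrow or fastparquet to be installed.
df = pd.read_parquet("data/saved_embeddings/train/c6-whitened-256_4.parquet.gzip")
print(df.shape)
```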

Two tests fail due to this issue: `TestArrayObjectComparison::test_eq_object` and `TestArrayObjectComparison::test_ne_object`.

@kmaehashi @leofang I get why you don't want to implement it that way, but CuPy is supposed to be NumPy-compatible. In addition, some tests fail due to this issue: In...

In numpy 2.1, I get:

```
>>> x_np = np.array([4]).astype(np.float32)
>>> y1_np = np.array([2])  # int64
>>> y2_np = np.array(2)  # int64
>>> y3_np = 2
>>> x_np / y1_np...
```
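
For context, a minimal sketch of the result dtypes I'd expect under NumPy 2.x promotion rules (NEP 50), where Python scalars are "weak" but integer arrays, including 0-d ones, are not. The printed dtypes below are my own illustration, not quoted from the original comment:

```python
import numpy as np

x_np = np.array([4]).astype(np.float32)
y1_np = np.array([2])   # 1-d int64 array
y2_np = np.array(2)     # 0-d int64 array
y3_np = 2               # Python int (weak scalar under NEP 50)

# int64 arrays (including 0-d) promote float32 to float64.
print((x_np / y1_np).dtype)  # float64
print((x_np / y2_np).dtype)  # float64
# A plain Python int does not upcast: the result stays float32.
print((x_np / y3_np).dtype)  # float32
```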

I started thinking about it. In most cases, top-k is very small compared to the vocab size (100 vs 100k), so maybe storing the results as a sparse tensor would...

I think that, later on, computing softmax and sampling from a sparse tensor should be much faster.
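
A rough sketch of the idea, using NumPy and a plain (indices, values) top-k representation in place of an actual sparse tensor type; the function and variable names are my own illustration, not from the original comments:

```python
import numpy as np

def topk_softmax_sample(logits: np.ndarray, k: int, rng: np.random.Generator) -> int:
    """Sample a token id using only the top-k logits.

    Keeping just (indices, values) for the top k entries is effectively a
    sparse representation: softmax and sampling then cost O(k) instead of
    O(vocab_size).
    """
    # Top-k indices (order within the top-k set doesn't matter for sampling).
    top_idx = np.argpartition(logits, -k)[-k:]
    top_vals = logits[top_idx]

    # Numerically stable softmax over the k retained values only.
    top_vals = top_vals - top_vals.max()
    probs = np.exp(top_vals)
    probs /= probs.sum()

    # Sample within the top-k set, then map back to a vocabulary index.
    return int(rng.choice(top_idx, p=probs))

rng = np.random.default_rng(0)
logits = rng.normal(size=100_000)   # vocab size ~100k
token_id = topk_softmax_sample(logits, k=100, rng=rng)
print(token_id)
```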

How can I check the numerical stability of the kernel?
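
One common way to probe this (my own suggestion, not from the thread) is to run the same computation in low and high precision and compare the relative error against the higher-precision reference:

```python
import numpy as np

def max_relative_error(result: np.ndarray, reference: np.ndarray) -> float:
    """Maximum elementwise relative error against a higher-precision reference."""
    reference = reference.astype(np.float64)
    denom = np.maximum(np.abs(reference), np.finfo(np.float64).tiny)
    return float(np.max(np.abs(result.astype(np.float64) - reference) / denom))

# Hypothetical check: a stand-in float32 computation vs. a float64 reference.
x = np.random.default_rng(0).normal(size=10_000).astype(np.float32)
fast = np.exp(x) / np.exp(x).sum()
ref = np.exp(x.astype(np.float64)) / np.exp(x.astype(np.float64)).sum()
print(max_relative_error(fast, ref))
```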