Bruno Casella

35 comments by Bruno Casella

Yes, I will run the experiment for 1000 epochs. In the meantime, I am rerunning the same 200-epoch experiment to check whether it was a...

Hello everyone. I have just completed the 3 runs for 1000 epochs. Besides time, I have also collected current and peak memory for each epoch using `tracemalloc`. I followed this...
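For reference, the per-epoch current/peak measurement with `tracemalloc` can be sketched as below. This is a minimal sketch, not the actual code used: `train_one_epoch` is a placeholder for the real PyTorch training step in mnist.py.

```python
import tracemalloc

def train_one_epoch():
    # Placeholder for the real training step in mnist.py;
    # allocate something so the traced numbers are non-zero.
    data = [0.0] * 100_000
    return sum(data)

tracemalloc.start()
for epoch in range(3):
    tracemalloc.reset_peak()  # restrict the peak to this epoch only (Python 3.9+)
    train_one_epoch()
    current, peak = tracemalloc.get_traced_memory()
    print(f"epoch {epoch}: current={current / 1e6:.2f} MB, peak={peak / 1e6:.2f} MB")
tracemalloc.stop()
```

Note that `tracemalloc` only traces allocations made through Python's allocator, so memory allocated by native libraries (e.g. PyTorch tensors backed by C++ buffers) may not show up, which could explain discrepancies between runs.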

> Well, looks like `tracemalloc` shows incorrect data for Gramine. Or maybe it shows incorrect data for PyTorch in general? Please check what is shown for normal training... Sure, I...

Here are the current and peak memory for normal training:

Yes @dimakuv, I already started this experiment this morning: `sudo perf record --call-graph dwarf -F 50 -e cpu-clock gramine-direct ./pytorch mnist.py`. I will update you when it...

> Also, one thing which seems suspicious to me: why the running time is so noisy without Gramine but then gets very stable with it, both direct and SGX? It...

@monavij Tomorrow I will run the experiment with `sgx.preheat_enclave = true`. However, according to the documentation, `Using this option makes sense only if the whole enclave memory fits into [EPC](https://gramine.readthedocs.io/en/stable/sgx-intro.html#term-epc)...

In my manifest there are these options:

```
loader.pal_internal_mem_size = "128M"
sgx.enclave_size = "4G"
sgx.max_threads = 32
sgx.edmm_enable = {{ 'true' if env.get('EDMM', '0') == '1' else 'false' }}
```

...

@dimakuv I was running the 1000-epoch experiment with `gramine-direct` while recording metrics with perf. However, the generated `perf.data` file grew past 120 GB... and training stopped because there was no...
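As a sanity check on why `perf.data` grows so large, here is a back-of-the-envelope estimate. The 8 KiB per-sample stack dump is perf's default for `--call-graph dwarf`, and `-F 50` gives 50 samples per second; the CPU count and run length below are assumptions for illustration, not measurements from the actual run.

```python
# Rough estimate of perf.data growth with DWARF call graphs.
sample_bytes = 8 * 1024        # default user-stack dump size for --call-graph dwarf
freq_hz = 50                   # from -F 50
cpus = 8                       # assumed number of sampled CPUs
run_seconds = 2 * 24 * 3600    # assumed ~2-day run for 1000 epochs

total_bytes = sample_bytes * freq_hz * cpus * run_seconds
print(f"~{total_bytes / 1e9:.0f} GB of raw samples")
```

Even with conservative inputs the raw stack samples reach hundreds of gigabytes, so a multi-day run easily exceeds 120 GB; lowering `-F` or shrinking the dump size with `--call-graph dwarf,<size>` would reduce the file.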

Ok @dimakuv

> 1. The performance plots for the experiment with 1000 epochs (no perf enabled)

This is already posted.

> 2. The performance analysis for the experiment with e.g....