Abhay Shukla
Abhay Shukla
**Between small sized, large number of part parquet files vs large sized, fewer part parquet files, which one is better for NVTabular's data loading performance on GPU?** When writing parquet...
**Describe the bug** A clear and concise description of what the bug is. **To Reproduce** Steps to reproduce the behavior: 1. Use AMI https://aws.amazon.com/releasenotes/deep-learning-ami-gpu-tensorflow-2-9-ubuntu-20-04/ to spin a ec2 cluster instance...
The model training loss is suddenly dropping to 0 after over 1000 steps. I've tried iterating over different dataset as well but got the same behaviour. ## Details I am...
I installed the conda environment using `conda env create -f envs/pyspark-330-delta-220` and tried running the notebook [01_quickstart.ipynb](https://github.com/delta-io/delta-examples/blob/master/notebooks/pyspark/01_quickstart.ipynb) but I get the following error: ``` --------------------------------------------------------------------------- RuntimeError Traceback (most recent call...
Problem: There are no examples in the [documentation](https://github.com/catboost/catboost/tree/master/catboost/spark/catboost4j-spark/core/src/test/python) for CatBoostRanker API for Spark, is the support coming in future releases?