janelu9

Results 25 comments of janelu9

> @janelu9 yes it should, hope that resolves your question. Is this question specifically related to one of the estimators (eg LIME, lightgbm, KNN, CNTKModel etc?) I cont import mmlspark...

root@DESKTOP-OPMDKT7:~/.ivy2# spark-shell --master yarn --jars file:///root/.ivy2/jars/* Setting default log level to "WARN". To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel). Spark context Web UI available at http://localhost:4040 Spark...

@imatiach-msft I try to start pyspark by this way ``` pyspark --master yarn \ --conf spark.dist.pyFiles=file:///....jar \ --conf spark.submit.pyFiles=file:///....jar \ --conf spark.yarn.dist.jars=file:///....jar ``` It still con't be used normally,though mmlspark...

My model was trained in bf16 mode ,when loading ckpt with `load_optimizer_states = False`, it still trys to load it . I avoid that by the following : ``` engine._config.bfloat16_enabled...

number of elements of one piece of data should be less than max(int32),that is the upper limit of one parquet file. try to split the data to pieces by row...

我发现强化学习部分的训练 有可以训练无监督数据是吗?它的原理是什么?根据当前片段的句子直接生成下一个片段的句子吗?

我发现强化学习部分的训练 有可以训练无监督数据是吗?它的原理是什么?根据当前片段的句子直接生成下一个片段的句子吗?

> Yes, it is :) you just need the query/prompt part. prompt 是 双向attention,answers是单向attention 无监督训练应该是用单向吧

`kl_divergence_estimate = -self.kl_ctl * (log_probs - ref_log_probs)`

> At beginning in stage3, kl_divergence_estimate should be zero. But, after several steps, the generation of actor model might be different from reference model. Please correct me if I make...