janelu9

[email protected]

icbc.sdc Shanghai, China

Results 25 comments of


                                            janelu9

Does mmlspark support running on yarn?

> @janelu9 yes it should, hope that resolves your question. Is this question specifically related to one of the estimators (eg LIME, lightgbm, KNN, CNTKModel etc?) I cont import mmlspark...

Does mmlspark support running on yarn?

root@DESKTOP-OPMDKT7:~/.ivy2# spark-shell --master yarn --jars file:///root/.ivy2/jars/* Setting default log level to "WARN". To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel). Spark context Web UI available at http://localhost:4040 Spark...

Does mmlspark support running on yarn?

@imatiach-msft I try to start pyspark by this way ``` pyspark --master yarn \ --conf spark.dist.pyFiles=file:///....jar \ --conf spark.submit.pyFiles=file:///....jar \ --conf spark.yarn.dist.jars=file:///....jar ``` It still con't be used normally，though mmlspark...

Fixes for training models with bf16 + freshly initialized optimizer via `load_module_only`

My model was trained in bf16 mode ,when loading ckpt with `load_optimizer_states = False`, it still trys to load it . I avoid that by the following : ``` engine._config.bfloat16_enabled...

OSError: List index overflow.

number of elements of one piece of data should be less than max(int32)，that is the upper limit of one parquet file. try to split the data to pieces by row...

how to train a unsupervised dataset？

我发现强化学习部分的训练有可以训练无监督数据是吗？它的原理是什么？根据当前片段的句子直接生成下一个片段的句子吗？

how to train a unsupervised dataset？

我发现强化学习部分的训练有可以训练无监督数据是吗？它的原理是什么？根据当前片段的句子直接生成下一个片段的句子吗？

how to train a unsupervised dataset？

> Yes, it is :) you just need the query/prompt part. prompt 是双向attention，answers是单向attention 无监督训练应该是用单向吧

if ref_model is a copy of act_model at begining in stage3 , does it mean the kl_divergence is 0?

`kl_divergence_estimate = -self.kl_ctl * (log_probs - ref_log_probs)`

if ref_model is a copy of act_model at begining in stage3 , does it mean the kl_divergence is 0?

> At beginning in stage3, kl_divergence_estimate should be zero. But, after several steps, the generation of actor model might be different from reference model. Please correct me if I make...

1
2
3
›