Chen Qian
In 2.11 the optimizer does lazy loading. If you want to explicitly restore the variable values, you need to call `optimizer.build(model.trainable_variables)`, which is otherwise called automatically at the first time of...
```
import tensorflow as tf

print(tf.__version__)
print(tf.keras.__version__)

model = tf.keras.Sequential(
    [
        tf.keras.Input(shape=(1,)),
        tf.keras.layers.Dense(1, activation="softmax"),
    ]
)
model.compile(optimizer="adam", loss="categorical_crossentropy")
model.fit([[1]], [0], verbose=0)
model.save("model")

new = tf.keras.models.load_model("model")
new.load_weights("model")
new.optimizer.build(model.trainable_variables)
print([v.name for v...
```
@lgeiger Thanks for reporting the issue! Could you try moving the `model.compile()` call under the strategy scope and rerunning the tests in your setup? Also, is it only failing with SGD or...
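For clarity, here is a minimal sketch of what "moving `model.compile()` under strategy scope" looks like. The `MirroredStrategy` and the toy model are illustrative assumptions, not the failing test itself:

```
import tensorflow as tf

# Illustrative assumption: MirroredStrategy stands in for whatever
# distribution strategy the failing test actually uses.
strategy = tf.distribute.MirroredStrategy()

with strategy.scope():
    model = tf.keras.Sequential(
        [
            tf.keras.Input(shape=(1,)),
            tf.keras.layers.Dense(1),
        ]
    )
    # Compiling inside the scope makes the optimizer create its
    # variables under the same strategy as the model's variables.
    model.compile(optimizer="sgd", loss="mse")
```

The point is only where the `compile()` call sits relative to `strategy.scope()`; everything else should match your existing setup.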
This is strange; my recent change should not take effect unless users set the environment variable `MLFLOW_ASYNC_LOGGING_WAITING_TIME`, which is not yet publicly documented.
Yes, technically this thread pool shuts down once every job is finished. I don't know why there is a regression; I need to take a closer look at the user's...
This makes a lot of sense. @borchero Would love to see your PR!
@Nasreddine MIPRO and SIMBA should be better than COPRO in almost every case, and we will remove COPRO in the near future.
@Nasreddine Thanks for reporting the issue! I am a bit confused by your LM response:

```
LM Response: {"type": "function", "name": "json_tool_call", "parameters": {"discussion": "The predict module is to blame...
```
I would rather just delete this weird warning and let it error out. That may sound risky, but I am pretty sure it reduces users' confusion.
@davruet Thanks for reporting the issue! Could you provide reproducible code on gpt-4.1 that doesn't have the complete identifier? This is supposed to be internal adapter logic, and I...