Fucheng Warren Zhu

Results 20 issues of Fucheng Warren Zhu

Currently hface environment is set up as follows in `CausalImage_ImageModelBackbones.R` ``` conda create -n hface python=3.11 conda activate hface python3 -m pip install --upgrade transformers torch tensorflow tensorflow_datasets pillow pip...

I am not entirely certain, but it seems that the current default install doesn't work on a MacBook (at least when I do it manually), since it installs x86-64 versions of...

**Issue** I encountered an issue while trying to create a new conda environment using the `causalimages::BuildBackend` function. The process fails with the following error: ``` Error while loading conda entry...

This may be a feature of the pipeline rather than a bug, but for RandomProjection ViT or CNN (and most likely other models, but I couldn't run it on my...

Fixes #135859, following the approach detailed in the [Dynamic Shapes Manual](https://docs.google.com/document/d/1GgvOe7C8_NVOMLOCwDaYV1mXXyHMXY7ExoewHqooxrs/edit?tab=t.0#heading=h.kzbllkiwjdpm). Adds support for torch.bmm, torch.topk, torch.linalg.cholesky, and torch.linalg.norm. For bmm.out and topk.out, added an out_wrapper decorator in torch/_meta_registrations.py. For...

open source
topic: not user facing
module: dynamo

Bash Mode sends the line you type straight to the persistent shell session backing Claude Code, and is great for quick commands exploring the directory. It can be enabled by...

documentation
enhancement

I often need to revisit responses to past prompts, or jump to the start of the prompt when I receive the response. Currently with Claude Code I haven't found a...

enhancement
area:tui

# Bug Report: `create_sdk_mcp_server()` passes unsupported `version` parameter causing TypeError ## Summary Agent deployment fails with `TypeError: Server.__init__() got an unexpected keyword argument 'version'` when using `create_sdk_mcp_server()`. The SDK passes...

bug

# Maintain Constant Global Batch Size Upon Failure With the current implementation of `DistributedSampler`, the `global_batch_size` is `group_batch_size * num_replica_group`. It may be preferable if the DistributedSampler is implemented...
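The relationship in this excerpt, `global_batch_size = group_batch_size * num_replica_group`, means the global batch shrinks whenever a replica group drops out. As a rough illustration of the arithmetic only, the sketch below redistributes a fixed global batch size across however many groups remain. The function name `per_group_batch_sizes` and its parameters are hypothetical and are not part of `DistributedSampler` or any library API. The excerpt is truncated, so this may not match the implementation the issue actually proposes.

```python
def per_group_batch_sizes(global_batch_size: int, num_replica_groups: int) -> list[int]:
    """Split a fixed global batch size across the replica groups that are alive.

    Hypothetical helper for illustration: when the number of groups does not
    divide the global batch size evenly, the first few groups take one extra
    sample so that the sizes always sum to global_batch_size.
    """
    if num_replica_groups <= 0:
        raise ValueError("at least one replica group must be alive")
    if global_batch_size < 0:
        raise ValueError("global_batch_size must be non-negative")

    base, extra = divmod(global_batch_size, num_replica_groups)
    return [base + (1 if i < extra else 0) for i in range(num_replica_groups)]


# Four healthy groups share the batch evenly.
healthy = per_group_batch_sizes(64, 4)

# After one group fails, the remaining three absorb its share,
# so the global batch size is unchanged.
degraded = per_group_batch_sizes(64, 3)
```

Under the current formula, the same failure would instead reduce the global batch from `16 * 4` to `16 * 3`.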

**Description:** The current failure model in TorchFT handles node failures by zeroing out all accumulated gradients and recalculating them in the subsequent forward/backward pass. This approach, while ensuring correctness, leads...