Tran Thanh Luan

ZenAI Hanoi Mr. Simple

Results: 12 comments of Tran Thanh Luan

Error when run eval.py

> I also meet this, could anyone answer? ![image](https://user-images.githubusercontent.com/92072154/167101872-2e0aac2d-2630-4bb8-9aa5-3b8f1048d1c3.png)

I solved my problem: add `map_location`, set to "cpu" when training and "cuda" when testing.
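A minimal sketch of that fix, assuming the checkpoint is loaded with PyTorch's `torch.load` (the original `eval.py` is not shown here). The `load_checkpoint` helper, the `nn.Linear` stand-in model, and the temporary checkpoint path are hypothetical and exist only for illustration.

```python
import os
import tempfile

import torch
import torch.nn as nn


def load_checkpoint(model, path, device):
    """Hypothetical helper: load saved weights onto `device`."""
    # map_location remaps the stored tensors to the requested device,
    # regardless of the device they were saved from.
    state_dict = torch.load(path, map_location=device)
    model.load_state_dict(state_dict)
    return model.to(device)


# Stand-in model and checkpoint file, only for this illustration.
model = nn.Linear(4, 2)
ckpt_path = os.path.join(tempfile.mkdtemp(), "model.pt")
torch.save(model.state_dict(), ckpt_path)

# "cpu" when training, as in the comment above.
train_model = load_checkpoint(nn.Linear(4, 2), ckpt_path, "cpu")

# "cuda" when testing; this sketch falls back to "cpu" if no GPU is present.
test_device = "cuda" if torch.cuda.is_available() else "cpu"
test_model = load_checkpoint(nn.Linear(4, 2), ckpt_path, test_device)
```

The fallback to "cpu" is an addition for machines without a GPU; it is not part of the original fix.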

Can't mint tokens from faucet locally

@ermenkov I resolved the `FaucetDisabled` exception by changing `cargo build --release` to `cargo build --release --features pow-faucet` in `scripts/localrun.sh`, but then I hit another issue when creating a new subnet: ![image](https://github.com/opentensor/bittensor-subnet-template/assets/92072154/ca67a16c-8b3c-4c4d-8a4a-f7db1bb10b9f)

Can't mint tokens from faucet locally

@ermenkov I resolved it by commenting out all the rate-limit exceptions in the source code. That's fine for me because I only want to run locally.

Error: one of the variables needed for gradient computation has been modified by an inplace operation

@calclavia same here

Why weight value must be int in hashing weights commit?

@ibraheem-opentensor Thanks, I'm going to try it.

[Core] generate from input embeds

@DarkLight1337 @Nan2018 Thanks, this is an essential PR. Could you also make it supported by the OpenAI-compatible endpoint? That would make it convenient to call it by...

[Core] generate from input embeds

> > @DarkLight1337 @Nan2018 Thanks, this is an essential PR. Could you also make it supported by the OpenAI-compatible endpoint? That would make it convenient to call...

Training loss curve on V2

@Zyvpeng Thanks for the response. I see the paper mentions that the loss is under 0.1, so should 0.xx here be 0.0x? I have made progress in training it...

Training loss curve on V2

@Kirili4ik There is a trick: setting the position identifiers for the soft tokens can make the model converge quickly. See https://arxiv.org/pdf/2409.14364v2

Example:
- context positions: [1,2,3,4,5,6,7,8]
- soft-tokens icae positions: [9, 10, 11,...

Add Disk_Address_Manager

@qyy2003 It looks great. I assume that with this feature I can manually create KV cache tensors of shape (n_layers, 2, batch_size, seq_len, n_head, head_dim), then save them to disk with a random hash...
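A rough sketch of what that assumption could look like with plain PyTorch. It does not use the `Disk_Address_Manager` from the PR, whose API is not shown here. The sizes, the file layout, and the use of `uuid4` as the "random hash" are all assumptions made for illustration.

```python
import os
import tempfile
import uuid

import torch

# Small illustrative sizes; real models are far larger.
n_layers, batch_size, seq_len, n_head, head_dim = 2, 1, 8, 4, 16

# Shape from the comment above. Treating index 0 of the second axis as keys
# and index 1 as values is an assumed convention.
kv_cache = torch.zeros(n_layers, 2, batch_size, seq_len, n_head, head_dim)

# Use a random identifier as the file name (stand-in for the "random hash").
cache_dir = tempfile.mkdtemp()
cache_key = uuid.uuid4().hex
cache_path = os.path.join(cache_dir, f"{cache_key}.pt")
torch.save(kv_cache, cache_path)

# Load it back to confirm the round trip preserves shape and contents.
restored = torch.load(cache_path)
```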
