Boyuan Feng
Boyuan Feng
### Summary I am profiling the wasm performance of a function, which runs for more than 5 minutes. Then I get the following error: `got: {"sessionId":"6eed583a92cc19dbbecdfb0c114d7d25","status":21,"value":{"message":"timeout: Timed out receiving message...
In this PR we have added a stateless interface for `compile()`, `prove()`, `verify()`, `struct ProvingKey`, and `struct VerifyingKey`. Close #128
## Summary of Bug `PartialOrd` for `Variable` violates `transitivity` and `duality` properties. This [code](https://github.com/arkworks-rs/snark/blob/master/relations/src/r1cs/mod.rs#L122-L142) prints both `0 ## Steps to Reproduce https://play.rust-lang.org/?version=stable&mode=debug&edition=2021&gist=51b860bb4a8854d6636e740e8be1c9ce
Hi, Thanks for the great optimizations on batch affine! When reading the implementation for `add_affine_points_with_edge_cases(...)`, I found the following [for loop](https://github.com/AztecProtocol/barretenberg/blob/master/barretenberg/src/aztec/ecc/curves/bn254/scalar_multiplication/scalar_multiplication.cpp#L375) which seems to iterate forever. ` for (size_t i...
Summary: `constraints` argument for `torch.export` has been deprecated in favor of the `dynamic_shapes` argument. This PR updates the use of the deprecated API in `apf`. In addition, this diff registers...
This is a follow-up issue for #146 and #147. We need saturated subtraction for the `uint128` struct which will be used to store asset values that cannot be below zero.
Our verifying process is running on substrate which uses wasmtime as the backend. We need a wasmtime benchmark in the ci to better understand the verifying latency impact from each...
Thanks for this great work! I am wondering where I can find benchmarks on the performance of libSTARK? For example, what is the prover and verifier time under various constraint...
Adds a helper function for getting the block mask for a specific row index during decoding. Tested in gpt-fast [pr](https://github.com/pytorch-labs/gpt-fast/pull/196).
This PR integrates flex decoding with gpt-fast. ### End-to-end performance gain of Llama2-7b Device: H100 Unit: tokens/sec Length | spda | Flex Decoding | Speedup -- | -- | --...