penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
Sometimes `nmap`'ed computations don't all fit in memory at once and there are not enough devices to shard the computation over (this limitation is particularly salient when using penzai...
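For context on the memory pressure described here, a minimal plain-JAX sketch (not penzai-specific; the function and shapes are purely illustrative) of the difference between vectorizing a mapped computation and evaluating it sequentially:

```python
import jax
import jax.numpy as jnp

def per_example(x):
    # Some memory-hungry per-example computation (illustrative).
    return jnp.tanh(x @ x.T).sum()

xs = jnp.ones((64, 512, 512))

# Vectorized: all 64 per-example intermediates are materialized at once.
vmapped = jax.vmap(per_example)(xs)

# Sequential: one example at a time, trading speed for peak memory.
scanned = jax.lax.map(per_example, xs)

assert jnp.allclose(vmapped, scanned)
```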
It'd be great if penzai supported model quantization out of the box. I know this is a lot of work to implement, but right now the lack of quantization...
## Changes

This PR adds missing attributes to the lists of handled/ignored configuration attributes in the model conversion functions for:

- Llama models (`llama_from_huggingface_model`)
- Mistral models (`mistral_from_huggingface_model`)
- GPT-NeoX...
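For readers unfamiliar with why missing entries in these lists matter, here is a simplified sketch of the validation pattern suggested by the ValueError quoted in the related issues; the attribute names and helper below are hypothetical, not penzai's actual code:

```python
# Hypothetical illustration of the handled/ignored attribute check; the real
# conversion functions in penzai.models.transformer.variants are more involved.
HANDLED_ATTRIBUTES = {"hidden_size", "num_attention_heads", "num_hidden_layers"}
IGNORED_ATTRIBUTES = {"use_cache", "torch_dtype"}

def check_config_attributes(config_dict: dict) -> None:
    """Rejects configuration attributes that are neither handled nor ignored."""
    unsupported = {
        key: value
        for key, value in config_dict.items()
        if key not in HANDLED_ATTRIBUTES and key not in IGNORED_ATTRIBUTES
    }
    if unsupported:
        raise ValueError(
            "Conversion does not support these configuration attributes: "
            f"{unsupported}"
        )
```

An attribute missing from both sets makes otherwise fine checkpoints fail to convert, which is what this PR addresses by extending the lists.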
I'm confused about what the intended behavior is among penzai.nn, pz, and pz.nn. Here's an example of the confusing behavior: basically, when you import nn, you don't get everything in...
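To make the mismatch concrete, a small sketch (assuming a local penzai install) that compares what the two import paths expose; it only inspects the modules and does not assert which behavior is intended:

```python
from penzai import pz   # the curated `pz` alias namespace
import penzai.nn        # the underlying penzai.nn package

pz_nn_names = set(dir(pz.nn))
raw_nn_names = set(dir(penzai.nn))

# Names reachable one way but not the other are the source of the confusion.
print(sorted(pz_nn_names - raw_nn_names))
print(sorted(raw_nn_names - pz_nn_names))
```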
The following code outputs "Call 1 succeeded" and then hangs indefinitely:

```python
import dataclasses
import jax
import jax.numpy as jnp
from penzai import pz

@pz.pytree_dataclass
class Indexer(pz.Struct):
    index: int =...
```
When I run

```py
hf_model = transformers.LlamaForCausalLM.from_pretrained("Unbabel/TowerInstruct-7B-v0.2")
pz_model = penzai.models.transformer.variants.llama.llama_from_huggingface_model(hf_model)
```

the second line fails with

```sh
ValueError: Conversion of a LlamaForCausalLM does not support these configuration attributes: {'use_cache': False,...
```
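A possible diagnostic step before working around this (a sketch, assuming the converter trips on attributes whose values it does not recognize): compare the checkpoint's config against the defaults of its config class to see which attributes are non-default:

```python
import transformers

config = transformers.AutoConfig.from_pretrained("Unbabel/TowerInstruct-7B-v0.2")

# to_diff_dict() lists only the attributes that differ from the config class
# defaults, which is a good first guess at what the converter rejects.
print(config.to_diff_dict())
```

If the rejected attributes look harmless (e.g. `use_cache`), resetting them on `hf_model.config` before calling `llama_from_huggingface_model` may be enough, though that depends on how the converter's check is implemented.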
## Bug Description

When attempting to convert a HuggingFace model to a Penzai model using `[llama/mistral/gpt_neox]_from_huggingface_model`, the conversion fails with a ValueError when the model configuration contains certain attributes that...
I am trying to create a simple linear layer as follows,

```
from penzai import pz
import jax

embed_axis = "embed_axis"
head_axis = "head_axis"
num_heads = 4
embed_size = 10...
```
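Since the snippet above is cut off, here is a hedged sketch of one way to express a linear map over a named axis with penzai's named-array API (`pz.nx`), rather than a full `pz.nn` layer; the axis names and sizes come from the snippet, everything else is illustrative:

```python
import jax
import jax.numpy as jnp
from penzai import pz

embed_axis = "embed_axis"
head_axis = "head_axis"
num_heads = 4
embed_size = 10

key = jax.random.PRNGKey(0)

# A weight with two named axes, and an input sharing the "embed_axis" name.
w = pz.nx.wrap(jax.random.normal(key, (embed_size, num_heads))).tag(embed_axis, head_axis)
x = pz.nx.wrap(jnp.ones((embed_size,))).tag(embed_axis)

# Named arrays broadcast by axis name, so x * w aligns on "embed_axis".
# To contract over it, untag it back to a positional axis and nmap jnp.sum.
y = pz.nx.nmap(jnp.sum)((x * w).untag(embed_axis))
print(y)  # a named array with only the "head_axis" axis remaining
```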