Benjamin Anderson comments

Results: 25 comments of Benjamin Anderson

Embeddings/MLX sentence transformers

I might have messed this up GitHub-wise, because it's showing all the changes you made to my previous speculative decoding example. LMK if this is a problem and I can...

Embeddings/MLX sentence transformers

Yep, happy to address all the points you've raised! For renaming the modules so that keys match, how would you suggest handling cases where the Transformers BERT model has more...

Embeddings/MLX sentence transformers

Hey, just made some updates responding to the comments here. I changed the setup of BERT so that fewer keys have to be swapped (still some, but fewer). :) Also...

Embeddings/MLX sentence transformers

Ok, ready for review!

Embeddings/MLX sentence transformers

Hey @awni when do you think you'll get around to this one?

Embeddings/MLX sentence transformers

Lower precision results were decent/usable I thought! But it's not super important to support given how small these models are; the overhead of quantizing/dequantizing probably isn't worth it in most...

Embeddings/MLX sentence transformers

See if it looks better on your end with normalize=False. It doesn't affect metrics based on cosine similarity, but does seem to make a difference for e.g. BankingClassification. If there...
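
To illustrate the point about cosine similarity, here is a minimal sketch in plain Python (not the PR's code; the vectors and the helper names `l2_normalize` and `cosine_similarity` are made up for illustration). Cosine similarity already divides by the vector norms, so normalizing first changes nothing, while a scale-sensitive quantity such as Euclidean distance does change. That a downstream classifier is sensitive to scale in this way is only an assumption about why a classification task could behave differently.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def cosine_similarity(a, b):
    """Dot product divided by the product of the two norms."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Two toy "embeddings" (hypothetical values).
a = [3.0, 4.0, 0.0]
b = [1.0, 2.0, 2.0]

# Cosine similarity is the same with or without normalization.
cos_raw = cosine_similarity(a, b)
cos_normalized = cosine_similarity(l2_normalize(a), l2_normalize(b))

# Euclidean distance is not scale-invariant, so it does change.
dist_raw = math.dist(a, b)
dist_normalized = math.dist(l2_normalize(a), l2_normalize(b))

print(cos_raw, cos_normalized)
print(dist_raw, dist_normalized)
```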

Embeddings/MLX sentence transformers

Yup, that's what I was getting too. Not sure what explains the remaining discrepancy.

Embeddings/MLX sentence transformers

Yeah, my thought is that it makes sentence embeddings usable in MLX, which requires the pooling, handling batches, truncation, etc. and also makes it easy to load the models. This...
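
As a rough sketch of what "pooling, handling batches, truncation" can involve, the plain-Python example below truncates token sequences, pads them to a common length, and averages token vectors while ignoring padding. It assumes mean pooling, which the comment does not specify, and every name here (`truncate_and_pad`, `mean_pool`, `fake_embed`) is hypothetical rather than taken from the PR or from MLX.

```python
def truncate_and_pad(token_ids, max_length, pad_id=0):
    """Truncate each sequence, then pad to the longest one in the batch.

    Returns the padded ids and a mask with 1 for real tokens, 0 for padding.
    """
    truncated = [ids[:max_length] for ids in token_ids]
    width = max(len(ids) for ids in truncated)
    padded = [ids + [pad_id] * (width - len(ids)) for ids in truncated]
    mask = [[1] * len(ids) + [0] * (width - len(ids)) for ids in truncated]
    return padded, mask

def mean_pool(token_embeddings, mask):
    """Average the token vectors of each sequence, skipping padded positions."""
    pooled = []
    for vectors, row_mask in zip(token_embeddings, mask):
        count = sum(row_mask)
        dim = len(vectors[0])
        pooled.append([
            sum(vec[d] * m for vec, m in zip(vectors, row_mask)) / count
            for d in range(dim)
        ])
    return pooled

def fake_embed(token_id):
    """Stand-in for a model's per-token output (illustration only)."""
    return [float(token_id), 1.0]

# A batch of two sequences with different lengths.
batch = [[5, 6, 7, 8], [9]]
padded, mask = truncate_and_pad(batch, max_length=3)
token_embeddings = [[fake_embed(t) for t in row] for row in padded]
sentence_embeddings = mean_pool(token_embeddings, mask)
print(sentence_embeddings)
```

Without the mask, the padded positions of the shorter sequence would be averaged in and skew its embedding.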

Embeddings/MLX sentence transformers

Well, while this is stuck I'm working on a separate repo for embeddings in MLX.