tranquanghust
Is the instruction model in your project a bi-encoder? Can the cross-encoder be fine-tuned with 'Instruction-Finetuned Text' in the same way?
I just saw the script that mines hard negatives for the bi-encoder.
I tried hard-negative mining, but when running on two different GPUs (a T4 and an A100), the T4 took only a few seconds while the A100 took 20 minutes, and their hard...
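For reference, below is a minimal hard-negative mining sketch for a bi-encoder. It is not the repository's script; the model name, toy data, and top-k window are illustrative assumptions. It can also serve as a sanity check when two GPUs produce very different results: the candidates ranked highest by the retriever that are not labelled positives are kept as hard negatives.

```python
import numpy as np
import faiss
from sentence_transformers import SentenceTransformer

# Illustrative bi-encoder checkpoint; swap in the model you are actually fine-tuning.
model = SentenceTransformer("BAAI/bge-base-en-v1.5")

queries = ["how do I reset my password"]
positives = {0: {"You can reset your password from the account settings page."}}
corpus = [
    "You can reset your password from the account settings page.",
    "Our office is closed on public holidays.",
    "Contact support to change the email linked to your account.",
]

# Encode and L2-normalize so inner product equals cosine similarity.
q_emb = model.encode(queries, normalize_embeddings=True)
c_emb = model.encode(corpus, normalize_embeddings=True)

index = faiss.IndexFlatIP(c_emb.shape[1])
index.add(np.asarray(c_emb, dtype="float32"))

# Retrieve top-k candidates per query; anything ranked high that is NOT a
# labelled positive is kept as a hard negative.
_, topk = index.search(np.asarray(q_emb, dtype="float32"), k=3)
for qid, cand_ids in enumerate(topk):
    hard_negs = [corpus[i] for i in cand_ids if corpus[i] not in positives[qid]]
    print(qid, hard_negs)
```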
Here is the Google Colab link I used for fine-tuning: https://colab.research.google.com/drive/1kiALBR1UarPobiftZmiHfwFyk7hTCDnV?usp=sharing When I fine-tune LLM-Embedder for tool retrieval using the command on Google Colab, an error occurs:...
LLM-Embedder has the following training script, and I don't know how to adjust hyperparameters like train_batch_size, learning rate, warmup_ratio, ...: torchrun --nproc_per_node=8 run_dense.py \ --output_dir data/outputs/tool \ --train_data llm-embedder:tool/toolbench/train.json \...
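If run_dense.py builds its arguments with HuggingFace's HfArgumentParser and TrainingArguments (an assumption suggested by the flags above, but not verified here), then the usual hyperparameter names can simply be appended to the torchrun command, e.g. --per_device_train_batch_size, --learning_rate, --warmup_ratio, --num_train_epochs. A minimal sketch of how such flags map onto TrainingArguments:

```python
# Sketch only: assumes run_dense.py parses standard HuggingFace TrainingArguments.
from transformers import HfArgumentParser, TrainingArguments

parser = HfArgumentParser(TrainingArguments)
(training_args,) = parser.parse_args_into_dataclasses(args=[
    "--output_dir", "data/outputs/tool",
    "--per_device_train_batch_size", "32",   # effective batch = 32 * nproc_per_node
    "--learning_rate", "1e-5",
    "--warmup_ratio", "0.1",
    "--num_train_epochs", "1",
])
print(training_args.per_device_train_batch_size, training_args.learning_rate)
```

Note that the per-GPU flag is per_device_train_batch_size; the effective train batch size is that value multiplied by the number of processes launched by torchrun.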
Is there any BGE model that can be used for multi-label text classification (with predefined labels) by adding some dense layers for classification (similar to BERT-base-uncased), instead of Sentence...
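Since BGE checkpoints are standard BERT-style encoders on the HuggingFace Hub, a classification head can be attached in the usual way. A minimal sketch for multi-label classification; the model name, label count, and pooling choice are assumptions:

```python
import torch
from torch import nn
from transformers import AutoModel, AutoTokenizer

class BGEMultiLabelClassifier(nn.Module):
    """BGE encoder plus a dense head, trained with BCEWithLogitsLoss for multi-label tasks."""

    def __init__(self, model_name: str = "BAAI/bge-base-en-v1.5", num_labels: int = 5):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        hidden = self.encoder.config.hidden_size
        self.classifier = nn.Sequential(
            nn.Dropout(0.1),
            nn.Linear(hidden, num_labels),
        )

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        cls = out.last_hidden_state[:, 0]   # [CLS] pooling, matching BGE's convention
        return self.classifier(cls)         # raw logits, one per label

tokenizer = AutoTokenizer.from_pretrained("BAAI/bge-base-en-v1.5")
model = BGEMultiLabelClassifier(num_labels=5)
batch = tokenizer(["example text"], return_tensors="pt", padding=True, truncation=True)

logits = model(batch["input_ids"], batch["attention_mask"])
targets = torch.zeros_like(logits)          # multi-hot label vector per example
loss = nn.BCEWithLogitsLoss()(logits, targets)
```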
Can BGE reranker be fine-tuned in an instruction style, for example: "tool": { "query": "Transform this user request for fetching helpful tool descriptions: ", "key": "Transform this tool description for...
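A cross-encoder has no separate query and key encoders, but the same instruction style could in principle be applied by prepending the prompts to each side of the pair before scoring. A hedged sketch; the checkpoint name and the key-side prompt wording are assumptions, and fine-tuning would simply use these concatenated inputs in place of the raw pairs:

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "BAAI/bge-reranker-base"   # assumed checkpoint; any cross-encoder reranker works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Instruction prompts prepended before encoding the (query, passage) pair,
# mirroring the bi-encoder's instruction-tuned input format.
query_prompt = "Transform this user request for fetching helpful tool descriptions: "
key_prompt = "Transform this tool description for retrieval: "   # assumed wording

query = "find a tool that converts PDFs to text"
passage = "pdf2text: extracts plain text from PDF documents."

inputs = tokenizer(
    query_prompt + query,
    key_prompt + passage,
    return_tensors="pt",
    truncation=True,
)
with torch.no_grad():
    score = model(**inputs).logits.squeeze()
print(float(score))
```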
Dear author, does the file examples/finetune.ipynb include negative entity sampling yet? If not, how can we adjust it to incorporate negative entity sampling?
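For illustration, a minimal sketch of what random negative entity sampling could look like when building training examples; the data layout (query, positive entities, entity pool) and sample size are assumptions, not what examples/finetune.ipynb actually does. Harder negatives could instead be mined with the embedding model, as in the earlier hard-negative sketch.

```python
import random

def add_negative_entities(example, entity_pool, num_negatives=7, seed=42):
    """Attach randomly sampled negative entities, excluding the labelled positives."""
    rng = random.Random(seed)
    positives = set(example["pos"])
    candidates = [e for e in entity_pool if e not in positives]
    example["neg"] = rng.sample(candidates, k=min(num_negatives, len(candidates)))
    return example

entity_pool = ["Paris", "London", "Berlin", "Madrid", "Rome"]
example = {"query": "capital of France", "pos": ["Paris"]}
print(add_negative_entities(example, entity_pool, num_negatives=2))
```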