ThomasHoppe
Loading [ESCO-v1.1.1.ttl](https://esco.ec.europa.eu/en/use-esco/download) (737 MB) under **Python 3.6 and rdflib 6.3.2 consumes 2.6 GB** on a Windows 11 system running Ubuntu under WSL-2. Under **Python 3.8.2 and rdflib 6.3.2 consumes more than...
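One way to make a memory comparison like this reproducible across interpreter versions is to measure the peak allocation around the load call. The following is a minimal sketch using the standard-library `tracemalloc` module; the helper name `peak_memory_mib` and the stand-in workload are hypothetical, and `tracemalloc` only counts memory allocated through Python's allocator, so its figure will generally differ from the process size reported by the operating system.

```python
import tracemalloc


def peak_memory_mib(load):
    """Run load() and return (result, peak traced allocation in MiB).

    Hypothetical helper: `load` is any zero-argument callable.
    """
    tracemalloc.start()
    try:
        result = load()
        # get_traced_memory() returns (current, peak) in bytes.
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak / (1024 * 1024)


# Stand-in workload so the sketch runs without rdflib or the ESCO file.
data, peak = peak_memory_mib(lambda: [str(i) for i in range(100_000)])
print(f"{len(data)} items loaded, peak {peak:.1f} MiB")
```

To apply it to the case above, one could pass `lambda: rdflib.Graph().parse("ESCO-v1.1.1.ttl", format="turtle")` as `load`, assuming the file is in the working directory, and run the same script under each Python version.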
Valid language tags like "en-GB", "de-AT" are not recognized. The test for valid language tags is buggy. IETF BCP 47 says that language tags consist of a country component (the...
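For illustration, here is a minimal sketch of a check that accepts tags such as "en-GB" and "de-AT". It is not the validation code of the library in question, and it covers only a simplified subset of BCP 47: a primary language subtag, optionally followed by a script subtag and a region subtag. Variants, extensions, private-use and grandfathered tags are deliberately ignored, and the names `LANG_TAG` and `is_simple_language_tag` are hypothetical.

```python
import re

# Simplified subset of BCP 47 (assumption: language[-script][-region] only).
LANG_TAG = re.compile(
    r"[A-Za-z]{2,3}"                   # primary language subtag, e.g. "en", "de"
    r"(?:-[A-Za-z]{4})?"               # optional script subtag, e.g. "Hant"
    r"(?:-(?:[A-Za-z]{2}|[0-9]{3}))?"  # optional region subtag, e.g. "GB", "419"
)


def is_simple_language_tag(tag: str) -> bool:
    """Return True if `tag` matches the simplified pattern above."""
    return LANG_TAG.fullmatch(tag) is not None


for candidate in ["en-GB", "de-AT", "en", "zh-Hant-TW", "en_GB", ""]:
    print(repr(candidate), is_simple_language_tag(candidate))
```

Using `fullmatch` instead of `match` with a trailing `$` avoids accepting a tag followed by a newline.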
Training as proposed in the README or executing the 3rd cell of the eval_acc.ipynb gives the following error: File ~/Thomas/KBLaM/src/kblam/models/llama3_model.py:43 38 from transformers.modeling_outputs import ( 39 BaseModelOutputWithPast, 40 CausalLMOutputWithPast, 41...
Trying Phi3 instead of Llama3 for the training with python train.py --dataset_dir ../datasets --train_dataset enron --N 120000 --B 20 --total_steps 601 --encoder_spec OAI --use_oai_embd --key_embd_src key --use_data_aug --hf_token ... --llm_type...
It seems that the configuration information in 'meta-llama/Llama-3.2-1B-Instruct/resolve/main/config.json' has changed since the code was last used. Running the training on the enron dataset gives: ` File "/home/fokus/Thomas/KBLaM/src/kblam/models/llama3_model.py", line...
1) Running training with the following call does not store the final model: `python train.py --dataset_dir ../datasets --train_dataset enron --N 120000 --B 20 --total_steps 601 --encoder_spec OAI --use_oai_embd --key_embd_src key...
Generating the KB-embeddings with `python dataset_generation/generate_kb_embeddings.py --dataset_path datasets/enron.json --output_path datasets --model_name text-embedding-3-small` while the environment variable AZURE_OPENAI_API_KEY is set to the proper value causes the following error: ` File "/home/fokus/miniforge3/envs/kblam/lib/python3.13/site-packages/openai/_base_client.py",...