MiniOneRec

Minimal reproduction of OneRec

Results: 11 MiniOneRec issues

Hi, thank you for the great work and contribution to the open-source community! I would like to know more details about the models in the transfer capability experiment: **Qwen-Text**: The...

Firstly, thank you for the great work! We have reproduced the experiments using both SFT and RL, following the exact hyperparameters provided in the paper. The experiments were conducted on...

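Hello authors, could you share the model weights you used? There are too many libraries involved and the differences between versions are too large. Thank you! Also, is there a smaller LLM checkpoint that would be suitable?
    import torch
    from transformers import AutoModel, AutoTokenizer
    def load_qwen_model(model_path):
        # Load the tokenizer and the model in half precision from model_path.
        print("Loading Qwen Model:", model_path)
        tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
        model = AutoModel.from_pretrained(
            model_path,
            trust_remote_code=True,
            torch_dtype=torch.float16,
            low_cpu_mem_usage=True,
        )
        return tokenizer, model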

Running `bash rl.sh --model_path Qwen/Qwen3-0.6B --output_dir output_dir` fails with the following error:
    Exception ignored in:
    Traceback (most recent call last):
      File "/root/.local/share/mamba/envs/onerec3/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 528, in __del__
        self.destroy()
      File "/root/.local/share/mamba/envs/onerec3/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 532, in destroy
        self.optimizer.destroy()...

Hi, thanks for the solid work. I'm curious why padding is applied from the left side in the code, such as `tokenizer.padding_side = "left"`. Is there any advantage of left...
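On the padding question above, one general property of decoder-only models is worth illustrating (this is background, not a statement of the MiniOneRec authors' reasoning): during batched generation, new tokens are appended after the last position of every row, so left padding keeps each prompt's final real token directly next to its continuation. The sketch below uses plain Python lists. `PAD_ID` and the helper `pad_batch` are hypothetical names introduced only for this illustration and do not come from the repository.

```python
from typing import List, Tuple

PAD_ID = 0  # hypothetical pad token id, chosen only for this illustration


def pad_batch(
    seqs: List[List[int]], side: str = "left"
) -> Tuple[List[List[int]], List[List[int]]]:
    """Pad token-id sequences to a common length.

    Returns (input_ids, attention_mask), where the mask is 1 for real
    tokens and 0 for padding.
    """
    if side not in ("left", "right"):
        raise ValueError("side must be 'left' or 'right'")
    width = max(len(s) for s in seqs)
    input_ids, attention_mask = [], []
    for s in seqs:
        pad = [PAD_ID] * (width - len(s))
        ones = [1] * len(s)
        zeros = [0] * len(pad)
        if side == "left":
            input_ids.append(pad + s)
            attention_mask.append(zeros + ones)
        else:
            input_ids.append(s + pad)
            attention_mask.append(ones + zeros)
    return input_ids, attention_mask


batch = [[11, 12, 13], [21, 22]]
left_ids, left_mask = pad_batch(batch, side="left")
right_ids, right_mask = pad_batch(batch, side="right")

# Last column of each row: the position that generation continues from.
print([row[-1] for row in left_ids])   # [13, 22]
print([row[-1] for row in right_ids])  # [13, 0]
```

With right padding, the shorter row ends in a pad token, so a continuation appended to the batch would be separated from the prompt by padding. With left padding, every row ends in a real token.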

Running `bash generate_indices_plus.sh` fails with the following output:
    Loading dataset from ../data/Amazon18/Industrial_and_Scientific/Industrial_and_Scientific.emb-qwen-td.npy
    Loaded embeddings shape: (26, 2048)
    Embeddings stats - min: -55.880207, max: 59.651443, mean: -0.005308
    Building model with e_dim=2048 (Must match input dim 2048)...
    Loading...

Hi authors, thank you for releasing MiniOneRec! I am trying to reproduce the SFT results, but using a smaller backbone: Base model: Qwen2.5-0.5B-Instruct Dataset: Amazon-18 Musical Instruments (processed as described)...

I want to train on a custom dataset. What inputs do I need, and how can I modify the code to quickly adapt it to my dataset? Thanks

Thanks for your great work! I'd like to know if there's a way to control which item categories are generated during the recommendation process. For example, if I want to...