Is there any way to make images smaller?

Open martinmr opened this issue 2 years ago • 2 comments

We have a fine-tuned Llama 7B model that we tried to build into a Docker container using openllm. We ran into issues with the size of the final Docker image. The model is around 13GB uncompressed and the final image is 46GB. We built the default Llama 7B for comparison and it came out as a 33GB image.

We built our model with this command:

openllm build llama --model_id /home/circleci/model

Is there a way to make the images smaller? We are trying to use them for a service that will use k8s autoscaling, which will not work well if it takes 10+ minutes to create a new pod.

Thanks.

martinmr avatar Oct 05 '23 23:10 martinmr

Yes I'm actively working on this.

aarnphm avatar Nov 09 '23 23:11 aarnphm

This is probably because the base image also includes vllm and all of the dependencies that vllm needs.

aarnphm avatar Nov 09 '23 23:11 aarnphm

The image size should be reduced to about 7GB in openllm 0.6.

bojiang avatar Jul 13 '24 05:07 bojiang
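
For a rough sense of what the sizes in this thread mean for pod startup, here is a minimal back-of-the-envelope sketch. It only divides image size by a sustained pull throughput. The 75 MB/s figure is an assumption chosen for illustration, not a measurement from this thread, and the estimate ignores layer compression, layer caching, and extraction time. The function name `pull_minutes` is hypothetical.

```python
def pull_minutes(image_gb: float, throughput_mb_per_s: float) -> float:
    """Estimate minutes needed to pull an image of `image_gb` gigabytes
    at a sustained throughput of `throughput_mb_per_s` megabytes/second."""
    if throughput_mb_per_s <= 0:
        raise ValueError("throughput must be positive")
    seconds = image_gb * 1000 / throughput_mb_per_s  # 1 GB taken as 1000 MB
    return seconds / 60


# Assumed throughput for illustration only; measure your own registry/network.
ASSUMED_THROUGHPUT_MB_S = 75.0

# Image sizes as reported in the thread.
sizes_gb = {
    "fine-tuned Llama 7B image": 46,
    "default Llama 7B image": 33,
    "openllm 0.6 image (approx.)": 7,
}

for name, gb in sizes_gb.items():
    minutes = pull_minutes(gb, ASSUMED_THROUGHPUT_MB_S)
    print(f"{name}: {gb} GB -> ~{minutes:.1f} min")
```

Under that assumed throughput, the 46GB image lands in the same range as the 10+ minute pod creation time reported above, while a roughly 7GB image would pull in a small fraction of that time. Actual numbers depend on the registry, the network, and whether nodes already have the base layers cached.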