Chinmaya Andukuri

Results 2 comments of Chinmaya Andukuri

Is there any update on this? Would love to help out if there's still interest / effort @SeungoneKim @baberabb @haileyschoelkopf

Yes re: llama3 branch suggestion - although note that only single GPU inference seems to be supported for REST model if I'm not mistaken so 70B model will have to...