replicate-python
Python client for Replicate
Hi, I like to use the text-to-image and image-to-image models from Replicate, and I have a process that is very time-consuming, in terms of both code and performance, to redeploy images from...
I run a Python script that pins the old model version I need:
```python
output_url = replicate.run(
    "tencentarc/gfpgan:9283608cc6b7be6b65a8e44983db012355fde4132009bf99d976b2f0896856a3",
    input={
        "img": open(in_img_path, "rb"),
        "scale": 6,
        "version": "v1.3"
    }...
```
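For finding the right hash to pin, a short sketch using the client's documented model/version calls (the model name is just the one from the snippet above):

```python
import replicate

# List the published versions of a model; each `id` is the hash that can
# be pinned after the colon in replicate.run("owner/name:<id>", ...).
model = replicate.models.get("tencentarc/gfpgan")
for version in model.versions.list():
    print(version.id, version.created_at)
```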
Is there any API or function to calculate the tokens used by each request, and the cost of that request, for any model?
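There is no single built-in cost call that I know of, but a hedged sketch of what is retrievable: each prediction carries a `metrics` dictionary, and cost is typically `predict_time` multiplied by the hardware's per-second price from the pricing page. The token-count keys below are an assumption; only some language models populate them.

```python
import replicate

# Look at the most recent prediction on the account; each prediction
# carries a `metrics` dict. `predict_time` (seconds of compute) is the
# usual basis for cost.
latest = replicate.predictions.list().results[0]
metrics = latest.metrics or {}
print("predict_time (s):", metrics.get("predict_time"))
# Assumption: only some language models report token counts, and the
# key names may differ by model, so treat these as optional.
print("input tokens:", metrics.get("input_token_count"))
print("output tokens:", metrics.get("output_token_count"))
```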
Hello, I was wondering how you can set timeouts in the `replicate.run()` function. I have tried using the Replicate client, but it didn't throw a timeout error:
```python
from replicate.client...
```
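Until `run()` exposes a timeout, one workaround sketch is to enforce a deadline manually via the lower-level predictions API, using the documented `create` / `reload` / `cancel` calls (the version id below is a placeholder, not a real model version):

```python
import time
import replicate

prediction = replicate.predictions.create(
    version="...",  # hypothetical: your model's version id
    input={"prompt": "hello"},
)

deadline = time.monotonic() + 60  # 60-second budget
while prediction.status not in ("succeeded", "failed", "canceled"):
    if time.monotonic() > deadline:
        prediction.cancel()  # stop a stuck prediction
        raise TimeoutError("prediction exceeded the 60s budget")
    time.sleep(1)
    prediction.reload()  # refresh status from the API

print(prediction.output)
```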
It would be really helpful if there were a documented example of how to catch common error cases. Might I request a documentation example along the following lines:
```
for event...
```
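While that documentation doesn't exist, here is a minimal sketch catching the client's two main exception types, `ModelError` and `ReplicateError`, around a streamed run (the model and prompt are just illustrative):

```python
import replicate
from replicate.exceptions import ModelError, ReplicateError

try:
    for event in replicate.stream(
        "mistralai/mistral-7b-instruct-v0.2",  # illustrative streaming model
        input={"prompt": "Write a haiku about error handling."},
    ):
        print(str(event), end="")
except ModelError as e:
    # Raised when the model itself fails while running (bad input, OOM, ...).
    print(f"model failed: {e}")
except ReplicateError as e:
    # Raised for API-level problems (auth, rate limiting, validation, ...).
    print(f"API error: {e}")
```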
LLaMA-2 models have a maximum input size of 4096 tokens [[original paper](https://arxiv.org/pdf/2307.09288.pdf), [meta llama github repo](https://github.com/meta-llama/llama/issues/267#issuecomment-1659440955)]. When prompting `meta/llama-2-70b` through Replicate, however, the maximum input size the model accepts is, strangely,...
I am getting an error that the prompt length exceeds the maximum input length when calling `meta/llama-2-70b` through the API. I have included the error log from the Replicate dashboard...
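Until the server-side limit is clarified, a crude client-side pre-check can at least fail fast before the API call. This sketch uses a characters-per-token heuristic, which is an assumption, not Llama-2's real tokenizer:

```python
MAX_INPUT_TOKENS = 4096   # Llama-2's documented context window
CHARS_PER_TOKEN = 4       # rough heuristic, not the actual tokenizer

def rough_token_count(text: str) -> int:
    # Approximate token count; errs on the side of overestimating.
    return len(text) // CHARS_PER_TOKEN + 1

prompt = "Q: Would a pear sink in water? A: "
if rough_token_count(prompt) > MAX_INPUT_TOKENS:
    raise ValueError("prompt likely exceeds the 4096-token input limit")
```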
Calls to `meta/llama-2-70b` sometimes succeed but sometimes fail; it is very unreliable. This is the code:
```python
output = replicate.run(
    "meta/llama-2-70b",
    input={
        "prompt": "Q: Would a pear sink in...
```
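A common workaround for intermittent failures is retrying with exponential backoff. A sketch, where the retry helper is hypothetical and not part of the client:

```python
import time
import replicate
from replicate.exceptions import ModelError, ReplicateError

def run_with_retries(model, model_input, attempts=3):
    # Hypothetical helper: retry transient failures with exponential backoff.
    for attempt in range(attempts):
        try:
            return replicate.run(model, input=model_input)
        except (ModelError, ReplicateError):
            if attempt == attempts - 1:
                raise  # out of retries; surface the last error
            time.sleep(2 ** attempt)  # back off 1s, 2s, 4s, ...

output = run_with_retries(
    "meta/llama-2-70b",
    {"prompt": "Q: Would a pear sink in water? A: "},
)
```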
Running this code:
```python
import os
import replicate
from dotenv import load_dotenv

load_dotenv()
REPLICATE_API_TOKEN = os.getenv("REPLICATE_API_TOKEN")

prompt = "Q: What is 10*10? A: "
output = replicate.run(
    "meta/llama-2-7b",
    input={
        "prompt":...
```
With the replicate 0.24.0 Python client and `mistralai/mistral-7b-instruct-v0.2` (a model that supports streaming), the iterator I get back from `client.run()` is truncating output, perhaps 1 in 50 times. I checked...
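One way to confirm the truncation is to compare the streamed text against the output stored on the prediction record afterwards. A diagnostic sketch, assuming the most recent prediction on the account is the one just streamed and that the model stores output as a list of string chunks:

```python
import replicate

# Stream the output and accumulate it locally.
streamed = "".join(
    str(event)
    for event in replicate.stream(
        "mistralai/mistral-7b-instruct-v0.2",
        input={"prompt": "Q: What is 10*10? A: "},
    )
)

# Fetch the final prediction record and join its stored output chunks.
latest = replicate.predictions.list().results[0]
latest.wait()  # ensure the record has reached a terminal state
stored = "".join(latest.output or [])

if streamed != stored:
    print(f"stream truncated: {len(streamed)} streamed vs {len(stored)} stored chars")
```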