
Allow multimodal input to commands

Open sameelarif opened this issue 1 year ago • 0 comments

Feature request: add an image property to the extract and act commands, so an instruction can refer to an image as well as text:

const price = await stagehand.extract({
  instruction: "extract the price for the watch that looks most similar to the image",
  image: "<base64 string or file path>", // proposed property: a base64-encoded image or a path to an image file
  schema: z.object({ ... }),
})
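
One way the proposed image value could be handled internally is to normalize it to a base64 string before it goes to the model. The sketch below is only an illustration under that assumption: `resolveImageInput` is a hypothetical helper name, not an existing Stagehand function, and the "file path if a file exists there, otherwise base64" rule is a guess at how the two input forms might be told apart.

```typescript
import * as fs from "node:fs";

// Hypothetical helper (not part of Stagehand): normalizes the proposed
// `image` value, either a base64 string or a file path, to a base64 string.
function resolveImageInput(image: string): string {
  // Treat the value as a file path if a regular file exists at that location.
  if (fs.existsSync(image) && fs.statSync(image).isFile()) {
    return fs.readFileSync(image).toString("base64");
  }
  // Otherwise assume it is already base64; strip an optional data-URL prefix.
  return image.replace(/^data:[^;]+;base64,/, "");
}
```

An explicit shape such as `{ base64: string } | { path: string }` would avoid the guessing step, at the cost of a slightly more verbose call site.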

sameelarif, Dec 02 '24 17:12