Adam Hanson

5 comments by Adam Hanson

A way to get stream to read PCM data from stdin or a pipe would be greatly appreciated.

The strength of this project's approach seems to be that it uses SAM and multimodal models to visually parse GUI layouts, instead of relying on OS-specific features like Windows' accessibility...

@abrichr I would use it for the same purposes as this: https://github.com/louis030195/screen-pipe (sadly, that project is still Mac-only at the moment)

Hi. I'm not able to get the [Self Hosting instructions](https://docs.plandex.ai/hosting/self-hosting/#quickstart) from the docs working, and I keep getting errors similar to those reported by the user in #169. Having a prebuilt Docker image to run...

I'm also looking for a screenshot use-case. Most OCR seems geared to photos, handwriting, or PDFs. They don't do great on normal GUI text.