Richie Caputo

Results 12 comments of Richie Caputo

I should note that this ONLY occurs in a Jupyter setting. When executed with Python directly there is no issue.

@Benjamin-Etheredge would you mind sharing this notebook in your forked `examples`? Really love this idea as I've had issues with `keras-rl2` and the fact that it's no longer maintained.

We have gotten this to work for other [custom file formats](https://github.com/databrickslabs/tika-ocr/blob/main/src/main/scala/com/databricks/labs/tika/TikaFileFormat.scala) with fixed schema. I wonder if we can apply a similar approach here while supporting provided schemas or inferred...

Although not well-documented, you could do this with ```scala ThisBuild / githubWorkflowPermissions := Some(Permissions.Specify(Map( PermissionScope.IdToken -> PermissionValue.Write ))) ``` We needed to implement ```scala ThisBuild / githubWorkflowPermissions := Some(Permissions.Specify(Map( PermissionScope.Packages...

It should be straightforward to add an option to ignore the `content` column, but Tika still requires having the entire binary in memory to do OCR, so IMO memory is...

Feel free to give this a shot: https://github.com/TJC-LP/tika-ocr/tree/TJC-LP/enable-xml-output I'm going to test it in our Databricks workspace in the next few days, but locally seems to work as expected.

This is a fantastic change; currently installing from this branch! Please merge :)

As a workaround we use `someProperty: Optional[str] = Field(default_factory=lambda: None)`. This repo isn't being maintained as well as, for example, https://github.com/mitchelllisle/sparkdantic, so I'd take a look there as well.

Ollama supports the [OpenAI SDK](https://github.com/ollama/ollama/blob/main/docs/openai.md) fwiw

Note that destroying the target and/or manually deleting the `.bundle` and retrying yields the same issue.