Richie Caputo
Richie Caputo
I should note that this ONLY occurs in a Jupyter setting. When executed with Python directly there is no issue.
@Benjamin-Etheredge would you mind sharing this notebook in your forked `examples`? Really love this idea as I've had issues with `keras-rl2` and the fact that it's no longer maintained.
We have gotten this to work for other [custom file formats](https://github.com/databrickslabs/tika-ocr/blob/main/src/main/scala/com/databricks/labs/tika/TikaFileFormat.scala) with fixed schema. I wonder if we can apply a similar approach here while supporting provided schemas or inferred...
Although not well-documented, you could do this with ```scala ThisBuild / githubWorkflowPermissions := Some(Permissions.Specify(Map( PermissionScope.IdToken -> PermissionValue.Write ))) ``` We needed to implement ```scala ThisBuild / githubWorkflowPermissions := Some(Permissions.Specify(Map( PermissionScope.Packages...
It should be straightforward to add an option to ignore the `content` column, but Tika still requires having the entire binary in memory to do OCR, so IMO memory is...
Feel free to give this a shot: https://github.com/TJC-LP/tika-ocr/tree/TJC-LP/enable-xml-output I'm going to test it in our Databricks workspace in the next few days, but locally seems to work as expected.
This is a fantastic change; currently installing from this branch! Please merge :)
As a workaround we use `someProperty: Optional[str] = Field(default_factory=lambda: None)`. This repo isn't being maintained as well as, for example, https://github.com/mitchelllisle/sparkdantic, so I'd take a look there as well.
Ollama supports the [OpenAI SDK](https://github.com/ollama/ollama/blob/main/docs/openai.md) fwiw
`mode: production` target appears bugged during GitHub action deployment for v0.222 (`poetry` build)
Note that destroying the target and/or manually deleting the `.bundle` and retrying yields the same issue.