Vishal Kumar Das
Vishal Kumar Das
@rafaelghiorzi What exactly did you do, I am facing the same issue, only when doing parallell calling of docling
@rafaelghiorzi Well, I have a requirement of doing RAG, starting with picking up messages from queues(Azure bus), the process is Async with threading, for each message it calls document processing...
@Voileexperiments Did you suppress these warnings? And how did you implement rapidocr_paddle. I am getting a ton of these warnings for large PDFs. Can you help me out?
How much time it takes to convert and export to markdown, do you guys have any strategy? Because in my system it takes around 2mins to convert a 9 page...
[AlexandriaObol.pdf](https://github.com/user-attachments/files/20404612/AlexandriaObol.pdf) Hi @cau-git, Sure ``` def _initialize_converter(self) -> None: """Initialize the document converter with appropriate settings.""" pdf_pipeline_options = None if self.optimize_pdf: # Configure RapidOCR with GPU acceleration pdf_pipeline_options = PdfPipelineOptions(ocr_options=RapidOcrOptions())...
@cau-git Now I am facing this issue ``` "C:\Users\Admin\AppData\Local\pypoetry\Cache\virtualenvs\risingtidevectorsearch-ryepBKyc-py3.11\Lib\site-packages\docling_ibm_models\layoutmodel\layout_predictor.py", line 92, in __init__ ).to(self._device) ^^^^^^^^^^^^^^^^ File "C:\Users\Admin\AppData\Local\pypoetry\Cache\virtualenvs\risingtidevectorsearch-ryepBKyc-py3.11\Lib\site-packages\transformers\modeling_utils.py", line 3850, in to return super().to(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\Users\Admin\AppData\Local\pypoetry\Cache\virtualenvs\risingtidevectorsearch-ryepBKyc-py3.11\Lib\site-packages\torch\nn\modules\module.py", line 1343, in...
@cau-git Any update on this?
@geoHeil Well It has been resolved in the earlier versions. The issue was specifically for the Rapid OCR models with V4. Now that V5 has arrived, the issue still exists...
@dosu Can you write a small logic for the same because, this will happen for other formats as well like PPTx, HTMLX, etc
@dosu Now I am getting the below error: ` File "/app/src/objects_core/utils.py", line 34, in wrapper return func(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^^^ File "/app/src/vector_store_processor/core/docling_converter.py", line 151, in convert_to_markdown conv_result = self.converter.convert(input_path) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File...