CK
IMPORTANT: 1.0.3 with VAD v5 is much worse than 1.0.2 or 1.0.1 with VAD v4 for certain audio data. Why?
[All falls down.zip](https://github.com/user-attachments/files/16440204/All.falls.down.zip) In tests on audio files that contain only human dialogue, the v5 VAD seems to be better than v4. But for the music lyrics transcription I...
Where is the "preprocessor_config.json" file?
"""
if files:
    model_path = model_size_or_path
    tokenizer_bytes = files.pop("tokenizer.json", None)
    preprocessor_bytes = files.pop("preprocessor_config.json", None)
"""
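For reference, the quoted snippet calls `dict.pop` with a default of `None`, so a missing "preprocessor_config.json" entry does not raise an error there; the variable simply ends up as `None`. The sketch below reproduces only that behavior in isolation. The helper name `split_model_files` and the sample `files` dict are hypothetical, and what the library does with a `None` value afterwards is not shown in the excerpt.

```python
from typing import Dict, Optional, Tuple


def split_model_files(
    files: Dict[str, bytes],
) -> Tuple[Optional[bytes], Optional[bytes], Dict[str, bytes]]:
    """Hypothetical helper mirroring the two pop() calls in the quoted snippet.

    Returns (tokenizer_bytes, preprocessor_bytes, remaining_files).
    A key that is absent yields None instead of raising KeyError.
    """
    remaining = dict(files)  # work on a copy so the caller's dict is untouched
    tokenizer_bytes = remaining.pop("tokenizer.json", None)
    preprocessor_bytes = remaining.pop("preprocessor_config.json", None)
    return tokenizer_bytes, preprocessor_bytes, remaining


# Sample input (made up): no "preprocessor_config.json" is supplied.
files = {"tokenizer.json": b"{}", "model.bin": b"\x00"}
tokenizer_bytes, preprocessor_bytes, remaining = split_model_files(files)

if preprocessor_bytes is None:
    print("preprocessor_config.json was not provided")
```

If the file is expected but comes back as `None`, checking which keys were actually passed in (`files.keys()`) is a quick first step.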
### I have made sure all agents and tasks are defined properly, following the CrewAI examples. But I often encounter issues like these when using manager agent mode...
**Are the conditions for using manager_agent and hierarchical mode correct? I couldn't find a detailed official explanation.** 1. MUST manager_agent and manager_task be customized? 2. only manager_agent could be set as...
**If I set "OCR_ALL_PAGES" to True, the results are much worse than with False (e.g. for an invoice with a somewhat complicated table). Why? when is True of "OCR_ALL_PAGES"...
1. What's the relationship between OCRmyPDF and marker-pdf? 2. If I install OCRmyPDF and related packages, what does it add? 3. If I only install OCRmyPDF, do I need to...
**I installed and deployed marker-pdf locally, and it produced output successfully in GPU + CUDA mode, but model loading is extremely slow (load_all_models() from the source code). Why?...
**my current versions:** torch==2.2.2+cu121 torchvision==0.17.2+cu121 torchaudio==2.2.2+cu121
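A quick sanity check for pins like these is that all three packages carry the same local build tag (here `cu121`), since mixing CUDA builds is a common source of trouble. The sketch below parses only the `name==version+tag` form shown above; the helper names `parse_pin` and `build_tags` are hypothetical, and it does not inspect what is actually installed.

```python
import re
from typing import Dict, List, Optional, Tuple

# Pins copied from the version list above.
PINS = [
    "torch==2.2.2+cu121",
    "torchvision==0.17.2+cu121",
    "torchaudio==2.2.2+cu121",
]


def parse_pin(pin: str) -> Tuple[str, str, Optional[str]]:
    """Split 'name==version+local' into (name, version, local).

    The local part (e.g. 'cu121') is None when the pin has no '+' suffix.
    """
    match = re.fullmatch(r"([A-Za-z0-9_.-]+)==([^+]+)(?:\+(.+))?", pin)
    if match is None:
        raise ValueError(f"unsupported pin format: {pin!r}")
    name, version, local = match.groups()
    return name, version, local


def build_tags(pins: List[str]) -> Dict[str, Optional[str]]:
    """Map each package name to its local build tag."""
    tags = {}
    for pin in pins:
        name, _version, local = parse_pin(pin)
        tags[name] = local
    return tags


tags = build_tags(PINS)
# True when every package shares one build tag.
consistent = len(set(tags.values())) == 1
print(tags, "consistent" if consistent else "MISMATCH")
```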
**1. This issue happens every time, but the README says "Optional: Install TransformerEngine if using NVIDIA GPU (Linux only), The adaptation of TransformerEngine is currently under development and CANNOT run...