omnibox stuck at the last step
I can only see that it is trying to run some script, and it stays stuck here forever.
Meanwhile, this script is waiting for a response, but the virtual machine never gives one.
I can confirm that the installation is stuck on the screen shown above. I followed the instructions given here:
https://github.com/microsoft/OmniParser/tree/master/omnitool
After running "./manage_vm.sh create" in the folder ~/OmniParser/omnitool/omnibox/scripts, the containers are created, but the message shown above appears. A folder called "win11storage" is created (it should store the VM once everything is done), but its size stays at 0, even after a significant amount of time.
Just like mine: https://github.com/microsoft/OmniParser/issues/139#issuecomment-2662287890. I got it working after doing some stuff and waiting for around 1 hour.
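To tell a slow install apart from a stalled one, it can help to measure the storage folder every few minutes. The following is a minimal sketch, not part of OmniParser; the function name `dir_size_bytes` is hypothetical, and the assumption is that "win11storage" is reachable from wherever the script runs.

```python
import os


def dir_size_bytes(path: str) -> int:
    """Return the total size in bytes of all files under `path`.

    A missing or empty folder yields 0, which matches the symptom
    described above (hypothetical helper, not OmniParser code).
    """
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
            except OSError:
                # Files can vanish or be locked while the VM is being built.
                pass
    return total
```

Calling `dir_size_bytes("win11storage")` a few minutes apart and comparing the two numbers shows whether anything is being written at all.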
On Windows 10, I am getting:
2025-02-17 13:37:07 ❯ Starting OmniParser Windows for Docker v0.00...
2025-02-17 13:37:07 ❯ For support visit https://github.com/microsoft/OmniParser
2025-02-17 13:37:07 ❯ CPU: Intel Core TM i7 10700F | RAM: 14/16 GB | DISK: 61 GB (v9fs) | KERNEL: 5.15.167.4-microsoft-standard-WSL2...
2025-02-17 13:37:07
2025-02-17 13:37:07 ❯ Extracting local ISO image...
2025-02-17 13:38:13 ❯ Detecting version from ISO image...
2025-02-17 13:38:13 ❯ Detected: Windows 11 Enterprise (Evaluation)
2025-02-17 13:38:13 ❯ Adding drivers to image...
2025-02-17 13:38:15 ❯ Adding OEM folder to image...
2025-02-17 13:38:15 ❯ Adding win11x64-enterprise-eval.xml for automatic installation...
2025-02-17 13:38:16 ❯ Building Windows 11 image...
2025-02-17 13:39:47 ❯ Creating a 20G growable disk image in raw format...
2025-02-17 13:39:47 ❯ ERROR: KVM acceleration not available (device file missing), this will cause a major loss of performance.
2025-02-17 13:39:47 ❯ ERROR: See the FAQ on how to diagnose the cause, or continue without KVM by setting KVM=N (not recommended).
I tried it with the setting KVM=N. That does not give an error, but when the VM eventually starts up I get:
BdsDxe: failed to load Boot0002 "UEFI QEMU QEMU HARDDISK " from PciRoot(0x0)/Pci(0xA,0x0)/Scsi(0x0,0x0): Not Found
2025-02-17 11:43:32 BdsDxe: loading Boot0001 "UEFI QEMU DVD-ROM QM00013 " from PciRoot(0x0)/Pci(0x5,0x0)/Sata(0x0,0xFFFF,0x0)
2025-02-17 11:43:32 BdsDxe: starting Boot0001 "UEFI QEMU DVD-ROM QM00013 " from PciRoot(0x0)/Pci(0x5,0x0)/Sata(0x0,0xFFFF,0x0)
2025-02-17 11:43:56 ❯ Windows has started successfully. You can directly view the VM at http://localhost:8006/vnc.html?view_only=1&autoconnect=1&resize=scale. Wait until setup is complete before interacting manually.
I left it for an hour, but nothing happened, even when viewing the VM.
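The "KVM acceleration not available (device file missing)" error can be checked before starting the container. This is a minimal sketch to run inside the WSL2 distribution or Linux host. It assumes the device file in question is `/dev/kvm`, which is the usual KVM device node on Linux; the log itself does not name the path, and `kvm_status` is a hypothetical helper.

```python
import os


def kvm_status(device: str = "/dev/kvm") -> str:
    """Classify the state of the KVM device node.

    Returns "missing" if the node does not exist, "no-permission" if it
    exists but the current user cannot read and write it, and "ok"
    otherwise. The default path is an assumption, not taken from the log.
    """
    if not os.path.exists(device):
        return "missing"
    if not os.access(device, os.R_OK | os.W_OK):
        return "no-permission"
    return "ok"
```

A result of "missing" would match the error above; in that case virtualization support on the host is the thing to investigate, rather than the OmniParser scripts.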
Same here on Windows 11. It fails while the image is building:
BdsDxe: failed to load Boot0002 "UEFI QEMU QEMU HARDDISK " from PciRoot(0x0)/Pci(0xA,0x0)/Scsi(0x0,0x0): Not Found
2025-02-17 08:33:28 BdsDxe: loading Boot0001 "UEFI QEMU DVD-ROM QM00013 " from PciRoot(0x0)/Pci(0x5,0x0)/Sata(0x0,0xFFFF,0x0)
2025-02-17 08:33:28 BdsDxe: starting Boot0001 "UEFI QEMU DVD-ROM QM00013 " from PciRoot(0x0)/Pci(0x5,0x0)/Sata(0x0,0xFFFF,0x0)
2025-02-17 08:33:54 ❯ Windows has started successfully. You can directly view the VM at http://localhost:8006/vnc.html?view_only=1&autoconnect=1&resize=scale. Wait until setup is complete before interacting manually.
2025-02-17 08:34:22 KVM: entry failed, hardware error 0xffffffff
2025-02-17 08:34:22 KVM: entry failed, hardware error 0xffffffff
2025-02-17 08:34:22 EAX=00000000 EBX=60ce7038 ECX=000000b2 EDX=000000b2
2025-02-17 08:34:22 ESI=00000000 EDI=0000007a EBP=60ce6fa0 ESP=60ce6f78
2025-02-17 08:34:22 EIP=00008000 EFL=00000002 [-------] CPL=0 II=0 A20=1 SMM=1 HLT=0
2025-02-17 08:34:22 ES =0000 00000000 ffffffff 00809300
2025-02-17 08:34:22 CS =fb00 7effb000 ffffffff 00809300
2025-02-17 08:34:22 SS =0000 00000000 ffffffff 00809300
2025-02-17 08:34:22 DS =0000 00000000 ffffffff 00809300
2025-02-17 08:34:22 FS =0000 00000000 ffffffff 00809300
2025-02-17 08:34:22 GS =0000 00000000 ffffffff 00809300
2025-02-17 08:34:22 LDT=0000 00000000 00000000 00000000
2025-02-17 08:34:22 TR =0040 cf56e000 00000067 00008b00
2025-02-17 08:34:22 GDT= cf56ffb0 00000057
2025-02-17 08:34:22 IDT= 00000000 00000000
2025-02-17 08:34:22 CR0=00050032 CR2=2780f2a3 CR3=23361000 CR4=00000000
2025-02-17 08:34:22 DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000
2025-02-17 08:34:22 DR6=00000000ffff0ff0 DR7=0000000000000400
2025-02-17 08:34:22 EFER=0000000000000000
2025-02-17 08:34:22 Code=qemu-system-x86_64: hw/core/cpu-sysemu.c:76: cpu_asidx_from_attrs: Assertion `ret < cpu->num_ases && ret >= 0' failed.
2025-02-17 08:34:25 ❯ ERROR: Forcefully terminating Windows, reason: 0...
2025-02-17 08:34:27 ❯ Shutdown completed!
This problem is not related to the originally posted issue. Please open a new issue for it, since it may draw attention away from the original one.
I looked through the setup PowerShell script and found that the mirror URL used to download GIMP is really slow. Watching the VM in QEMU, I realized it was indeed stuck after the VLC installation. After GIMP showed up on the desktop, the rest of the setup was blazing fast. Maybe removing the slow mirror URL would help.
@kaimingKAI @mio-19 @AleksandarHaber @isaacrlevin Thanks for raising the issue. Any success after removing the line @kaimingKAI proposed? If it improves things, we will consider merging that change into master. Thanks everyone for testing!
It does not solve the problem; the problem persists. The VM folder size stays at 0, nothing is created, and the same message keeps repeating.
Check your Docker logs for the created instance. In my case, the issue was that the allocated RAM for Docker was lower than the requested 8GB. Increasing the memory limit in Docker or WSL2, depending on your system, resolved the problem.
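One way to check the memory limit described above is to ask Docker how much memory it sees. The sketch below assumes the `docker` CLI is on the PATH and uses `docker info --format "{{.MemTotal}}"`, which prints the total memory in bytes; the 8 GB threshold comes from the comment above, and both function names are hypothetical.

```python
import subprocess

REQUIRED_GIB = 8  # the 8GB figure quoted in the comment above


def has_enough_memory(mem_total_bytes: int, required_gib: float = REQUIRED_GIB) -> bool:
    """Return True if Docker's reported memory covers the requested amount."""
    return mem_total_bytes >= required_gib * 2**30


def docker_mem_total_bytes() -> int:
    """Ask the Docker daemon for its total memory in bytes.

    Raises CalledProcessError if the command fails and FileNotFoundError
    if the docker CLI is not installed.
    """
    result = subprocess.run(
        ["docker", "info", "--format", "{{.MemTotal}}"],
        capture_output=True,
        text=True,
        check=True,
    )
    return int(result.stdout.strip())
```

`has_enough_memory(docker_mem_total_bytes())` then gives a quick yes/no. Docker usually reports slightly less than the configured limit, so a limit set to exactly 8 GB may still come back as False; allowing some headroom above the requested amount is the safer choice.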
We have removed the slow GIMP mirrors from the latest branch. Pull master to get the latest. Let us know if increasing the memory for Docker fixes the issue @AleksandarHaber.
state {'messages': [{'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text')]}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text')]}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text'), './tmp/outputs/screenshot_67dad4b6ad60471e9103e0015a68f0b4.png']}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text'), './tmp/outputs/screenshot_61c7a372300e437db87d5867f062fc71.png']}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text'), './tmp/outputs/screenshot_f91f43b3c6f14d39aa032de1b6b5cdc6.png', './tmp/outputs/screenshot_som_f91f43b3c6f14d39aa032de1b6b5cdc6.png']}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text')]}], 'model': 'omniparser + R1', 'provider': 'groq', 'openai_api_key': 'de56a372-0363-464a-bb80-f514973071f7', 'anthropic_api_key': '', 'api_key': 'de56a372-0363-464a-bb80-f514973071f7', 'auth_validated': False, 'responses': {}, 'tools': {}, 'only_n_most_recent_images': 2, 'chatbot_messages': [('open google chrome search macbook air', None), (None, '-- Step 1: --'), (None, 'LLM: 0.56s, OmniParser: 1.78s'), ('open google chrome search macbook air', None), (None, '-- Step 1: --'), (None, 'LLM: 0.64s, OmniParser: 0.49s'), ('open google chrome search macbook air', None), (None, '-- Step 1: --'), (None, 'LLM: 0.56s, OmniParser: 0.50s'), ('open google chrome search macbook air', None), (None, '-- Step 1: --'), (None, 'LLM: 0.55s, OmniParser: 0.48s'), ('open google chrome search macbook air', None), (None, '-- Step 1: --'), (None, 'LLM: 0.53s, OmniParser: 0.49s'), ('open google chrome search macbook air', None)], 'stop': False, 'groq_api_key': 'de56a372-0363-464a-bb80-f514973071f7'}
in sampling_loop_sync, model: omniparser + R1
screen size: 1280, 800
Model Inited: omniparser + R1, Provider: groq
Start the message loop. User messages: [{'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text')]}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text')]}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text'), './tmp/outputs/screenshot_67dad4b6ad60471e9103e0015a68f0b4.png']}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text'), './tmp/outputs/screenshot_61c7a372300e437db87d5867f062fc71.png']}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text'), './tmp/outputs/screenshot_f91f43b3c6f14d39aa032de1b6b5cdc6.png', './tmp/outputs/screenshot_som_f91f43b3c6f14d39aa032de1b6b5cdc6.png']}, {'role': <Sender.USER: 'user'>, 'content': [TextBlock(citations=None, text='open google chrome search macbook air', type='text')]}]
omniparser latency: 0.4864072799682617
_render_message: -- Step 1: --
Error in interleaved Groq: Error code: 404 - {'error': {'message': 'Not Found'}}
groq token usage: 0
render_message: LLM: 0.62s, OmniParser: 0.49s
Error code: 404 - {'error': {'message': 'Not Found'}}
Total token so far: 0.
Total cost so far: $USD0.00000
Traceback (most recent call last):
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\queueing.py", line 715, in process_events
    response = await route_utils.call_process_api(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\route_utils.py", line 322, in call_process_api
    output = await app.get_blocks().process_api(
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\blocks.py", line 2044, in process_api
    result = await self.call_function(
             ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\blocks.py", line 1603, in call_function
    prediction = await utils.async_iteration(iterator)
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\utils.py", line 728, in async_iteration
    return await anext(iterator)
           ^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\utils.py", line 722, in anext
    return await anyio.to_thread.run_sync(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\anyio\to_thread.py", line 33, in run_sync
    return await get_asynclib().run_sync_in_worker_thread(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\anyio\_backends\_asyncio.py", line 877, in run_sync_in_worker_thread
    return await future
           ^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\anyio\_backends\_asyncio.py", line 807, in run
    result = context.run(func, *args)
             ^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\utils.py", line 705, in run_sync_iterator_async
    return next(iterator)
           ^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\site-packages\gradio\utils.py", line 866, in gen_wrapper
    response = next(iterator)
               ^^^^^^^^^^^^^^
  File "D:\OmniParser\omnitool\gradio\app.py", line 235, in process_input
    for loop_msg in sampling_loop_sync(
  File "D:\OmniParser\omnitool\gradio\loop.py", line 108, in sampling_loop_sync
    tools_use_needed, vlm_response_json = actor(messages=messages, parsed_screen=parsed_screen)
                                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\OmniParser\omnitool\gradio\agent\vlm_agent.py", line 147, in __call__
    vlm_response_json = json.loads(vlm_response_json)
                        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\json\__init__.py", line 346, in loads
    return _default_decoder.decode(s)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\json\decoder.py", line 337, in decode
    obj, end = self.raw_decode(s, idx=_w(s, 0).end())
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\anaconda\envs\op\Lib\json\decoder.py", line 355, in raw_decode
    raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
WTF!!!!! Using deepseek-r1 produces this error; switching to the qwen-2.5vl API fixed it.
gpt4o runs into the same problem: line 355, in raw_decode raise JSONDecodeError("Expecting value", s, err.value) from None json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
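"Expecting value: line 1 column 1 (char 0)" is what `json.loads` raises for an empty string or for text that is not JSON at all. In the log above, the provider returned a 404 just before the crash, so the string handed to `json.loads` was probably empty or an error message rather than a model answer. The sketch below shows one way such a parse step could report that more clearly. It is not the code in `vlm_agent.py`; `parse_vlm_response` is a hypothetical helper.

```python
import json
from typing import Any, Optional


def parse_vlm_response(raw: Optional[str]) -> dict[str, Any]:
    """Parse an LLM reply that is expected to be a JSON object.

    Raises ValueError with a readable message instead of a bare
    JSONDecodeError when the reply is empty or not JSON, which is what
    happens when the API call itself failed (for example with a 404).
    """
    if raw is None or not raw.strip():
        raise ValueError(
            "LLM returned no content; check the API error logged before "
            "this point (model name, endpoint, API key)."
        )
    try:
        parsed = json.loads(raw)
    except json.JSONDecodeError as exc:
        # Show the start of the reply so the real cause is visible.
        raise ValueError(f"LLM reply is not valid JSON: {raw[:200]!r}") from exc
    if not isinstance(parsed, dict):
        raise ValueError(f"Expected a JSON object, got {type(parsed).__name__}")
    return parsed
```

With a check like this, a wrong model name or endpoint would surface as a message about the failed API call instead of a JSON traceback, which also fits the report that switching to a different model API made the error go away.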
Hi @glienard, I am facing a similar issue. Did you happen to find a fix for this?