Instructions to use microsoft/OmniParser-v2.0 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/OmniParser-v2.0 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("microsoft/OmniParser-v2.0", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Poor result on local device compared to web
#29
by KJCD2K - opened
I've tested the performance of local device (GTX 1060) and web app. Most of the icons are detected on the web app but only a few icons are detected locally. The problem is mainly related to YOLO prediction. I keep the following factors the same.
- original image
- Box Threshold = 0.05
- IOU Threshold = 0.1
- Icon Detect Image Size = 640 (I tried to increase the value locally but there is no improvement.)
- No preprocessing on both sides.
- model.pt (The YOLO model is downloaded from the repository.)
Anybody knows what makes the difference?


