GUI-Spotlight: Adaptive Iterative Focus Refinement for Enhanced GUI Visual Grounding.

Introduction

GUI_Spotlight is a think-with-image GUI visual grounding model. For each step, it first calls tooling to crop the image according to its own predictions, and then returns an exact coordinate location.

Setup

cd GUI_Spotlight
conda create --name spotlight python=3.12
conda activate spotlight
conda install -c conda-forge uv
uv pip install -e .

Evaluation

Screenspot-pro

python screenspot_pro_evaluation.py

OSWorld-G (Need to download the dataset by yourself)

python osworld_g_evaluation.py \
  --model Bin12345/GUI-Spotlight \
  --dataset_json OSWorld-G_refined.json \
  --images_dir OSWorld-G/benchmark/images \
  --batch_size 1

UI-Vision (Need to download the dataset by yourself)

python uivision_evaluation.py \
  --model Bin12345/GUI-Spotlight \
  --dataset_json `uivision/annotations` \
  --images_dir `ui-vision/images` \
  --batch_size 1

Single Sample Inference

python inference.py --prompt `Your prompt` --image_path `Image Path` --model `The name of the model`

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
asset		asset
spotlight		spotlight
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
inference.py		inference.py
osworld_g_evaluation.py		osworld_g_evaluation.py
pyproject.toml		pyproject.toml
screenspot_pro_evaluation.py		screenspot_pro_evaluation.py
uivision_evaluation.py		uivision_evaluation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GUI-Spotlight: Adaptive Iterative Focus Refinement for Enhanced GUI Visual Grounding.

Introduction

Setup

Evaluation

Single Sample Inference

About

Uh oh!

Releases

Packages

Languages

License

bin123apple/GUI_Spotlight

Folders and files

Latest commit

History

Repository files navigation

GUI-Spotlight: Adaptive Iterative Focus Refinement for Enhanced GUI Visual Grounding.

Introduction

Setup

Evaluation

Single Sample Inference

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages