A powerful web-based tool that uses Google's Gemini AI to analyze your images, providing detailed descriptions, insights, and suggestions - all through a simple user interface.
- Web UI Interface: Easy-to-use interface with image preview
- Multiple Image Upload: Process multiple images at once with the same style/prompt
- Multiple Analysis Types: Choose from 10+ pre-defined analysis options
- Custom Prompts: Ask specific questions about your images
- Real-time Feedback: Get detailed AI-powered image analysis
- Simple Setup: Works on Windows, Mac, and Linux
- Python 3.7 or higher
- Google Gemini API key (get one free at Google AI Studio)
- Internet connection
git clone https://github.com/yourusername/gemini-image-editor.git
cd gemini-image-editor
# Windows: create virtual environment
python -m venv venv
# Windows: activate virtual environment
venv\Scripts\activate
# macOS/Linux: create virtual environment
python3 -m venv venv
# macOS/Linux: activate virtual environment
source venv/bin/activate
pip install -r requirements.txt
- Copy the example environment file:
cp .env.example .env
- Edit .env and replace your_gemini_api_key_here with your actual Gemini API key.
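To confirm that the key in .env is actually being picked up, a minimal standard-library sketch like the one below can help. It is not the application's own loader, and the variable name `GEMINI_API_KEY` is an assumption; use whatever name appears in .env.example.

```python
import os
from pathlib import Path

PLACEHOLDER = "your_gemini_api_key_here"


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    env_path = Path(path)
    if not env_path.is_file():
        return {}
    values = {}
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("'\"")
    return values


def get_api_key(path=".env", name="GEMINI_API_KEY"):
    """Return the key, or raise if it is missing or still the placeholder."""
    # GEMINI_API_KEY is an assumed name, not confirmed by this README.
    key = os.environ.get(name) or load_env_file(path).get(name, "")
    if not key or key == PLACEHOLDER:
        raise ValueError(f"{name} is not set; edit {path} and add your key")
    return key
```

Calling `get_api_key()` from the project directory raises a `ValueError` if the placeholder was never replaced, which is a quick way to rule out the most common setup mistake.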
# Ensure your virtual environment is activated
python gemini_webui.py
This will start a local web server at http://localhost:7860. Open this URL in your browser to access the interface.
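If the page does not load, it can help to check whether anything is listening on the port at all. The sketch below is a generic TCP check using only the standard library; the function name `is_server_up` is hypothetical and not part of this project.

```python
import socket


def is_server_up(host="localhost", port=7860, timeout=1.0):
    """Return True if something accepts TCP connections at host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution errors.
        return False
```

A `False` result usually means the script is not running, exited with an error, or is bound to a different port.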
- Upload Images: Click the gallery area or drag and drop one or multiple images (max 10MB per image)
- Choose Analysis Type: Select from the dropdown menu or pick "none" to use only a custom prompt
- Enter a Custom Prompt (optional): For example, "What objects are in this image?"
- Click "Analyze Images": Wait a few seconds for Gemini to analyze your images
- View Results: Read Gemini's detailed analysis of your images
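Before uploading a large batch, the size limit in the steps above can be checked locally. This is a sketch under two assumptions: that "10MB" means 10 × 1024 × 1024 bytes, and that only JPG and PNG files are accepted. The function name `check_upload` is hypothetical.

```python
from pathlib import Path

MAX_BYTES = 10 * 1024 * 1024  # assumed reading of the 10MB limit
ALLOWED_SUFFIXES = {".jpg", ".jpeg", ".png"}  # assumption: JPG and PNG only


def check_upload(path):
    """Return None if the file looks acceptable, else a short reason."""
    p = Path(path)
    if not p.is_file():
        return "file not found"
    if p.suffix.lower() not in ALLOWED_SUFFIXES:
        return f"unsupported format: {p.suffix or '(none)'}"
    if p.stat().st_size > MAX_BYTES:
        return "file exceeds 10MB"
    return None
```

Running it over a folder, for example `[f for f in Path("photos").iterdir() if check_upload(f)]`, lists the files that would likely be rejected.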
Type | Description |
---|---|
none | Custom prompt only |
describe | Detailed description of the image |
analyze | Composition and element analysis |
style | Artistic style and technique identification |
objects | List of all visible objects |
colors | Color palette and mood analysis |
text | Text extraction and reading |
quality | Technical quality assessment |
suggest | Improvement suggestions |
context | Context and story interpretation |
similar | Description of similar images |
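One way to picture how an analysis type and a custom prompt combine is a lookup table plus a small join step. The sketch below is illustrative only: the prompt wording and the `build_prompt` function are hypothetical, and the real prompts in gemini_webui.py may differ.

```python
# Keys match the table above; the prompt texts are illustrative assumptions.
ANALYSIS_PROMPTS = {
    "none": "",
    "describe": "Describe this image in detail.",
    "analyze": "Analyze the composition and elements of this image.",
    "style": "Identify the artistic style and techniques used.",
    "objects": "List all visible objects in this image.",
    "colors": "Describe the color palette and the mood it creates.",
    "text": "Extract and read any text in this image.",
    "quality": "Assess the technical quality of this image.",
    "suggest": "Suggest improvements for this image.",
    "context": "Interpret the context and story of this image.",
    "similar": "Describe images that would be similar to this one.",
}


def build_prompt(analysis_type="none", custom_prompt=""):
    """Join the preset prompt and the custom prompt into one string."""
    if analysis_type not in ANALYSIS_PROMPTS:
        raise ValueError(f"unknown analysis type: {analysis_type}")
    parts = [ANALYSIS_PROMPTS[analysis_type], custom_prompt.strip()]
    prompt = " ".join(part for part in parts if part)
    if not prompt:
        raise ValueError("choose an analysis type or enter a custom prompt")
    return prompt
```

With "none" selected, only the custom prompt is sent; with any other type, the custom prompt is appended to the preset as a follow-up question.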
- "What emotions does this image convey?"
- "Is this image suitable for a professional presentation?"
- "What photography techniques were used?"
- "Identify any potential copyright issues"
- "What time of day was this photo taken?"
- "Suggest ways to improve the composition"
- "What story does this image tell?"
- "Identify the location or type of environment"
- Double-check your API key in the .env file
- Ensure you have the free Gemini API tier activated
- Make sure you're using Python 3.7+
- Verify all packages are installed with pip list
- If gradio installation fails, try pip install --upgrade pip, then reinstall
- Ensure images are in common formats (JPG, PNG)
- Try with smaller images if you encounter timeouts
- Check your internet connection
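Several of the checks above can be automated. The following sketch reports an outdated Python version and missing packages; the `diagnose` function is hypothetical, and the default package list only includes gradio because it is the one dependency named here. Add the other top-level import names from requirements.txt as needed.

```python
import importlib.util
import sys


def diagnose(packages=("gradio",), min_version=(3, 7)):
    """Return a list of human-readable problems; empty means all good."""
    problems = []
    if sys.version_info < min_version:
        wanted = ".".join(str(n) for n in min_version)
        problems.append(f"Python {wanted}+ required")
    for name in packages:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            found = False
        if not found:
            problems.append(f"package not installed: {name}")
    return problems


if __name__ == "__main__":
    for problem in diagnose():
        print(problem)
```

Run it inside the activated virtual environment; running it outside would inspect the system Python instead.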
- Uses the gemini-1.5-flash model for image analysis
- Note: This tool analyzes images but does not generate or edit them
- Free tier has usage limitations (check Google AI Studio for details)
- Analysis quality varies based on image clarity and prompt specificity
- Maximum file size: 10MB per image
- Images are processed through Google's Gemini API
- Review Google's privacy policy for more information
- No images are permanently stored by this application
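To make the notes above concrete, here is a minimal sketch of sending one image and one prompt to the gemini-1.5-flash model with the `google-generativeai` Python SDK. Whether gemini_webui.py calls the SDK this way is an assumption, and the helper names `image_part` and `analyze_image` are hypothetical. The image bytes are read from disk and sent inline with the request.

```python
import mimetypes
from pathlib import Path

MODEL_NAME = "gemini-1.5-flash"  # model named in the notes above


def image_part(path):
    """Build the inline-image dict that generate_content accepts."""
    mime, _ = mimetypes.guess_type(str(path))
    if mime not in ("image/jpeg", "image/png"):
        raise ValueError(f"unsupported image type: {mime}")
    return {"mime_type": mime, "data": Path(path).read_bytes()}


def analyze_image(path, prompt, api_key):
    """Send one prompt and one image to Gemini and return the reply text."""
    # Imported lazily so image_part works without the SDK installed.
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel(MODEL_NAME)
    response = model.generate_content([prompt, image_part(path)])
    return response.text
```

Nothing in this sketch writes the image anywhere locally; what happens to it after upload is governed by Google's terms rather than by the calling code.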
MIT License
Crafted with ❤️ for creative image enthusiasts