ddd_image_organize

A Windows 11 image organizer application that uses LLMs, with vision capabilities, accessed through services like OpenAI, Google Gemini, and local interfaces like LM Studio and Ollama, to automatically categorize images into folders based on what is depicted in the images.

Usage

Clone the repository.
Ensure Python is installed on your system.
Install the required dependencies (e.g., pip install -r requirements.txt).
Run the application (python main.py).
Choose an LLM with vision capability through an 'API' such as OpenAI, Google Gemini, LM Studio, Ollama or other Local options (Local and LM Studio are both local but for now they are seperate)).
- If using LM Studio, provide the 0.json file to LM Studio's structured output option, to help it provide a structured JSON output. Ensure LM Studio is running and a LLM with vision capability is loaded. The program will attempt to connect to LM Studio at http://127.0.0.1:1234/v1/chat/completions.
- "Local" is not implemented. But otherwise you would provide the path to the local LLM model file in the "Enter Local LLM Model Path" text box.
- If using OpenAI or Google Gemini, enter your API key.
- If using Ollama, ensure Ollama is installed and running. The program is configured to connect to Ollama at http://localhost:11434/v1. No API key is required. The application iterates through each uploaded image, converts it to JPEG format, and sends it to the Ollama server with a prompt to analyze the image and return a single word directory name. The application then creates a folder with that name (if it doesn't exist) and moves the image into that folder.
Select a directory to organize the images into.
Upload images using the "Upload Images" button.
Click the "Organize Images" button.

Dependencies

PyQt5
requests
openai
google.generativeai
llama_cpp (for local LLM)
pillow

Future Goals

LM Studio is not able to process or organize images properly, so it needs fixing. Google Gemini and Ollama are working well.
Handle greater amounts of images.
Standardized Organization: Implement a feature to allow users to define a limited set of categories for image organization to avoid too many catogories. The prompt option may provide this guidance, but has yet to be tested.
Implement a configuration/preference system.

Future Goals Fixed

2025-3-12 - Enhance accuracy in image identification and folder placement. The current LLM is not accurate in identifying images. This is an area that needs further development. I assume the issue is that the images are being uploaded to the LLMs incorrectly. This project is being shared on GitHub to encourage contributions to improve this aspect of the project. [Fix: It was rigid prompting that was causing the LLM to be limited.]

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
__pycache__		__pycache__
images		images
images_Gemini_Organized		images_Gemini_Organized
0.json		0.json
README.md		README.md
design_document.md		design_document.md
file_manager.py		file_manager.py
folder_manager.py		folder_manager.py
gemini_organizer.py		gemini_organizer.py
image.png		image.png
image_organizer.log		image_organizer.log
lmstudio_organizer.py		lmstudio_organizer.py
main.py		main.py
ollama_organizer.py		ollama_organizer.py
openai_organizer.py		openai_organizer.py
requirements.txt		requirements.txt
ui.py		ui.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ddd_image_organize

Usage

Dependencies

Future Goals

Future Goals Fixed

About

Releases

Packages

Contributors 2

Languages

dadadies/ddd_image_organize

Folders and files

Latest commit

History

Repository files navigation

ddd_image_organize

Usage

Dependencies

Future Goals

Future Goals Fixed

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages