Feature/local processor for labels #20

hardyliao85 · 2025-09-22T13:39:56Z

Description

Enhance Vision Service for label generation with configurable languages (en and zh_TW).

Changes:

Prevent model loading timeouts with --timeout 120 in Dockerfile.
Updated requirements.txt to specify torch==2.8.0+cu126 (CUDA version) for GPU inference support. Requires GPU driver: Linux >=525.60.13, Windows >=527.41.
Added 5 Hugging Face image classification models for hardware-based selection.
LABELS_LOCALE configurable via Docker Compose.

Below is my test configuration
compose:

# Only show key parts
services:
  photoprism-vision:
    image: photoprism/vision:develop
    ports:
      - "5000:5000"
    environment:
      NVIDIA_VISIBLE_DEVICES: "all"
      NVIDIA_DRIVER_CAPABILITIES: "compute,utility"
      LABELS_LOCALE: "zh_TW"
    volumes:
      - ./photoprism_vision_models:/app/models
      - ./venv:/app/venv
    deploy:
      resources:
        reservations:
          devices:
            - driver: "nvidia"
              capabilities: [ gpu ]
              count: "all"

vision.yml:

- Type: labels
  Name: convnextv2_huge.fcmae_ft_in22k_in1k_384
  Resolution: 384
  Service:
    Uri: http://IP:5000/api/v1/vision/labels
    FileScheme: data
    RequestFormat: vision
    ResponseFormat: vision

Related Issues

Partially addresses #19 (main functionality implemented; language support pending)

Acceptance Criteria

New features or enhancements are fully implemented and do not break existing functionality, so that they can be released at any time without requiring additional work
Automated unit and/or acceptance tests are included to ensure that changes work as expected and to reduce repetitive manual work
Documentation has been / will be updated, especially as it relates to new configuration options or potentially disruptive changes

Note: Documentation updated only in README; no plans to update other official documents.
Note: I don't have much experience with PRs. I apologize if anything is unclear or not done properly. Please feel free to give me feedback, and I am happy to make any necessary corrections. Thank you very much.

…onversion photoprism#19

hardyliao85 added 3 commits September 21, 2025 23:25

Avoid timeouts when loading larger models

75d0799

Vision Service: Add label generation feature and locale-based label c…

b63f6d9

…onversion photoprism#19

Update README.md

89ac238

graciousgrey requested a review from lastzero September 22, 2025 14:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/local processor for labels #20

Feature/local processor for labels #20

Uh oh!

hardyliao85 commented Sep 22, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Feature/local processor for labels #20

Are you sure you want to change the base?

Feature/local processor for labels #20

Uh oh!

Conversation

hardyliao85 commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes:

Related Issues

Acceptance Criteria

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

hardyliao85 commented Sep 22, 2025 •

edited

Loading