PTZ APP

This application autonomously captures and publishes images of specified objects using a pan-tilt-zoom (PTZ) camera.

How It Works

The algorithm performs the following steps:

1. Initialization: Sets up the object detection model (YOLO or Florence) based on user parameters.
2. Area Scanning: Systematically scans the environment by rotating the PTZ camera in pan steps (default: 15 degrees) through a full 360° rotation at the specified tilt and zoom level.
3. Object Detection: At each camera position, captures an image and runs object detection to identify the specified objects (e.g., person, car, dog).
4. Filtering: Discards detections below the confidence threshold (default: 0.1).
5. Object Tracking: When an object of interest is detected with sufficient confidence, the algorithm:
   - Centers the camera on the detected object
   - Adjusts the zoom so the object fills as much of the frame as possible
6. Image Publishing: Saves and publishes the optimized images of detected objects.
7. Iteration: Repeats the process for the specified number of iterations, with a configurable delay between scans.
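The scanning, filtering, and centering steps above can be sketched roughly as follows. This is an illustrative sketch only, not the code in main.py: the `Detection` class, the function names, and the field-of-view values are hypothetical, the pan step is assumed to be a whole number of degrees, and the centering math is a simple linear approximation.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """Hypothetical container for one detection result."""
    label: str
    confidence: float
    box: tuple  # (x_min, y_min, x_max, y_max) in pixels


def pan_positions(pan_step=15):
    """Pan angles (degrees) visited during one full 360° sweep."""
    return list(range(0, 360, pan_step))


def filter_detections(detections, wanted, threshold=0.1):
    """Keep detections of wanted classes at or above the confidence threshold."""
    return [d for d in detections
            if d.label in wanted and d.confidence >= threshold]


def centering_offset(box, frame_w, frame_h, hfov_deg, vfov_deg):
    """Approximate pan/tilt change (degrees) that would center the box.

    Assumes the angle scales linearly with the pixel offset, which is
    only a rough approximation for real lenses.
    """
    cx = (box[0] + box[2]) / 2
    cy = (box[1] + box[3]) / 2
    d_pan = (cx - frame_w / 2) / frame_w * hfov_deg
    # Image y grows downward, so a box above center needs a positive tilt.
    d_tilt = -(cy - frame_h / 2) / frame_h * vfov_deg
    return d_pan, d_tilt


if __name__ == "__main__":
    # 24 positions: 0, 15, ..., 345
    print(pan_positions(15))

    detections = [
        Detection("person", 0.62, (1400, 500, 1480, 580)),
        Detection("car", 0.05, (100, 100, 300, 200)),  # below threshold
    ]
    for det in filter_detections(detections, {"person", "car"}):
        # Frame size and field of view here are made-up example values.
        print(det.label, centering_offset(det.box, 1920, 1080, 60, 34))
```

In a real sweep, the camera would be moved to each pan angle, a frame captured, and the filtered detections passed on to the centering and zoom step.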

Build the container

sudo docker buildx build --platform=linux/amd64,linux/arm64/v8 -t your_docker_hub_user_name/ptzapp -f Dockerfile --push .

Then pull the image from Docker Hub on the node:

sudo docker image pull your_docker_hub_user_name/ptzapp

Run the container on a Dell blade

sudo docker run --gpus all -it --rm your_docker_hub_user_name/ptzapp:latest -ki -it 5 -un camera_user_name -pw camera_password -ip camera_ip_address -obj person,car

Run the container on a Waggle node

sudo docker run -it --rm your_docker_hub_user_name/ptzapp:latest -ki -it 5 -un camera_user_name -pw camera_password -ip camera_ip_address -obj person,car

Example with Florence model

sudo docker run --gpus all -it --rm your_docker_hub_user_name/ptzapp:latest --model Florence-base --iterations 5 --username username --password 'password' --cameraip 130.202.23.92 --objects 'person,car'

Using Different Object Detection Models

YOLO (Default)

By default, the application uses the YOLO model (yolo11n) for object detection. Specify objects by name:

sudo docker run --gpus all -it --rm your_docker_hub_user_name/ptzapp:latest --objects 'person,car,dog'

Florence Models

Florence models offer additional detection capabilities, such as detecting every object in a scene. Select one with the `--model` argument:

sudo docker run --gpus all -it --rm your_docker_hub_user_name/ptzapp:latest --model Florence-base --objects 'person,car'

Detecting All Objects with Florence

To detect all objects using Florence models, pass an asterisk as the `--objects` value (quote it so the shell does not expand it):

sudo docker run --gpus all -it --rm your_docker_hub_user_name/ptzapp:latest --model Florence-base --objects '*'
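As a rough illustration of how an `--objects` value could be interpreted, the sketch below splits a comma-separated list and treats '*' as "detect everything". The function name `parse_objects` and the choice of `None` to mean "all classes" are assumptions made for this example; the application's own parsing may differ.

```python
def parse_objects(spec):
    """Turn an --objects string into a list of class names.

    Returns None when the value is '*', meaning no class filter
    (hypothetical convention for this sketch).
    """
    spec = spec.strip()
    if spec == "*":
        return None
    # Split on commas and drop empty entries and stray whitespace.
    return [name.strip() for name in spec.split(",") if name.strip()]


if __name__ == "__main__":
    print(parse_objects("person,car,dog"))
    print(parse_objects("*"))
```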

Saving multiple images per discrete sweep location

To save multiple images when there are multiple detections in a single frame, use the --multiple flag:

sudo docker run --gpus all -it --rm your_docker_hub_user_name/ptzapp:latest --model Florence-base --objects '*' --multiple
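The effect of `--multiple` might be pictured with the sketch below. The README does not state what happens without the flag, so this example assumes that only the highest-confidence detection in a frame is kept; that assumption, the function name `select_detections`, and the `(label, confidence)` tuple layout are all hypothetical.

```python
def select_detections(detections, multiple=False):
    """Choose which detections in one frame lead to a saved image.

    detections: list of (label, confidence) tuples.
    With multiple=True, every detection is kept. Otherwise only the
    highest-confidence one is kept (an assumption of this sketch).
    """
    if not detections:
        return []
    if multiple:
        return list(detections)
    return [max(detections, key=lambda d: d[1])]


if __name__ == "__main__":
    frame = [("person", 0.8), ("car", 0.6), ("dog", 0.3)]
    print(select_detections(frame))
    print(select_detections(frame, multiple=True))
```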

Note: When `--objects` is '*' with a Florence model, the application runs in the <OD> task mode, which performs general object detection without filtering for specific classes. This is useful for inventorying everything in a scene, but the results are more varied than when targeting specific objects.

Command Line Arguments

| Argument | Short | Description | Default |
|---|---|---|---|
| `--model` | `-m` | Model to use (e.g., 'yolo11n', 'Florence-base') | yolo11n |
| `--iterations` | `-it` | Number of iterations (PTZ rounds) to run | 5 |
| `--username` | `-un` | PTZ camera username | "" |
| `--password` | `-pw` | PTZ camera password | "" |
| `--cameraip` | `-ip` | PTZ camera IP address | "" |
| `--objects` | `-obj` | Objects to detect (comma-separated or '*' for everything) | "person" |
| `--keepimages` | `-ki` | Keep collected images in persistent folder | False |
| `--panstep` | `-ps` | Pan step in degrees | 15 |
| `--tilt` | `-tv` | Tilt value in degrees | 0 |
| `--zoom` | `-zm` | Zoom value | 1 |
| `--confidence` | `-conf` | Confidence threshold (0-1) | 0.1 |
| `--iterdelay` | `-id` | Minimum delay in seconds between iterations | 60.0 |
| `--debug` | | Enable debug level logging | False |
| `--multiple` | | Save multiple images for multiple detections in a single frame | False |
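For readers who want to see how the arguments above fit together, here is a minimal `argparse` sketch that mirrors the table. The flag names and defaults come from the table; the value types (for example, float for pan step, tilt, and zoom) and the function name `build_parser` are assumptions, and the parser in main.py may be written differently.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented flags (illustrative only)."""
    p = argparse.ArgumentParser(description="PTZ APP (CLI sketch)")
    p.add_argument("--model", "-m", default="yolo11n")
    p.add_argument("--iterations", "-it", type=int, default=5)
    p.add_argument("--username", "-un", default="")
    p.add_argument("--password", "-pw", default="")
    p.add_argument("--cameraip", "-ip", default="")
    p.add_argument("--objects", "-obj", default="person")
    p.add_argument("--keepimages", "-ki", action="store_true")
    # Numeric types below are assumed; the table only gives defaults.
    p.add_argument("--panstep", "-ps", type=float, default=15)
    p.add_argument("--tilt", "-tv", type=float, default=0)
    p.add_argument("--zoom", "-zm", type=float, default=1)
    p.add_argument("--confidence", "-conf", type=float, default=0.1)
    p.add_argument("--iterdelay", "-id", type=float, default=60.0)
    p.add_argument("--debug", action="store_true")
    p.add_argument("--multiple", action="store_true")
    return p


if __name__ == "__main__":
    # Same short flags as the Waggle node example, with placeholder values.
    args = build_parser().parse_args(
        ["-ki", "-it", "5", "-un", "camera_user_name",
         "-pw", "camera_password", "-ip", "camera_ip_address",
         "-obj", "person,car"]
    )
    print(args)
```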
