# Radio source detection with YOLO object detector
This software is distributed under the GPLv3 license. If you use it for your research, please add a reference to this GitHub repository and acknowledge this work in your paper:

- S. Riggi et al., Astronomical source detection in radio continuum maps with deep neural networks, 2023, Astronomy and Computing, 42, 100682, doi
To build and install the package:

- Download the software in a local directory, e.g. `SRC_DIR`:

  ```shell
  $ git clone https://github.com/SKA-INAF/caesar-yolo.git
  ```

- Create and activate a virtual environment, e.g. `caesar-yolo`, under a desired path `VENV_DIR`:

  ```shell
  $ python3 -m venv $VENV_DIR/caesar-yolo
  $ source $VENV_DIR/caesar-yolo/bin/activate
  ```

- Install dependencies inside the virtual environment:

  ```shell
  (caesar-yolo)$ pip install -r $SRC_DIR/requirements.txt
  ```

- Build and install the package in the virtual environment:

  ```shell
  (caesar-yolo)$ python setup.py install
  ```
To use the package scripts:

- Add the binary directory to your `PATH` environment variable:

  ```shell
  export PATH=$PATH:$VENV_DIR/caesar-yolo/bin
  ```
To detect source objects in input images, use the provided script `run.py`:

```shell
(caesar-yolo)$ python $VENV_DIR/caesar-yolo/bin/run.py [OPTIONS]
```
Supported options are:
**INPUT DATA**

- `--image=[VALUE]`: Path to input image in FITS format
- `--datalist=[VALUE]`: Path to input data filelist containing a list of JSON files
- `--maxnimgs=[VALUE]`: Max number of images to consider in the dataset (-1=all). Default: -1
- `--xmin=[VALUE]`: Image min x pixel to be read (read all if -1). Default: -1
- `--xmax=[VALUE]`: Image max x pixel to be read (read all if -1). Default: -1
- `--ymin=[VALUE]`: Image min y pixel to be read (read all if -1). Default: -1
- `--ymax=[VALUE]`: Image max y pixel to be read (read all if -1). Default: -1
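As an illustration of the cutout options above, here is a minimal sketch of reading a sub-region from a 2D image array, assuming the usual numpy `[y, x]` indexing of FITS data (the `cutout` helper is hypothetical, not the package's internal code):

```python
import numpy as np

def cutout(data, xmin=-1, xmax=-1, ymin=-1, ymax=-1):
    """Return a rectangular sub-image; -1 means read the full axis.
    Numpy arrays loaded from FITS files are indexed [y, x]."""
    x0 = 0 if xmin < 0 else xmin
    x1 = data.shape[1] if xmax < 0 else xmax
    y0 = 0 if ymin < 0 else ymin
    y1 = data.shape[0] if ymax < 0 else ymax
    return data[y0:y1, x0:x1]

img = np.arange(100.0).reshape(10, 10)
sub = cutout(img, xmin=2, xmax=6, ymin=1, ymax=4)
# sub.shape == (3, 4)
```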
**MODEL**

- `--weights=[VALUE]`: Path to the model weight file (.pt). This option is mandatory. Various pre-trained models are provided (see below).
**DATA PRE-PROCESSING**

- `--preprocessing`: Enable image pre-processing. Default: disabled
- `--imgsize=[SIZE]`: Size in pixels used to resize the input image. Default: 640
- `--normalize_minmax`: Normalize each channel to the range [norm_min, norm_max]. Default: no normalization
- `--norm_min=[VALUE]`: Normalization min value. Default: 0.0
- `--norm_max=[VALUE]`: Normalization max value. Default: 1.0
- `--subtract_bkg`: Subtract the background from the reference channel image. Default: no subtraction
- `--sigma_bkg=[VALUE]`: Sigma clip value used in background calculation. Default: 3.0
- `--use_box_mask_in_bkg`: Compute the background in the borders left by the box mask. Default: not used
- `--bkg_box_mask_fract=[VALUE]`: Size of the mask box with respect to the image size, used in background calculation. Default: 0.7
- `--bkg_chid=[VALUE]`: Channel used to subtract the background (-1=all). Default: -1
- `--clip_shift_data`: Apply sigma clip shifting. Default: not applied
- `--sigma_clip=[VALUE]`: Sigma threshold used for clipping & shifting pixels. Default: 1.0
- `--clip_data`: Apply sigma clipping. Default: not applied
- `--sigma_clip_low=[VALUE]`: Lower sigma threshold: pixels below (mean - sigma_low x stddev) are clipped. Default: 10.0
- `--sigma_clip_up=[VALUE]`: Upper sigma threshold: pixels above (mean + sigma_up x stddev) are clipped. Default: 10.0
- `--clip_chid=[VALUE]`: Channel used to clip data (-1=all). Default: -1
- `--zscale_stretch`: Apply a zscale transform to the data. Default: not applied
- `--zscale_contrasts=[VALUES]`: zscale contrasts applied to all channels, separated by commas. Default: 0.25,0.25,0.25
- `--chan3_preproc`: Use the 3-channel pre-processor. Default: not used
- `--sigma_clip_baseline=[VALUE]`: Lower sigma threshold used to clip pixels below (mean - sigma_low x stddev) in the first channel of the 3-channel pre-processing. Default: 0.0
- `--nchannels=[VALUE]`: Number of channels. If pre-processing modifies the number of channels, you must set this option accordingly. Default: 1
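Two of the pre-processing steps above, min-max normalization and sigma clipping, can be sketched with numpy as follows (hypothetical helpers under assumed semantics, not the package's actual implementation):

```python
import numpy as np

def normalize_minmax(img, norm_min=0.0, norm_max=1.0):
    """Linearly rescale pixel values into [norm_min, norm_max] (assumes max > min)."""
    lo, hi = np.nanmin(img), np.nanmax(img)
    return norm_min + (img - lo) * (norm_max - norm_min) / (hi - lo)

def sigma_clip(img, sigma_low=10.0, sigma_up=10.0):
    """Clip pixels outside [mean - sigma_low*std, mean + sigma_up*std]."""
    mean, std = np.nanmean(img), np.nanstd(img)
    return np.clip(img, mean - sigma_low * std, mean + sigma_up * std)

img = np.random.default_rng(0).normal(size=(64, 64))
out = normalize_minmax(sigma_clip(img, 1.0, 1.0), 0.0, 255.0)
# out now spans exactly [0, 255]
```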
**SOURCE DETECTION**

- `--scoreThr=[VALUE]`: Object detection score threshold, below which objects are not considered sources. Default: 0.7
- `--iouThr=[VALUE]`: Intersection-over-Union (IoU) threshold used in Non-Maximum Suppression (NMS). Default: 0.5
- `--merge_overlap_iou_thr_soft=[VALUE]`: Soft IoU threshold used to merge overlapping detected objects of the same class. Default: 0.3
- `--merge_overlap_iou_thr_hard=[VALUE]`: Hard IoU threshold used to merge overlapping detected objects even if of different classes. Default: 0.7
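The soft/hard IoU merging logic described above can be sketched as follows (`iou` and `should_merge` are hypothetical helpers illustrating the thresholds, not the package's actual code):

```python
def iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter) if inter > 0 else 0.0

def should_merge(box_a, cls_a, box_b, cls_b, thr_soft=0.3, thr_hard=0.7):
    """Soft threshold applies to same-class pairs, hard threshold to any pair."""
    v = iou(box_a, box_b)
    return v >= thr_soft if cls_a == cls_b else v >= thr_hard
```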
**RUN**

- `--devices=[VALUE]`: Device used for inference (e.g., cpu, cuda:0 or 0). Default: cpu
- `--multigpu`: Enable multi-GPU inference. Default: disabled
**PARALLEL PROCESSING**

- `--split_img_in_tiles`: Split the input image into multiple subtiles for parallel processing. Default: disabled
- `--tile_xsize=[VALUE]`: Subtile size in pixels along x. Default: 512
- `--tile_ysize=[VALUE]`: Subtile size in pixels along y. Default: 512
- `--tile_xstep=[VALUE]`: Subtile step fraction along x (=1 means no overlap). Default: 1.0
- `--tile_ystep=[VALUE]`: Subtile step fraction along y (=1 means no overlap). Default: 1.0
- `--max_ntasks_per_worker=[VALUE]`: Max number of tasks assigned to an MPI worker. Default: 100
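The tiling scheme above (a tile size plus a step fraction, where 1 means adjacent, non-overlapping tiles) can be sketched as follows (`tile_origins` is a hypothetical helper, not the package's internal code):

```python
def tile_origins(img_size, tile_size, step_fract=1.0):
    """Top-left offsets of subtiles along one axis; step_fract=1 -> no overlap."""
    step = max(1, int(round(tile_size * step_fract)))
    origins = list(range(0, max(img_size - tile_size, 0) + 1, step))
    # Ensure the last tile reaches the image border.
    if origins[-1] + tile_size < img_size:
        origins.append(img_size - tile_size)
    return origins

tiles = [(x, y) for y in tile_origins(1024, 512) for x in tile_origins(1024, 512)]
# 4 non-overlapping 512x512 tiles for a 1024x1024 image
```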
**PLOTTING**

- `--draw_plots`: Enable plotting of the image with inference results superimposed. Default: disabled
- `--draw_class_label_in_caption`: Draw class labels in the detected source captions of inference plots. Default: disabled
**OUTPUT DATA**

- `--save_per_image_outputs`: Save per-image output data in datalist-driven inference runs. Default: disabled
- `--save_plots`: Save inference plots. Default: disabled
- `--save_tile_catalog`: Save catalog files for each subtile in parallel processing (for debugging). Default: disabled
- `--save_tile_region`: Save DS9 region files for each subtile in parallel processing (for debugging). Default: disabled
- `--save_tile_img`: Save the subtile images in parallel processing (for debugging). Default: disabled
- `--detect_outfile=[VALUE]`: Output plot PNG filename (generated internally if left empty). Default: empty
- `--detect_outfile_json=[VALUE]`: Output JSON filename with detected objects (generated internally if left empty). Default: empty
Below, we report a sample run script:

```shell
#!/bin/bash

# - Set env
VENV_DIR="/opt/software/venvs/caesar-yolo"
SCRIPT_DIR="$VENV_DIR/bin"
source $SCRIPT_DIR/activate

# - Set options
INPUTFILE="galaxy0001.fits"
WEIGHTFILE="weights-yolov8l_scratch_imgsize640_nepochs300.pt" # see pre-trained weights below
PREPROC_OPTS="--preprocessing --imgsize=640 --zscale_stretch --zscale_contrasts=0.25,0.25,0.25 --normalize_minmax --norm_min=0 --norm_max=255 "
DET_OPTS="--scoreThr=0.5 --merge_overlap_iou_thr_soft=0.3 --merge_overlap_iou_thr_hard=0.8 "
DRAW_OPTS="--draw_plots --save_plots --draw_class_label_in_caption "

# - Run
python $SCRIPT_DIR/run.py --image=$INPUTFILE --weights=$WEIGHTFILE \
  $PREPROC_OPTS \
  $DET_OPTS \
  $DRAW_OPTS \
  --devices="cuda:0"
```
Below, we report a sample parallel run script:

```shell
#!/bin/bash

# - Set env
VENV_DIR="/opt/software/venvs/caesar-yolo"
SCRIPT_DIR="$VENV_DIR/bin"
source $SCRIPT_DIR/activate

# - Set options
INPUTFILE="G005.5+0.0IFx_Mosaic_Mom0.fits"
WEIGHTFILE="weights-yolov8l_scratch_imgsize512_nepochs300.pt" # see pre-trained weights below
PREPROC_OPTS="--preprocessing --imgsize=512 --zscale_stretch --zscale_contrasts=0.25,0.25,0.25 --normalize_minmax --norm_min=0 --norm_max=255 "
DET_OPTS="--scoreThr=0.5 --merge_overlap_iou_thr_soft=0.3 --merge_overlap_iou_thr_hard=0.8 "
DRAW_OPTS="--draw_plots --save_plots --draw_class_label_in_caption "
PARALLEL_OPTS="--split_img_in_tiles --tile_xsize=512 --tile_ysize=512 --tile_xstep=1 --tile_ystep=1 "

# - Parallel run
mpirun -np 4 python $SCRIPT_DIR/run.py --image=$INPUTFILE --weights=$WEIGHTFILE \
  $PREPROC_OPTS \
  $DET_OPTS \
  $DRAW_OPTS \
  $PARALLEL_OPTS
```
We have trained various YOLO models (v8 and 11) from scratch on the same annotated radio dataset previously used to train the Mask R-CNN model in Riggi+2023 (see Credits for the full reference). Models were trained to detect 5 classes of radio objects:

- 0: spurious
- 1: compact
- 2: extended
- 3: extended-multisland
- 4: flagged

The original image size is 132x132 pixels. See the original publication for a description of each class and more details on the dataset.
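When parsing detection outputs, the class ids above can be mapped to labels with a simple lookup table (a sketch; the package may represent labels differently):

```python
# Label map for the 5 radio object classes, using the ids listed above.
CLASS_LABELS = {
    0: "spurious",
    1: "compact",
    2: "extended",
    3: "extended-multisland",
    4: "flagged",
}
```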
We provide below the training configuration used to produce the models, along with links to pre-trained model weights.

Training configuration:

- epochs=300
- batch=16
- erasing=0
- mosaic=0
- hsv_h=0
- hsv_s=0
- hsv_v=0
- translate=0
- degrees=180
- flipud=0.5
- fliplr=0.5
- scale=0.89
- crop_fraction=1.0
Trained models

Below, we report all trained models with the resulting F1 performance metric computed on the test set for each object class, where:

- C = completeness
- R = reliability
- F1 = 2*C*R/(C+R)
| Model Base | Img Size | Weights | File Size[MB] | F1 (compact)[%] | F1 (extended)[%] | F1 (extended-multisland)[%] | F1 (spurious)[%] | F1 (flagged)[%] | Notes |
|---|---|---|---|---|---|---|---|---|---|
| yolov8n | 640 | url | 6 | 76.5 | 88.3 | 87.8 | 53.1 | 82.2 | |
| yolov8n | 640 | url | 6 | 76.9 | 89.1 | 87.6 | 60.0 | 85.4 | pre-trained |
| yolov8l | 128 | url | 83.6 | 57.1 | 79.0 | 78.1 | 12.5 | 73.0 | |
| yolov8l | 256 | url | 83.6 | 77.1 | 87.5 | 87.7 | 35.7 | 65.1 | |
| yolov8l | 512 | url | 83.6 | 76.1 | 88.7 | 87.6 | 45.1 | 74.6 | |
| yolov8l | 512 | url | 83.6 | 77.3 | 90.2 | 88.3 | 51.9 | 78.7 | scale=0.5 |
| yolov8l | 640 | url | 83.6 | 76.0 | 87.4 | 88.2 | 41.7 | 72.3 | |
| yolov8l | 1024 | url | 83.7 | 74.1 | 88.0 | 83.6 | 40.5 | 83.0 | |
| yolo11n | 640 | url | 5.2 | 76.3 | 88.9 | 86.9 | 56.7 | 87.0 | pre-trained, scale=0.5 |
| yolo11n | 640 | url | 5.2 | 75.6 | 85.3 | 81.7 | 41.0 | 74.6 | |
| yolo11n | 640 | url | 5.2 | 76.1 | 86.5 | 86.2 | 49.6 | 77.9 | scale=0.5 |
| yolo11l | 128 | url | 48.8 | 56.3 | 77.2 | 79.5 | 18.0 | 62.7 | |
| yolo11l | 256 | url | 48.8 | 76.8 | 86.2 | 86.1 | 45.0 | 79.6 | |
| yolo11l | 512 | url | 48.8 | 76.3 | 87.9 | 87.5 | 53.0 | 82.7 | |
| yolo11l | 640 | url | 48.9 | 76.0 | 88.7 | 86.8 | 53.0 | 82.1 | |
| yolo11l | 1024 | url | 48.9 | 72.2 | 88.5 | 88.0 | 57.5 | 84.3 | |
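For reference, the F1 values in the table combine completeness (C) and reliability (R) as defined above; a minimal check in Python:

```python
def f1(completeness, reliability):
    """F1 = harmonic mean of completeness (C) and reliability (R)."""
    return 2 * completeness * reliability / (completeness + reliability)

# e.g. f1(0.8, 0.73) ≈ 0.763
```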