`mapreader.spot_text.deepsolo_runner`

Module Contents

Classes

DeepSoloRunner

class mapreader.spot_text.deepsolo_runner.DeepSoloRunner(patch_df=None, parent_df=None, cfg_file='./DeepSolo/configs/R_50/IC15/finetune_150k_tt_mlt_13_15_textocr.yaml', weights_file='./ic15_res50_finetune_synth-tt-mlt-13-15-textocr.pth', device='cpu')

Parameters:

patch_df (pandas.DataFrame)
parent_df (pandas.DataFrame)
cfg_file (str | pathlib.Path)
weights_file (str | pathlib.Path)
device (str)
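The default cfg_file and weights_file values are relative paths, so they resolve against the current working directory. A minimal sketch of a pre-flight check that could be run before constructing the runner is shown here; `missing_model_files` is a hypothetical helper, not part of MapReader.

```python
from pathlib import Path


def missing_model_files(cfg_file, weights_file):
    """Return the given paths (as strings) that do not point to an existing file.

    Hypothetical helper: it only inspects the file system and does not
    import or call MapReader or DeepSolo.
    """
    candidates = (Path(cfg_file), Path(weights_file))
    return [str(p) for p in candidates if not p.is_file()]


# The defaults from the signature above, resolved against the working directory.
missing = missing_model_files(
    "./DeepSolo/configs/R_50/IC15/finetune_150k_tt_mlt_13_15_textocr.yaml",
    "./ic15_res50_finetune_synth-tt-mlt-13-15-textocr.pth",
)
if missing:
    print("Not found:", missing)
```

An empty list means both files were found and their paths can be passed to the constructor as cfg_file and weights_file.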

run_all(patch_df=None, return_dataframe=False)

Run the model on all images in the patch dataframe.

Parameters:

patch_df (pd.DataFrame, optional) – Dataframe containing patch information, by default None.
return_dataframe (bool, optional) – Whether to return the predictions as a pandas DataFrame, by default False

Returns:

A dictionary of predictions for each patch image, or a DataFrame if return_dataframe is True.

Return type:

dict or pd.DataFrame

run_on_images(img_paths, return_dataframe=False)

Run the model on a list of images.

Parameters:

img_paths (str, pathlib.Path or list) – A single image path or a list of image paths to run the model on.
return_dataframe (bool, optional) – Whether to return the predictions as a pandas DataFrame, by default False

Returns:

A dictionary of predictions for each image, or a DataFrame if return_dataframe is True.

Return type:

dict or pd.DataFrame
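Because img_paths may be a single path or a list, a caller-side wrapper can normalise the argument first. The sketch below shows one plausible way to do that; `as_path_list` is a hypothetical helper and is not taken from the MapReader source.

```python
from pathlib import Path


def as_path_list(img_paths):
    """Return img_paths as a list of pathlib.Path objects (hypothetical helper)."""
    if isinstance(img_paths, (str, Path)):
        # Wrap a single path so that later code can always iterate.
        img_paths = [img_paths]
    return [Path(p) for p in img_paths]


print(as_path_list("patch_1.png"))
print(as_path_list(["patch_1.png", Path("patch_2.png")]))
```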

run_on_image(img_path, return_outputs=False, return_dataframe=False)

Run the model on a single image.

Parameters:

img_path (str or pathlib.Path) – The path to the image to run the model on.
return_outputs (bool, optional) – Whether to return the outputs direct from the model, by default False
return_dataframe (bool, optional) – Whether to return the predictions as a pandas DataFrame, by default False

Returns:

The predictions for the image (as a DataFrame if return_dataframe is True), or the raw model outputs if return_outputs is True.

Return type:

dict or pd.DataFrame

get_patch_predictions(outputs, return_dataframe=False)

Post process the model outputs to get patch predictions.

Parameters:

outputs (dict) – The outputs from the model.
return_dataframe (bool, optional) – Whether to return the predictions as a pandas DataFrame, by default False

Returns:

A dictionary containing the patch predictions, or a DataFrame if return_dataframe is True.

Return type:

dict or pd.DataFrame

convert_to_parent_pixel_bounds(patch_df=None, return_dataframe=False)

Convert the patch predictions to parent predictions by converting pixel bounds.

Parameters:

patch_df (pd.DataFrame, optional) – Dataframe containing patch information, by default None
return_dataframe (bool, optional) – Whether to return the predictions as a pandas DataFrame, by default False

Returns:

A dictionary of predictions for each parent image, or a DataFrame if return_dataframe is True.

Return type:

dict or pd.DataFrame

Raises:

ValueError – If patch_df is not available.
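Conceptually, converting a patch prediction to parent pixel space amounts to shifting each point by the patch's offset within the parent image. The sketch below illustrates that idea only. It assumes each patch records its position as a (min_x, min_y, max_x, max_y) tuple; `patch_to_parent` is a hypothetical helper, not the library's implementation.

```python
def patch_to_parent(points, patch_pixel_bounds):
    """Shift (x, y) points from patch pixel space into parent pixel space.

    patch_pixel_bounds is assumed to be (min_x, min_y, max_x, max_y),
    measured in pixels of the parent image.
    """
    min_x, min_y, _max_x, _max_y = patch_pixel_bounds
    return [(x + min_x, y + min_y) for x, y in points]


# A text polygon detected inside a patch whose top-left corner sits at
# (1000, 2000) in the parent image.
polygon_in_patch = [(10, 20), (60, 20), (60, 40), (10, 40)]
print(patch_to_parent(polygon_in_patch, (1000, 2000, 2000, 3000)))
# [(1010, 2020), (1060, 2020), (1060, 2040), (1010, 2040)]
```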

convert_to_coords(parent_df=None, return_dataframe=False)

Convert the parent predictions to georeferenced predictions by converting pixel bounds to coordinates.

Parameters:

parent_df (pd.DataFrame, optional) – Dataframe containing parent image information, by default None
return_dataframe (bool, optional) – Whether to return the predictions as a pandas DataFrame, by default False

Returns:

A dictionary of predictions for each parent image, or a DataFrame if return_dataframe is True.

Return type:

dict or pd.DataFrame

Raises:

ValueError – If parent_df is not available.
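The idea behind converting pixel bounds to coordinates can be sketched as a linear interpolation across the parent image's bounding box. This is an illustration under stated assumptions, not the library's code: it assumes the parent image covers (xmin, ymin, xmax, ymax) in some coordinate system, that the pixel origin is the top-left corner with y increasing downwards, and that the mapping is linear. `pixel_to_coords` is a hypothetical helper.

```python
def pixel_to_coords(points, image_size, coordinates):
    """Map (px, py) pixel positions to (x, y) coordinates by linear interpolation.

    image_size is (width, height) in pixels; coordinates is
    (xmin, ymin, xmax, ymax) for the whole image.
    """
    width, height = image_size
    xmin, ymin, xmax, ymax = coordinates
    dx = (xmax - xmin) / width   # coordinate units per pixel, horizontally
    dy = (ymax - ymin) / height  # coordinate units per pixel, vertically
    # Pixel y grows downwards, so it is subtracted from the top edge (ymax).
    return [(xmin + px * dx, ymax - py * dy) for px, py in points]


corners_and_centre = [(0, 0), (512, 256), (1024, 512)]
print(pixel_to_coords(corners_and_centre, (1024, 512), (-1.0, 50.0, 1.0, 51.0)))
# [(-1.0, 51.0), (0.0, 50.5), (1.0, 50.0)]
```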

save_to_geojson(save_path='./deepsolo_text_outputs.geojson')

Save the georeferenced predictions to a GeoJSON file.

Parameters:

save_path (str | pathlib.Path, optional) – Path to save the GeoJSON file, by default “./deepsolo_text_outputs.geojson”

Return type:

None
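For orientation, the general shape of a GeoJSON file holding text polygons can be sketched with the standard library alone. This does not reproduce MapReader's writer: the `text` property name and the `to_feature_collection` helper are assumptions made for illustration. Only the FeatureCollection/Feature/Polygon layout and the closed-ring requirement come from the GeoJSON format itself.

```python
import json


def to_feature_collection(predictions):
    """Build a GeoJSON FeatureCollection from (polygon, text) pairs.

    Each polygon is a sequence of (x, y) coordinate pairs. The "text"
    property name is a hypothetical choice for this sketch.
    """
    features = []
    for polygon, text in predictions:
        ring = [list(point) for point in polygon]
        if ring[0] != ring[-1]:
            # GeoJSON polygon rings must end on their starting point.
            ring.append(list(ring[0]))
        features.append(
            {
                "type": "Feature",
                "geometry": {"type": "Polygon", "coordinates": [ring]},
                "properties": {"text": text},
            }
        )
    return {"type": "FeatureCollection", "features": features}


collection = to_feature_collection(
    [([(-0.10, 51.50), (-0.09, 51.50), (-0.09, 51.51), (-0.10, 51.51)], "LONDON")]
)
print(json.dumps(collection, indent=2))
```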

show(image_id, figsize=(10, 10), border_color='r', text_color='b', image_width_resolution=None, return_fig=False)

Show the predictions on an image.

Parameters:

image_id (str) – The image ID to show the predictions on.
figsize (tuple | None, optional) – The size of the figure, by default (10, 10)
border_color (str | None, optional) – The color of the border of the polygons, by default “r”
text_color (str | None, optional) – The color of the text, by default “b”
image_width_resolution (int | None, optional) – The maximum resolution of the image width, by default None
return_fig (bool, optional) – Whether to return the figure, by default False

Returns:

The matplotlib figure if return_fig is True.

Return type:

matplotlib.figure.Figure

Raises:

ValueError – If the image ID is not found in the patch or parent predictions.
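Taken together, the method descriptions above suggest an order of use: run the model on the patches, convert the results to parent pixel bounds, convert those to coordinates, then save. The sketch below chains the documented methods in that order. It assumes a DeepSoloRunner that was already constructed with both patch_df and parent_df; the `spot_text` wrapper is hypothetical, and the ordering is inferred from the descriptions rather than confirmed against the source.

```python
def spot_text(runner, save_path="./deepsolo_text_outputs.geojson"):
    """Run the documented steps in sequence on an existing runner.

    `runner` is expected to expose the DeepSoloRunner methods listed above.
    """
    # 1. Predict text on every patch in the runner's patch dataframe.
    runner.run_all()
    # 2. Express the patch predictions in parent-image pixel space.
    runner.convert_to_parent_pixel_bounds()
    # 3. Georeference the parent predictions; ask for a DataFrame back.
    geo_predictions = runner.convert_to_coords(return_dataframe=True)
    # 4. Write the georeferenced predictions to disk.
    runner.save_to_geojson(save_path)
    return geo_predictions


# Usage, assuming `runner` is an already-constructed DeepSoloRunner:
# geo_df = spot_text(runner, "./my_text.geojson")
```

Passing the runner in as an argument keeps the sketch independent of how the model files and dataframes were loaded.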
