ProCrop: Learning Aesthetic Image Cropping from Professional Compositions

Ke Zhang1   Tianyu Ding3 †   Jiachen Jiang2   Tianyi Chen3  
Ilya Zharkov3   Vishal M. Patel1   Luming Liang3 †

1Johns Hopkins University    2Ohio State University    3Microsoft

Code and Dataset Release

We have released the official ProCrop implementation and the large-scale ProCrop dataset to support future research in aesthetic image cropping, image composition, and weakly supervised crop learning.

Pipeline of ProCrop

ProCrop overview figure

The pipeline of ProCrop. Given an input image, ProCrop retrieves compositionally similar professional images and generates a textual description, which guide the model to produce aesthetically enhanced crops along with corresponding aesthetic scores.

Abstract

Image cropping is crucial for enhancing the visual appeal and narrative impact of photographs, yet existing rule-based and data-driven approaches often lack diversity or require annotated training data. We introduce ProCrop, a retrieval-based method that leverages professional photography to guide cropping decisions. By fusing features from professional photographs with those of the query image, ProCrop learns from professional compositions, significantly boosting performance. Additionally, we present a large-scale dataset of 242K weakly-annotated images, generated by out-painting professional images and iteratively refining diverse crop proposals. This composition-aware dataset generation offers diverse high-quality crop proposals guided by aesthetic principles and becomes the largest publicly available dataset for image cropping. Extensive experiments show that ProCrop significantly outperforms existing methods in both supervised and weakly-supervised settings. Notably, when trained on the new dataset, our ProCrop surpasses previous weakly-supervised methods and even matches fully supervised approaches. Both the code and dataset are publicly available to advance research in image aesthetics and composition analysis.

BibTeX Citation

@inproceedings{zhang2026procrop,
  title     = {ProCrop: Learning Aesthetic Image Cropping from Professional Compositions},
  author    = {Zhang, Ke and Ding, Tianyu and Jiang, Jiachen and Chen, Tianyi and Zharkov, Ilya and Patel, Vishal M. and Liang, Luming},
  booktitle = {Proceedings of the AAAI Conference on Artificial Intelligence},
  volume    = {40},
  number    = {15},
  pages     = {12600--12608},
  year      = {2026}
}