PreferThinker: Reasoning-based Personalized Image Preference Assessment

ICLR 2026 Conference Submission, Anonymous Authors
Keywords: Image Preference Assessment; Multimodal Large Language Model; Chain-of-Thought
Abstract:

Personalized image preference assessment aims to evaluate an individual user's image preferences using only a small set of reference images as prior information. Existing methods mainly target general preference assessment, training models on large-scale data for well-defined tasks such as text-image alignment. However, these approaches struggle with personalized preferences because user-specific data are scarce and not easily scalable, and individual tastes are often diverse and complex. To overcome these challenges, we introduce a common preference profile that serves as a bridge across users, allowing large-scale user data to be leveraged for training profile prediction and capturing complex personalized preferences. Building on this idea, we propose a reasoning-based personalized image preference assessment framework that follows a predict-then-assess paradigm: it first predicts a user's preference profile from reference images, and then provides interpretable, multi-dimensional scores and assessments of candidate images based on the predicted profile. To support this, we first construct a large-scale Chain-of-Thought (CoT)-style personalized assessment dataset annotated with diverse user preference profiles and high-quality CoT-style reasoning, enabling explicit supervision of structured reasoning. Next, we adopt a two-stage training strategy: a cold-start supervised fine-tuning phase that equips the model with structured reasoning capabilities, followed by reinforcement learning that incentivizes the model to explore more reasonable assessment paths and enhances generalization. Furthermore, we propose a similarity-aware prediction reward that encourages better prediction of the user's preference profile, which in turn facilitates the exploration of more reasonable assessments. Extensive experiments demonstrate the superiority of the proposed method. Our code and dataset will be publicly released.

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (a scholarly search engine). It analyzes a paper's tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes a reasoning-based framework for personalized image preference assessment that predicts user-specific preference profiles from reference images and then evaluates candidate images accordingly. It resides in the 'Profile-Based Personalized Aesthetics Assessment' leaf, which contains five papers including the original work. This leaf sits within the broader 'Personalized Aesthetics and Preference Modeling' branch, indicating a moderately populated research direction focused on explicit profile modeling. The taxonomy shows that personalized aesthetics is one of several major branches alongside generic quality assessment and domain-specific methods, suggesting the paper addresses a well-defined but not overcrowded niche.

The taxonomy reveals neighboring leaves addressing implicit preference learning from user interactions, adaptive scalability across many users, privacy-preserving federated approaches, and specialized applications like color vision deficiency. The paper's profile-based approach contrasts with implicit methods that learn from ratings without explicit profiles and differs from generic quality assessment branches that apply universal perceptual criteria. The taxonomy's scope and exclusion notes clarify that reasoning-based assessment distinguishes this work from simpler profile-based methods, while its focus on personalization separates it from generic aesthetics models that lack user-specific customization.

Among the twenty-seven candidates examined across the three contributions, none were found to clearly refute the proposed ideas. For the common preference profile concept, ten candidates were examined with zero refutations; for the reasoning-based predict-then-assess framework, seven candidates with zero refutations; and for the CoT-style dataset and training strategy, ten candidates with zero refutations. This limited search scope (top-K semantic matches plus citation expansion, rather than an exhaustive review) suggests that, within the examined literature, the contributions appear distinct. The profile-based leaf contains four sibling papers, indicating some prior work in explicit profile modeling, though none of the examined candidates directly overlaps with the reasoning-based approach.

Based on the limited search of twenty-seven candidates, the work appears to introduce novel elements in reasoning-based personalized assessment, though the analysis does not cover the full breadth of personalized aesthetics research. The taxonomy structure shows this is an active area with multiple related directions, and the absence of refutations among examined candidates suggests the specific combination of profile prediction and reasoning-based evaluation may be distinctive within the scope analyzed.

Taxonomy

Core-task Taxonomy Papers: 50
Claimed Contributions: 3
Contribution Candidate Papers Compared: 27
Refutable Papers: 0

Research Landscape Overview

Core task: Personalized image preference assessment.

The field encompasses a broad spectrum of approaches to evaluating image quality and aesthetics, ranging from generic methods that apply universally to domain-specific techniques tailored for medical imaging, underwater scenes, or other specialized contexts. At the top level, the taxonomy distinguishes between personalized aesthetics and preference modeling, where systems adapt to individual tastes, and generic or domain-specific quality assessment that targets objective or context-dependent criteria. Additional branches address subjective evaluation studies that probe human perception, transfer and adaptation methods that align models across domains or modalities, personalized content generation and recommendation systems, benchmark datasets, and interactions between image processing and quality metrics. Review and survey papers provide overarching perspectives, such as Aesthetics Review[5], which synthesizes trends across these diverse lines of work.

Within personalized aesthetics and preference modeling, a particularly active area focuses on profile-based methods that learn user-specific or group-specific preferences from historical ratings or rich attribute annotations. Personalized Aesthetics[3] and Rich Attributes[7] exemplify efforts to capture individual differences through explicit user profiles or detailed image attributes, while Interaction Matrix[12] and Content Attribute[26] explore how content features interact with user characteristics. PreferThinker[0] sits squarely in this profile-based cluster, emphasizing reasoning mechanisms that integrate user-specific signals to predict personalized preferences. Compared to neighboring works like Personalized Aesthetics[3], which often rely on collaborative filtering or attribute-based embeddings, PreferThinker[0] introduces a more deliberative approach to modeling individual taste. This contrasts with broader generic quality assessment methods such as Deep Learning Blind[17] or KonIQ[19], which prioritize universal perceptual criteria over personalization, highlighting an ongoing tension between scalability and the granularity of user-specific adaptation.

Claimed Contributions

Common preference profile bridging users for personalized assessment

The authors propose a preference profile composed of common visual elements (such as color and art style) that characterizes individual preferences while being shared across users. This design enables leveraging large-scale data for training and addresses the challenges of limited personalized data and complex individual tastes.

10 retrieved papers
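To make the bridging idea concrete, the sketch below models a preference profile as weights over a vocabulary of visual dimensions shared by all users, so that any user's taste lives in one common space. The dimension names, the weight representation, and the cosine-similarity measure are illustrative assumptions, not the paper's actual profile schema.

```python
from dataclasses import dataclass, field
import math

# Hypothetical shared vocabulary of visual dimensions; the paper's
# actual profile schema is not specified in this report.
DIMENSIONS = ("warm_colors", "high_saturation", "minimalist",
              "anime_style", "photorealistic")

@dataclass
class PreferenceProfile:
    """A user's preferences expressed over a vocabulary shared by all users."""
    weights: dict = field(default_factory=dict)  # dimension -> strength in [0, 1]

    def vector(self):
        # Project onto the shared dimension order; missing dimensions are 0.
        return [self.weights.get(d, 0.0) for d in DIMENSIONS]

def profile_similarity(a: PreferenceProfile, b: PreferenceProfile) -> float:
    """Cosine similarity between two profiles in the shared space."""
    va, vb = a.vector(), b.vector()
    dot = sum(x * y for x, y in zip(va, vb))
    na = math.sqrt(sum(x * x for x in va))
    nb = math.sqrt(sum(x * x for x in vb))
    if na == 0.0 or nb == 0.0:
        return 0.0
    return dot / (na * nb)

# Because every user is described in the same space, data from many
# users can supervise a single profile-prediction model.
u1 = PreferenceProfile({"warm_colors": 0.9, "minimalist": 0.7})
u2 = PreferenceProfile({"warm_colors": 0.8, "anime_style": 0.6})
sim = profile_similarity(u1, u2)
```

The design choice this illustrates is that the profile vocabulary, not the individual user, is what scales: new users only require predicting a point in an already-learned space.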
Reasoning-based predict-then-assess framework (PreferThinker)

The authors develop a two-stage framework that first predicts a user's preference profile from reference images, then uses this profile to provide interpretable and multi-dimensional assessments of candidate images through structured reasoning.

7 retrieved papers
CoT-style personalized assessment dataset and two-stage training strategy

The authors create a large-scale dataset with Chain-of-Thought annotations for personalized preference assessment and employ a two-stage training approach: supervised fine-tuning for structured reasoning followed by reinforcement learning with a similarity-aware prediction reward to improve generalization.

10 retrieved papers
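The similarity-aware prediction reward is not specified in detail in this report. The sketch below shows one plausible shape for such a reward in the reinforcement-learning stage: a term rewarding overlap between the predicted and annotated profile attributes, combined with a term rewarding accuracy of the final preference score. The Jaccard overlap, the linear score term, and the `alpha` weighting are all assumptions for illustration, not the paper's formula.

```python
def jaccard(pred: set, gold: set) -> float:
    """Set overlap between predicted and annotated profile attributes."""
    if not pred and not gold:
        return 1.0
    union = pred | gold
    return len(pred & gold) / len(union)

def similarity_aware_reward(pred_attrs: set, gold_attrs: set,
                            pred_score: float, gold_score: float,
                            alpha: float = 0.5) -> float:
    """Combine profile-prediction quality with assessment accuracy.

    alpha balances the two terms; both the decomposition and the
    weighting are illustrative assumptions.
    """
    # Reward predicting the right profile, which is what steers the
    # model toward more reasonable assessment paths.
    profile_term = jaccard(pred_attrs, gold_attrs)
    # Reward an accurate final score, decaying linearly with absolute
    # error (scores assumed normalized to [0, 1]).
    score_term = max(0.0, 1.0 - abs(pred_score - gold_score))
    return alpha * profile_term + (1.0 - alpha) * score_term

r = similarity_aware_reward({"warm_colors", "minimalist"},
                            {"warm_colors", "anime_style"},
                            pred_score=0.8, gold_score=0.7)
```

Under this shaping, a rollout that identifies the right profile attributes earns partial credit even when its final score is off, which is one way a prediction reward could encourage exploration of better assessment paths.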

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution
