KnowProxy: Adapting Large Language Models by Knowledge-guided Proxy

ICLR 2026 Conference Submission · Anonymous Authors
Keywords: Indirect Tuning · Efficient Fine-tuning · Large Language Models
Abstract:

Adapting large language models (LLMs) with smaller proxy models has been shown to improve training efficiency: the LLM remains frozen while a proxy is tuned on top of it. However, this approach typically requires access to the LLM's output probability distributions, which are often inaccessible or unstable. To address this limitation, we propose KnowProxy, a knowledge-guided proxy framework in which the proxy is trained on textual knowledge rather than probability distributions. Specifically, we first elicit textual knowledge and reasoning from a frozen LLM through prompting, and the proxy model then learns to adapt this reasoning to the target task distribution. We evaluate KnowProxy on diverse reasoning benchmarks across different fine-tuning scenarios. Comprehensive results show that KnowProxy achieves competitive or even better performance without direct access to probability distributions, thereby providing a scalable and versatile alternative to traditional fine-tuning.

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (a scholarly search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes KnowProxy, a framework that trains smaller proxy models using textual knowledge elicited from frozen large language models rather than requiring access to their probability distributions. Within the taxonomy, this work resides in the Training-Time Proxy Integration leaf under Proxy-Based Model Adaptation Mechanisms, alongside two sibling papers (FedPromo and Large Small Collaboration). This leaf represents a moderately populated research direction within a broader taxonomy of 34 papers across 20 leaf nodes, indicating focused but not overcrowded attention to training-time proxy strategies for LLM adaptation.

The taxonomy structure reveals that Training-Time Proxy Integration sits adjacent to Decoding-Time Proxy Tuning (which applies proxies only at inference) and Proxy-Based Architecture Search. Neighboring branches include Knowledge-Guided Adaptation Strategies, which emphasize structured knowledge integration and domain-specific adaptation, and Parameter-Efficient Adaptation Strategies, focusing on low-rank updates and modular adapters. KnowProxy bridges these areas by combining proxy-based efficiency with knowledge-guided steering, diverging from purely mechanistic proxy methods by incorporating explicit reasoning elicitation and from pure knowledge integration approaches by maintaining the proxy architecture paradigm.

Among 30 candidate papers examined, none were identified as clearly refuting any of the three core contributions: the KnowProxy framework itself, the dynamic routing mechanism, and the knowledge elicitation process. Each contribution was assessed against 10 candidates with zero refutable overlaps found. This suggests that within the limited search scope, the combination of knowledge-guided proxy training without probability distribution access appears relatively unexplored. However, the modest search scale (30 candidates from semantic search) means the analysis captures immediate neighbors rather than exhaustive prior work, and the absence of refutations reflects this bounded examination rather than definitive novelty.

Based on the limited literature search, the work appears to occupy a distinctive position combining proxy-based efficiency with knowledge-guided adaptation. The taxonomy context shows this sits at the intersection of two active research threads, and the contribution-level analysis found no direct overlaps among examined candidates. However, the 30-paper search scope and the presence of two sibling papers in the same taxonomy leaf suggest caution in claiming broad novelty without deeper investigation of related proxy tuning and knowledge distillation literature.

Taxonomy

- 34 Core-task Taxonomy Papers
- 3 Claimed Contributions
- 30 Contribution Candidate Papers Compared
- 0 Refutable Papers

Research Landscape Overview

Core task: adapting large language models through knowledge-guided proxy training. This field addresses the challenge of efficiently adapting large language models by leveraging smaller proxy models or auxiliary knowledge sources during training.

The taxonomy reveals several complementary branches. Proxy-Based Model Adaptation Mechanisms explore how smaller models can guide or substitute for expensive large-model updates, including training-time integration approaches like KnowProxy[0] and inference-time strategies such as Tuning by Proxy[6]. Knowledge-Guided Adaptation Strategies focus on injecting structured or domain-specific knowledge into models, with works like KaSA[1] and KG-SR-LLM[3] demonstrating how external knowledge graphs or task-driven cues can steer adaptation. Parameter-Efficient Adaptation Strategies and Knowledge Transfer and Distillation Methods address scalability through techniques like low-rank updates and student-teacher frameworks, while Domain Adaptation branches tackle specialized settings from medical imaging to federated learning, and Context and Knowledge Sensitivity Control manages how models balance parametric versus contextual information.

A particularly active line of work centers on training-time proxy integration, where smaller models serve as computational surrogates to guide large-model fine-tuning without full-scale backpropagation. KnowProxy[0] exemplifies this approach by using knowledge-guided proxies during training, sitting naturally alongside FedPromo[21] and Large Small Collaboration[23], which similarly exploit small-large model synergies in federated and collaborative settings. These methods contrast with knowledge injection strategies like Selective Knowledge Injection[22] or Parametric Knowledge Guiding[25], which emphasize embedding external structured knowledge rather than relying on proxy architectures.
The trade-off revolves around whether adaptation should prioritize computational efficiency through architectural proxies or semantic richness through explicit knowledge integration. KnowProxy[0] bridges these themes by combining proxy-based efficiency with knowledge-guided steering, positioning it at the intersection of mechanistic innovation and knowledge-aware adaptation within the broader landscape of parameter-efficient LLM tuning.

Claimed Contributions

KnowProxy framework for knowledge-guided proxy adaptation

The authors introduce a novel proxy-based fine-tuning framework that adapts large language models by training smaller proxy models on textual knowledge and reasoning elicited from frozen LLMs, rather than relying on probability distributions. This design enables applicability to black-box settings where only text outputs are available.

10 retrieved papers
Dynamic routing mechanism for adaptive proxy invocation

The authors develop an adaptive routing mechanism that uses uncertainty scores elicited from the LLM's generated knowledge to determine when to invoke the proxy model. This allows the framework to selectively engage the proxy only for uncertain or unreliable LLM outputs, reducing inference overhead while maintaining accuracy.

10 retrieved papers
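To make the claimed routing behavior concrete, the following is a minimal illustrative sketch of uncertainty-gated proxy invocation. All names, the threshold value, and the proxy interface here are hypothetical assumptions for illustration, not the paper's actual implementation (the paper's details are not available in this report).

```python
# Hypothetical sketch: invoke the proxy only when the frozen LLM's
# self-reported uncertainty exceeds a threshold, otherwise return the
# LLM's answer directly to save inference cost.

def route(llm_answer: str, uncertainty: float, proxy, threshold: float = 0.5) -> str:
    """Return the LLM's answer when it is confident; otherwise defer to the proxy."""
    if uncertainty <= threshold:
        return llm_answer        # confident output: skip the proxy
    return proxy(llm_answer)     # uncertain output: let the tuned proxy adapt it


# Usage with a trivial stand-in proxy that tags revised outputs:
print(route("Paris", 0.2, lambda x: "revised:" + x))  # confident -> "Paris"
print(route("Paris", 0.9, lambda x: "revised:" + x))  # uncertain -> "revised:Paris"
```

The design choice being illustrated is that the proxy cost is paid only on the subset of inputs the LLM flags as unreliable, which is what allows the mechanism to trade a small accuracy risk for reduced overhead.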
Knowledge elicitation and filtering process for proxy training

The authors propose a method to extract textual knowledge and reasoning from LLMs via prompting, along with confidence scores for each piece of knowledge. A filtering process retains only high-confidence knowledge, which is then used to train the proxy model to align LLM-derived reasoning with target task distributions.

10 retrieved papers
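The elicit-then-filter step described above can be sketched as a simple confidence-threshold pipeline. This is an assumed reading of the process; the data structure, field names, and threshold are illustrative only.

```python
# Hypothetical sketch: keep only high-confidence knowledge snippets
# (elicited from the LLM alongside a confidence score) for proxy training.

from typing import List, NamedTuple


class Knowledge(NamedTuple):
    text: str
    confidence: float  # score elicited from the LLM with the knowledge


def filter_knowledge(items: List[Knowledge], min_confidence: float = 0.8) -> List[Knowledge]:
    """Discard low-confidence snippets before they reach proxy training."""
    return [k for k in items if k.confidence >= min_confidence]


elicited = [
    Knowledge("Water boils at 100 C at sea level.", 0.95),
    Knowledge("The moon is made of cheese.", 0.10),
]
kept = filter_knowledge(elicited)
print(len(kept))  # only the high-confidence snippet survives -> 1
```

The filtered set would then serve as the proxy's training corpus, aligning LLM-derived reasoning with the target task distribution as the contribution describes.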

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

KnowProxy framework for knowledge-guided proxy adaptation


Contribution

Dynamic routing mechanism for adaptive proxy invocation


Contribution

Knowledge elicitation and filtering process for proxy training

