Abstract

We address the problem of preference learning, which aims to learn user-specific preferences (e.g., 'good parking spot', 'convenient drop-off location') from visual input. Despite its similarity to learning factual concepts (e.g., 'red cube'), preference learning is a fundamentally harder problem due to its subjective nature and the paucity of person-specific training data.

To tackle this problem, we present a new framework called SYNAPSE, a neuro-symbolic approach designed to efficiently learn preferential concepts from limited demonstrations. SYNAPSE represents preferences as neuro-symbolic programs in a domain-specific language (DSL) that operates over images, and leverages a novel combination of visual parsing, large language models, and program synthesis to learn programs representing individual preferences. We evaluate SYNAPSE through extensive experimentation, including a user case study focusing on mobility-related concepts in mobile robotics and autonomous driving. Our evaluation demonstrates that SYNAPSE significantly outperforms existing baselines as well as its own ablations.

Demos

Approach

At a high level, the learning procedure consists of three steps:

Concept Library Update: SYNAPSE-Learn first checks whether the existing concept library C is sufficient for learning the desired preference evaluation function. For example, if the natural language explanation uses the term "far away" but the concept library does not contain a suitable definition, SYNAPSE-Learn interactively queries the user for clarification and updates its concept library as needed.
Program Sketch Synthesis: If the concept library is sufficient for representing the preference, SYNAPSE-Learn proceeds to synthesize a program sketch, that is, a program whose constants are left as holes to be filled in later. A sketch is used because the user's natural language explanation is often sufficient to determine the general structure of the preference evaluation function but not its numeric parameters, which can only be accurately learned from the physical demonstrations.
Parameter Synthesis: We use all physical demonstrations provided thus far to synthesize the unknown numeric parameters of the sketch using a constraint-solving approach. For example, if the user's NL explanation mentions "not too close to the sidewalk", the physical demonstrations are needed to understand what the user considers "too close". Thus, a separate parameter synthesis procedure is needed to determine suitable numeric parameters from the physical demonstrations.
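
The three steps above can be illustrated with a minimal Python sketch. It is not the SYNAPSE implementation: the function names, the demonstration format, and the single-threshold sketch for "not too close to the sidewalk" are all assumptions made for illustration, and the constraint solver is replaced by a simple interval check that only works for one threshold parameter.

```python
from typing import Optional, Sequence, Set, Tuple

# Hypothetical demonstration format: (distance to sidewalk in meters, user label).
Demo = Tuple[float, bool]


def missing_concepts(required: Set[str], library: Set[str]) -> Set[str]:
    """Step 1 (illustrative): concepts the explanation needs but the library lacks."""
    return required - library


def sketch(distance_m: float, theta: float) -> bool:
    """Step 2 (illustrative): a program sketch with one hole, theta.

    Encodes 'not too close to the sidewalk' as distance > theta.
    """
    return distance_m > theta


def synthesize_theta(demos: Sequence[Demo]) -> Optional[float]:
    """Step 3 (illustrative): fill the hole from labeled demonstrations.

    Every positive demo requires theta < distance and every negative demo
    requires theta >= distance. Returns the midpoint of the feasible
    interval, or None if the demos are under-constrained or inconsistent.
    """
    positives = [d for d, good in demos if good]
    negatives = [d for d, good in demos if not good]
    if not positives or not negatives:
        return None  # one side of the interval is unbounded
    lower = max(negatives)  # theta must be at least this large
    upper = min(positives)  # theta must be strictly below this
    if lower >= upper:
        return None  # no single threshold satisfies all demos
    return (lower + upper) / 2.0


library = {"sidewalk", "distance_to"}
required = {"sidewalk", "distance_to", "far_away"}
print(missing_concepts(required, library))  # {'far_away'}

demos = [(0.2, False), (0.5, False), (1.5, True), (2.0, True)]
theta = synthesize_theta(demos)
print(theta)               # 1.0
print(sketch(1.2, theta))  # True
```

In this toy setting, a non-empty result from `missing_concepts` would correspond to querying the user for a definition, and a `None` result from `synthesize_theta` would signal that the sketch cannot explain the demonstrations seen so far.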

Results

We evaluate SYNAPSE against eight baselines and conduct multiple ablation studies to confirm our design choices.

Lifelong Learning Curve

SYNAPSE learns new concepts and synthesizes better parameters as it sees more demonstrations.

User Study

We conduct a case study with SYNAPSE to test whether it can align with the preferences of multiple users.

Latest News

Workshop Paper

Apr 04, 2024

Paper accepted at the Vision-Language Models for Navigation and Manipulation (VLMNM) workshop at ICRA 2024.

BibTeX

@article{modak2024synapse,
        title={SYNAPSE: SYmbolic Neural-Aided Preference Synthesis Engine},
        author={Modak, Sadanand and Patton, Noah and Dillig, Isil and Biswas, Joydeep},
        journal={arXiv preprint arXiv:2403.16689},
        year={2024}}