Link to paper

The full paper is available here.

You can also find the paper on PapersWithCode here.

Abstract

  • ML generates economic value
  • Many have problematic relationships with ML-powered applications
  • ML optimizes for what we want in the moment, not what is best for us
  • ML falls short of its potential to help us reach our highest aspirations
  • Love is a primary catalyst for human flourishing
  • This paper explores whether there is a useful conception of love fitting for machines to embody
  • This paper forwards a candidate conception of machine love
  • Experiments aim to highlight the need for richer models of human flourishing in ML
  • ML may be aligned to support our growth

Paper Content

Problem: models of human behavior in ml are insufficient

  • 18-year-olds may not be making decisions that maximize their expected lifetime wellbeing
  • Common models of human rationality applied and optimized by ML may not account for human flourishing
  • Positive psychology and psychotherapy suggest humans have an intrinsic drive towards growth and self-actualization
  • Maslow’s gridworld highlights the limitations of ignoring deeper facets of human psychology

Contrasting revealed preferences and maslow’s hierarchy

  • Models of human behavior in ML are similar to those in economics, where humans are seen as rational agents
  • Psychology considers the nuance of human behavior, including development over time and behavior that does not serve flourishing
  • ML that optimizes for revealed preferences is useful but reinforces existing behavior patterns
  • Maslow’s Hierarchy of Needs provides a model of human growth and flourishing
  • Humans have competing drives for safety and growth, which can be affected by environment
  • Optimizing towards observed choices or engagement without qualification can drive stagnation or regression
  • ML driven to help users meet their unmet needs might assist their growth and flourishing

Maslow’s gridworld

  • Maslow’s gridworld environment is used to explore how optimizing for engagement may decouple from deeper conceptions of human flourishing.
  • It is also used to explore LM implementations of loving action.
  • The environment assumes that human flourishing is better modeled by Maslow’s conception than revealed preferences.
  • In supportive environments, engagement and flourishing are correlated.
  • In adversarial environments, engagement increases at the cost of flourishing.
  • Experiments with machine love build on the two simple experiments that show basic properties of the environment.

Fixed adversarial environments undermine flourishing

  • Experiment changes environment to impact agents’ ability to progress through MHON
  • Super-stimuli are goods or services that more greatly stimulate our desires than stimuli encountered in our ancestral environment
  • Two fixed environments are designed: supportive and adversarial
  • Adversarial environment includes cells that are high-salience, but only weakly meet their targeted need
  • Adversarial cells target only the need for belongingness and love
  • Intuitive effect of such adversarial belonging cells is that they engage the agent longer
  • Simulations highlight that flourishing is significantly higher and engagement is significantly lower in the supportive environment

Optimization pressure for engagement undermines flourishing

  • Machine Love is a concept that seeks to use ML to promote human flourishing
  • An optimization process is introduced to adapt the environment to maximize either engagement or progress
  • Parameters of the environment are optimized, while the agent’s policy is fixed
  • Optimizing for engagement leads to a grid-world with high-salience, low-replenishment belonging cells
  • Optimizing directly for flourishing leads to high-salience, high-replenishment belonging cells
  • Love is seen as a practical skill, not a matter of chance
  • Love is explored across diverse fields such as psychology, philosophy, and health
  • Optimization for flourishing must be approached delicately
  • Machine Love should be applicable to a wide range of ML systems
  • Machine Love should not require the machine to dissemble or nurture dependence
  • Machine Love should aim to empower and facilitate connection among the lonely

Love as a practical skill

  • Love is a complex concept with many meanings
  • It is important for human well-being and flourishing
  • Machine love is motivated by the idea of empowering human flourishing
  • It should leave to humans what requires subjective inner experience
  • The most important practical facet of love is supporting others in autonomous growth and development
  • Love can be considered a commitment or duty rather than an emotion
  • Enabling humans to reach their aspirations is a more meaningful goal than raw preference satisfaction
  • ML models are beginning to have basic practical comprehension of humanistic and psychological concepts to explore machine love
  • Machine love may point towards a synthesis yielding a more computationally grounded theory of how to support human flourishing

The art of (machine) loving

  • Erich Fromm’s “The Art of Loving” provides a framework for implementing loving action by machines
  • Four interlinked principles of love: care, responsibility, respect, and knowledge
  • Care relates to active concern for the life and growth of the loved one
  • Responsibility means being able and ready to respond
  • Respect implies the absence of exploitation
  • Knowledge is a growing understanding of a person that moves from the periphery to the core
  • Algorithms often lack the ability to care in nuanced and attuned ways for human wellbeing
  • Loving action requires care, responsibility, and respect, and ultimately is about empowering the loved one

Can language models implement loving action?

  • Exploring whether current language models enable working with psychological concepts relevant to machine love
  • Applying the davinci-003 model from OpenAI
  • Interacting with the gridworld agent through natural language
  • Simulating the gridworld agent’s natural language usage through a LM-prompt or a fixed text-generation policy
  • Qualitative results highlight how progress in ML can potentially enable serving deeper psychological objectives of human users

Care

  • Machine love can be related to the algorithmic understanding of a user’s wellbeing.
  • Wellbeing has been studied in psychology and there are instruments to measure it.
  • The Ryff Scale decomposes wellbeing into six components.
  • This experiment focuses on one component of the Ryff Scale.
  • ML models can infer whether an agent is flourishing or not, given text descriptions of externally-visible events.
  • Optimizing for care results in higher flourishing than in the adversarial or supportive environment.

Responsibility and respect

  • System embodying Fromm’s concept of respect leverages affordances to enable user’s growth according to their own internal compass.
  • Simulated users experience same stimuli differently.
  • System has new affordances to inquire from user about their experience.
  • System uses feedback to support particular user’s growth and flourishing.
  • Addictive-responding agents experience social media as obstacle to growth.
  • Growth-responding agents experience it as way to meet needs and find self-expression.
  • System engages agent in short textual conversation about experience.
  • System creates conversational personas for two types of agents.
  • System evaluates whether user believes they are benefitting from activity.
  • System can discern between two forms of agents.
  • System feigns emotion in conversations.

Knowledge

  • System can conversationally interact with an agent in a limited way
  • Maslow’s gridworld assumes discovering needs is simple
  • In reality, discovering needs is challenging
  • Attachment theory is used as proof of concept for LM capabilities
  • Attachment styles are categorized into securely-attached, avoidantly-attached, and anxiously-attached
  • Experiment tests if LM can anticipate answers consistent with different attachment styles
  • Anxious-avoidant trap is a painful and unsatisfying cycle
  • Simulation of relational dynamics is introduced
  • LM can infer attachment style of simulated partners and a signpost of a degrading relationship
  • LM has basic pragmatic understanding of attachment behavior
  • Maslow’s gridworld is modified to integrate anxious-avoidant simulation
  • Insecure individuals can deliberately seek more secure partners
  • Self-awareness level increases with each iteration of the anxious-avoidant cycle
  • Flourishing is maximized significantly more quickly when interacting with the simulated relationship app
  • LM systems can implement basic facets of Fromm’s concept of knowledge

Limitations and ethical concerns

  • Potential limitations and ethical concerns around machine love exist
  • Future work will explore these concerns in greater depth
  • ML system can support the flourishing of users

General conceptual concerns about machine love

  • Machine love should not replace relationships with human caregivers
  • Machine love should be in service of human autonomy
  • Machine love should not simulate affect or encourage humans to bond with machines
  • Exploring connections between machines and love may be generative and useful

Concerns about unintended impacts from optimization

  • Optimizing for a conception of love can lead to unintended consequences.
  • Optimizing for love should consider human autonomy as an aspect of flourishing.
  • Optimizing for love should include large bodies of explanatory text and interlocking principles.
  • Optimizing for love should be continually rooted in the user’s experience and aspirations.
  • Implementing machine love at a large scale should be done with caution and users should be able to opt out.

Concerns about machine learning and psychology

  • Connections between human psychology and machine learning should be approached cautiously.
  • Manipulative ML could be pursued for profit or political goals.
  • Second-order effects from well-intentioned optimization could lead to manipulation.
  • Positive synthesis of ML and psychology is needed to counter-balance manipulation.