Official repository for "Dialogue Systems for Emotional Support via Value Reinforcement"

holi-lab/ES-Value

Dialogue Systems for Emotional Support via Value Reinforcement

This is the official repository for the paper:
"Dialogue Systems for Emotional Support via Value Reinforcement"


🚀 Training

Overview of the framework

The framework consists of three key components:

  1. Target Value Detector
    Identifies which human values to reinforce at each turn.

  2. Reference Generator
    Generates reference utterances designed to reinforce these values in the seeker.

  3. Supporter Model
    Determines the support strategy and the response, conditioned on the target values and the generated references.

✅ Target Value Detector

  • Training Command
    bash tvd_sft.sh

✅ Reference Generator

  • Training Command
    bash rg_sft.sh
    bash rg_dpo.sh

✅ Supporter Model

  • Training Command
    bash sptr_sft.sh
    bash sptr_dpo.sh
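
The five training commands above can be chained into one run. The following is a minimal sketch, not a script shipped with the repository. It assumes that all scripts sit in the current directory, that each stage should finish before the next begins, and that SFT precedes DPO for each component, as the listing order suggests. `DRY_RUN` and `run_stages` are hypothetical names introduced here for illustration.

```shell
#!/bin/sh
# Sketch only: runs the training scripts named in this README, in the order listed.
# Assumptions (not confirmed by the repository): all scripts sit in the current
# directory, and each stage should finish before the next one starts.
set -eu

# Stage order as listed above: detector, then reference generator, then supporter.
STAGES="tvd_sft.sh rg_sft.sh rg_dpo.sh sptr_sft.sh sptr_dpo.sh"

# DRY_RUN is a hypothetical switch: 1 (default) only prints the plan, 0 executes it.
DRY_RUN="${DRY_RUN:-1}"

run_stages() {
  for stage in $STAGES; do
    if [ "$DRY_RUN" = "1" ]; then
      echo "would run: bash $stage"
    else
      echo "running: bash $stage"
      bash "$stage"   # with `set -e`, a failing stage stops the run here
    fi
  done
}

run_stages
```

By default the sketch only prints the planned commands; setting `DRY_RUN=0` in the environment executes them. Stopping at the first failure avoids starting a DPO stage when the SFT stage before it did not complete.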

🧪 Simulation

After training all three components (target value detector, reference generator, and supporter model), you can simulate conversations with the seeker simulator, using the seeker personas from the test dataset.

  • Run Simulation
    bash simulation.sh
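
Simulated conversations are easier to inspect later if each run's output is saved. The sketch below wraps any command, such as the simulation command above, and stores its output in a timestamped log file. `run_logged` and the `logs` directory are hypothetical and not part of the repository; where `simulation.sh` itself writes its results is not stated here.

```shell
# Hypothetical helper, not part of the repository.
# Runs a command, saves its combined stdout/stderr to a timestamped log file,
# prints that output, and returns the command's exit status.
run_logged() {
  log_dir=$1
  shift
  mkdir -p "$log_dir"
  log_file="$log_dir/run_$(date +%Y%m%d_%H%M%S).log"
  status=0
  "$@" > "$log_file" 2>&1 || status=$?
  cat "$log_file"
  echo "log saved to $log_file"
  return "$status"
}

# Example (assumes simulation.sh is in the current directory):
#   run_logged logs bash simulation.sh
```

The helper captures the output first and prints it afterwards, so long runs show nothing until they finish; it trades live output for a simple, portable way to keep the exit status.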
