Dialogue Systems for Emotional Support via Value Reinforcement

This is the official repository for the paper:
"Dialogue Systems for Emotional Support via Value Reinforcement"

🚀 Training

The framework consists of three key components:

Target Value Detector
Identifies which human values to reinforce at each turn.
Reference Generator
Generates utterances that promote these values from the seeker.
Supporter Model
Determines appropriate strategies and responses based on values and references.

✅ Target Value Detector

Training Command
```
bash tvd_sft.sh
```

✅ Reference Generator

Training Command
```
bash rg_sft.sh
bash rg_dpo.sh
```

✅ Supporter Model

Training Command
```
bash sptr_sft.sh
bash sptr_dpo.sh
```

🧪 Simulation

After training all three components (target value detector, reference generator, and supporter model), you can simulate a conversation with the seeker simulator using the seeker personas (test dataset).

Run Simulation
```
bash simulation.sh
```

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
figures		figures
persona		persona
simulation		simulation
training		training
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dialogue Systems for Emotional Support via Value Reinforcement

🚀 Training

✅ Target Value Detector

✅ Reference Generator

✅ Supporter Model

🧪 Simulation

About

Uh oh!

Releases

Packages

Languages

holi-lab/ES-Value

Folders and files

Latest commit

History

Repository files navigation

Dialogue Systems for Emotional Support via Value Reinforcement

🚀 Training

✅ Target Value Detector

✅ Reference Generator

✅ Supporter Model

🧪 Simulation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages