diixo/build-multi-turn-chatbot-GPT2
Build multi-turn Chatbot GPT-2

The following code trains a multi-turn chatbot model based on the DailyDialog multi-turn dataset and pre-trained GPT-2 model.
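A common way to adapt GPT-2 to multi-turn dialogue is to flatten each conversation into one token sequence, marking the turns with special tokens. The sketch below illustrates the idea; the speaker tokens `<usr>`, `<bot>`, and `<eos>` are assumptions for illustration, not necessarily the scheme this repository uses:

```python
# Sketch: flatten a multi-turn dialogue into a single training string.
# The special tokens <usr>/<bot>/<eos> are illustrative assumptions;
# the repo's actual formatting scheme may differ.
USR, BOT, EOS = "<usr>", "<bot>", "<eos>"

def flatten_dialogue(turns):
    """turns: list of utterances, alternating user/bot, user speaks first."""
    parts = []
    for i, utt in enumerate(turns):
        speaker = USR if i % 2 == 0 else BOT
        parts.append(f"{speaker} {utt.strip()}")
    return " ".join(parts) + f" {EOS}"

dialogue = ["Have you ever been to France?", "Yes, I went there last summer."]
print(flatten_dialogue(dialogue))
# → <usr> Have you ever been to France? <bot> Yes, I went there last summer. <eos>
```

The flattened string would then be tokenized with the GPT-2 tokenizer (after registering the special tokens) and used as a causal-LM training example.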

Multi-turn GPT-2 Dialogue Results

"Q" represents the user's message (your input), and "A" represents the chatbot's response.

Q: Have you ever been to France?
A: Oh, yes, I'Ve been there. I went there last summer.
Q: How was there?
A: It was great!
Q: Which spot was your best?
A: I like the place, don't you?
Q: What your favourite place in France?
A: It's certainly of interest to me.
Q: Best place in France?
A: Yes, it's very famous.
Q: Famous places in France?
A: Yes, like Eiffel Tower, Eiffel Tower and so on.
Q: What about Louvre?
A: It's very famous, but not very modern.
Q: National French cuisine
A: That's very interesting.
Q: What about National French cuisine
A: It's got ingredients.


Q: Did you see her yesterday?
A: Yes, I did. I saw her with a pretty girl.
Q: Oh where did you see her?                       
A: She was in a cafe. She told me she was going to a concert.
Q: I see. I've tried to contact her by phone, but she never received my call.
A: I'm sure she was in New York. She said she'd be in New York at the latest.
Q: Really? Thank you for your information. I will contact her today. Thanks.
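The transcripts above come from a chat loop that re-feeds the running conversation history to the model each turn. A simplified sketch of such a loop, with a stub standing in for the actual GPT-2 generation call and an assumed history window (the repo's real loop and window size may differ):

```python
# Sketch of a multi-turn chat loop: the full history is re-fed each turn,
# truncated to a fixed window. `generate_reply` is a stub standing in for
# real GPT-2 generation (tokenize history, model.generate(), decode).
MAX_TURNS = 6  # assumed history window, not taken from the repo

def generate_reply(history):
    # Placeholder for the real model call.
    return f"(reply to: {history[-1]})"

def chat_turn(history, user_msg, max_turns=MAX_TURNS):
    history = (history + [user_msg])[-max_turns:]
    reply = generate_reply(history)
    return history + [reply]

history = []
history = chat_turn(history, "Have you ever been to France?")
print(history[-1])
```

Truncating to the last few turns keeps the concatenated prompt inside GPT-2's 1024-token context limit.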

Quick Start 🚀

python3 src/run/train.py --config config/project.yaml --mode train
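The keys below are illustrative guesses at what `config/project.yaml` might contain (model name, device, data paths); they are assumptions, not the repository's actual schema — check the real file before editing:

```yaml
# Hypothetical config sketch -- key names are assumptions, not the repo's schema.
model: gpt2
device: cuda          # cpu | cuda | mps
epochs: 10
batch_size: 16
data:
  train: data/train.txt
  validation: data/valid.txt
  test: data/test.txt
```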

Supported Devices

  • CPU, GPU, multi-GPU (DDP), and MPS (Apple Silicon Macs, requires torch>=1.12.0)

Supported Models

  • Pre-trained GPT-2 from Hugging Face ("gpt2").

Supported Tokenizer

  • Pre-trained GPT-2 tokenizer from Hugging Face.

Base Dataset 📚

  • DailyDialog multi-turn dataset.
  • If you want to use custom data, set the train/validation/test data paths in config/project.yaml, and implement your custom tokenizer and data-loading logic in src/trainer/build.py.
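For custom data, one plausible on-disk format (an assumption for illustration, not the repository's actual format) is one conversation per line with turns separated by tabs. A loading sketch:

```python
# Sketch: parse custom dialogue data where each line is one conversation
# and turns are separated by tabs. This format is an assumption -- the
# real loading logic belongs in src/trainer/build.py.
def load_dialogues(lines, sep="\t"):
    """lines: iterable of strings (e.g. an open file), one dialogue per line."""
    dialogues = []
    for line in lines:
        turns = [t.strip() for t in line.split(sep) if t.strip()]
        if turns:
            dialogues.append(turns)
    return dialogues

# Usage with a file: load_dialogues(open("data/train.txt", encoding="utf-8"))
print(load_dialogues(["Hi there!\tHello, how are you?\n"]))
```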

Tutorials & Documentation 🔍

Please follow the steps below to train a multi-turn chatbot model.

  1. Getting Started
  2. Data Preparation
  3. Training
  4. ETC

Training Results 📣

Results of GPT-2-based Multi-turn Chatbot Model

  • BLEU Score History (training curve plot not shown)

  • NIST Score History (training curve plot not shown)
  • Test Set Scores
    Scores on the test set for the checkpoint that achieved the highest metric on the validation set.

    • BLEU-2: 0.4052
    • BLEU-4: 0.2268
    • NIST-2: 5.1797
    • NIST-4: 5.5162
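BLEU-n above is the geometric mean of clipped 1..n-gram precisions multiplied by a brevity penalty. A self-contained sketch of sentence-level BLEU-2 (evaluation toolkits typically add smoothing and corpus-level aggregation, so exact numbers may differ from the table above):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    # Count all contiguous n-grams in the token list.
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu(hypothesis, reference, max_n=2):
    """Sentence-level BLEU: geometric mean of clipped n-gram precisions
    times a brevity penalty. No smoothing, so zero overlap gives 0."""
    hyp, ref = hypothesis.split(), reference.split()
    log_prec = 0.0
    for n in range(1, max_n + 1):
        hyp_ngr, ref_ngr = ngrams(hyp, n), ngrams(ref, n)
        clipped = sum(min(c, ref_ngr[g]) for g, c in hyp_ngr.items())
        total = max(sum(hyp_ngr.values()), 1)
        if clipped == 0:
            return 0.0
        log_prec += math.log(clipped / total) / max_n
    bp = 1.0 if len(hyp) > len(ref) else math.exp(1 - len(ref) / max(len(hyp), 1))
    return bp * math.exp(log_prec)

print(round(bleu("the cat sat on the mat", "the cat sat on the mat"), 4))
# → 1.0
```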
