Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

The implementation of the model proposed in the Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing paper

Requirements

Python 3+ with pip

Setup

To install the python modules required to train and run the model

pip install -r requirements.txt

Preprocessing

To download and pre-process the multi-domain belief tracking dialogues and pre-trained word embeddings

python preprocess.py

Training

To train the model

python main.py train [--args=value]

Some of these args include:

--num_hid: the size of the hidden layers - default is 50 (the paper uses 100)
--bidir/--no-bidir: bidirectional vs forward only - default is bidirectional (only valid for a RNNs)
--net_type: the type of the feature encoders, gru for GRU, cnn for CNN, lstm for LSTM - default is lstm (See the paper)
--batch_size: the batch size - default is 64
--dev: device use to train the model (cpu or gpu) - default is gpu
--model_url: path to save the model or to resume training - default is models/model-1
--graph_url: path to save the tensorboard graph for evaluation metrics (cross-entropy, accuracy ..etc) - default is graphs/graph-1

For example to train the cnn variant to the model with 8 batch size on the GPU

python main.py train --batch_size=8 --net_type=cnn --dev=gpu

Testing

To evaluate the model on the test dataset

python main.py test [--args=value]

This uses the same arguments as above, except --dev is not included as cpu is used for testing. It generates a log file at results/log-1.txt, which includes the original dialogues with the true labels and the model predictions.

Currently there is a bug that shuffles the model predictions across dialogues. So to go around it, make sure to use a batch size of 1, i.e. --batch_size=1.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
model.py		model.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

Requirements

Setup

Preprocessing

Training

Testing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

osmanio2/multi-domain-belief-tracking

Folders and files

Latest commit

History

Repository files navigation

Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

Requirements

Setup

Preprocessing

Training

Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages