SQLNet with Custom Inference

W. Ronny Huang // [email protected]

Xin Chen // [email protected]

This repo provides an implementation of the SQLNet neural network for predicting SQL queries on the WikiSQL dataset and on your own custom dataset. The original SQLNet paper is available on arXiv (arXiv:1711.04436).

Custom inference demo

To run a live demo on your own typed questions, first complete the installation and training steps outlined below.

To perform transfer learning, first train the SQLNet weights on the large WikiSQL dataset (details below) and save those weights into the folder saved_model_pretrain_wikisql/. Then, in train_mc.py, uncomment the lines that load the pretrained weights from that saved model and the lines that start fine-tuning.

After transfer learning is finished and the weights are saved into saved_model/, run the following command to do custom inference.

python infer_mc.py --ca

This lets you type your own English questions about your own SQL table and returns the predicted SQL query for each one.
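
To make the returned query concrete, here is a minimal sketch of how a WikiSQL-style structured prediction maps to a SQL string. It assumes the WikiSQL query representation (a selected column index `sel`, an aggregation index `agg`, and a list of `[column, operator, value]` conditions). The function name `to_sql`, the table name, the sample header and the sample question are hypothetical and are not taken from infer_mc.py.

```python
# Operator vocabularies used by the WikiSQL dataset
# (the dataset also lists a placeholder "OP" condition entry, omitted here).
AGG_OPS = ["", "MAX", "MIN", "COUNT", "SUM", "AVG"]
COND_OPS = ["=", ">", "<"]


def to_sql(query, columns, table_name="my_table"):
    """Render a WikiSQL-style query dict as a SQL string (illustrative only)."""
    sel_col = '"{}"'.format(columns[query["sel"]])
    agg = AGG_OPS[query["agg"]]
    select = "{}({})".format(agg, sel_col) if agg else sel_col
    sql = "SELECT {} FROM {}".format(select, table_name)
    conds = []
    for col_idx, op_idx, value in query["conds"]:
        # Quote string values (escaping single quotes); leave numbers bare.
        if isinstance(value, str):
            value = "'{}'".format(value.replace("'", "''"))
        conds.append('"{}" {} {}'.format(columns[col_idx], COND_OPS[op_idx], value))
    if conds:
        sql += " WHERE " + " AND ".join(conds)
    return sql


# Hypothetical table header and prediction for the question
# "How many players are from Spain?"
columns = ["Player", "Country", "Goals"]
predicted = {"sel": 0, "agg": 3, "conds": [[1, 0, "Spain"]]}
print(to_sql(predicted, columns))
```

Keeping the prediction as indices into the table header, and rendering the string only at the end, is what allows the same model output to be applied to any table whose column names are supplied at inference time.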

Installation

The data is in data.tar.bz2. Extract it by running:

tar -xjvf data.tar.bz2

The code is written using PyTorch 0.2.0 in Python 2.7. See the PyTorch website for installation instructions, or run:

conda install pytorch=0.2.0 cuda90 -c pytorch

Install the other dependencies by running:

pip install -r requirements.txt

Downloading the GloVe embedding

Download the pretrained GloVe embedding by running:

bash download_glove.sh

Extract the GloVe embedding for training

Run the following command to process the pretrained GloVe embedding for training:

python extract_vocab.py
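
The idea behind this step can be sketched as follows. GloVe is distributed as a text file with one token per line followed by its vector components, and the sketch keeps only the tokens that appear in a given vocabulary. This is not the code of extract_vocab.py: the function name `filter_glove`, the in-memory sample and the output layout are assumptions made for illustration.

```python
import io


def filter_glove(lines, vocab):
    """Return (word_to_index, vectors) for GloVe rows whose token is in vocab."""
    word_to_index = {}
    vectors = []
    for line in lines:
        parts = line.rstrip().split(" ")
        word, values = parts[0], parts[1:]
        # Keep the first occurrence of each in-vocabulary token.
        if word in vocab and word not in word_to_index:
            word_to_index[word] = len(vectors)
            vectors.append([float(v) for v in values])
    return word_to_index, vectors


# Tiny stand-in for a GloVe file (real files have far more rows and dimensions).
sample = io.StringIO(
    "the 0.1 0.2 0.3\n"
    "player 0.4 0.5 0.6\n"
    "zebra 0.7 0.8 0.9\n"
)
word_to_index, vectors = filter_glove(sample, vocab={"the", "player", "goals"})
print(word_to_index)
print(vectors)
```

Words in the vocabulary that have no GloVe row (here `goals`) are simply absent from the result; how the real script treats such words is not stated in this README.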

Train

The training script is train.py. To see its detailed parameters, run:

python train.py -h

Some typical usages are listed below:

Train a SQLNet model with column attention:

python train.py --ca

Specify a GPU with:

python train.py --ca --gpu=0

Train a SQLNet model with column attention and trainable embedding (requires pretraining without trainable embedding first, i.e., running the --ca command above):

python train.py --ca --train_emb

Pretrain a Seq2SQL model on the re-split dataset:

python train.py --baseline --dataset 1

Train a Seq2SQL model with reinforcement learning after pretraining:

python train.py --baseline --dataset 1 --rl
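
The two-stage recipes above (pretrain first, then continue with --train_emb or --rl) can be chained from a small driver. The sketch below uses only the flags shown in this README. The names `PIPELINES` and `run_pipeline` are hypothetical, appending --gpu to every stage is an assumption, and the function defaults to a dry run that prints the commands instead of executing them.

```python
import subprocess
import sys

# Each pipeline is an ordered list of train.py invocations taken from the commands above.
PIPELINES = {
    "sqlnet": [
        ["train.py", "--ca"],
        ["train.py", "--ca", "--train_emb"],
    ],
    "seq2sql": [
        ["train.py", "--baseline", "--dataset", "1"],
        ["train.py", "--baseline", "--dataset", "1", "--rl"],
    ],
}


def run_pipeline(name, gpu=None, dry_run=True):
    """Run (or, by default, just print) the stages of a pipeline in order."""
    commands = []
    for stage in PIPELINES[name]:
        # Use the interpreter that is running this driver.
        cmd = [sys.executable] + stage
        if gpu is not None:
            # Assumption: --gpu can be combined with the other flags.
            cmd.append("--gpu={}".format(gpu))
        commands.append(cmd)
        if dry_run:
            print(" ".join(cmd))
        else:
            # check_call raises CalledProcessError if a stage exits non-zero,
            # so a failed pretraining run stops the second stage from starting.
            subprocess.check_call(cmd)
    return commands


run_pipeline("sqlnet", gpu=0)
```

Calling `run_pipeline("seq2sql", dry_run=False)` from the repository root would execute both Seq2SQL stages back to back.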

Test

The script for evaluation on the dev and test splits is test.py. The parameters for evaluation are roughly the same as those used for training. For example, the commands for evaluating the models trained with the commands above are:

Test a trained SQLNet model with column attention:

python test.py --ca

Test a trained SQLNet model with column attention and trainable embedding:

python test.py --ca --train_emb

Test a trained Seq2SQL model without RL on the re-split dataset:

python test.py --baseline --dataset 1

Test a trained Seq2SQL model with reinforcement learning:

python test.py --baseline --dataset 1 --rl

Citation of original work

Xiaojun Xu, Chang Liu, Dawn Song. 2017. SQLNet: Generating Structured Queries from Natural Language Without Reinforcement Learning.

Bibtex of original work

@article{xu2017sqlnet,
  title={SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning},
  author={Xu, Xiaojun and Liu, Chang and Song, Dawn},
  journal={arXiv preprint arXiv:1711.04436},
  year={2017}
}
