Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md
main.py	main.py
misc.py	misc.py
options.py	options.py

Name

Last commit message

Last commit date

main.py

misc.py

options.py

Sequence Kernel Networks

This example trains a multi-layer string kernel network on Stanford Sentiment Treebank (SST).

Data can be found here

Results

Fine-grained classification	Dev acc.	Test acc.
d=200, dropout 0.35, rnn dropout 0.2, lr decay 0.95	53.7 (±0.5)	52.4 (±0.5)
Binary classification
d=200, dropout 0.35, rnn dropout 0.1, lr decay 0.95	90.1 (±0.5)	89.6 (±0.3)

We use a 3-layer network with around 540k parameters. Glove word embeddings are normalized to unit vectors and fixed during training and testing.

Usage

Code requires Theano, and has been tested on Theano 0.9.0

python main.py --help gives the following arguments:

optional arguments:
  --train            training set
  --dev              validation set
  --test             test set
  --hidden_dim, -d   hidden dimension
  --learning_rate    learning rate
  --activation       type of activation (none, relu, tanh etc.)
  --batch_size       mini batch size
  --depth            number of stacking recurrent layers
  --dropout          dropout rate between layers
  --rnn_dropout      variational dropout within RNN cells
  --highway          whether to use highway connections (0 or 1)
  --lr_decay         decrease learning rate by this factor after each epoch
  --multiplicative   whether to use multiplicative KNN or additive KNN (0 or 1)
  --max_epoch        maxmimum number of training epochs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Sequence Kernel Networks

Results

Usage

FilesExpand file tree

sst

Directory actions

More options

Directory actions

More options

Latest commit

History

sst

Folders and files

parent directory

README.md

Sequence Kernel Networks

Results

Usage