Skip to content

Commit f45b368

Browse files
committed
add k-means baseline
1 parent 86f2e85 commit f45b368

File tree

1 file changed

+15
-3
lines changed

1 file changed

+15
-3
lines changed

README.md

Lines changed: 15 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,12 +2,16 @@ Code and data for paper ["Dialog Intent Induction with Deep Multi-View Clusterin
22

33
Data is available in the sub-directory [data](data), with a specific [LICENSE](data/LICENSE) file.
44

5+
Pre-requisites for use
6+
----------------------
7+
8+
- decompress the `.bz2` files in `data`folder
9+
- download http://nlp.stanford.edu/data/glove.840B.300d.zip, and unzip `glove.840B.300d.txt` into `data` folder
10+
511
To run AV-Kmeans
612
----------------
713

8-
- decompress the data files in `data`folder
9-
- download http://nlp.stanford.edu/data/glove.840B.300d.zip, and unzip `glove.840B.300d.txt` into `data` folder
10-
- then run one of:
14+
- run one of:
1115
```
1216
# no pre-training
1317
python train.py --pre-epoch 0 --data-path data/airlines_processed.csv --num-epochs 50 --view1-col first_utterance --view2-col context
@@ -19,3 +23,11 @@ python train.py --pre-model ae --pre-epoch 20 --data-path data/airlines_processe
1923
python train.py --pre-model qt --pre-epoch 10 --data-path data/airlines_processed.csv --num-epochs 50 --view1-col first_utterance --view2-col context
2024
```
2125
- to train on askubuntu, replace `airlines` with `askubuntu` in the above command-lines
26+
27+
To run k-means baseline
28+
-----------------------
29+
30+
- for qt pretraining run:
31+
```
32+
python train_qt.py --data-path data/airlines_processed.csv --pre-epoch 10 --view1-col first_utterance --view2-col context --scenarios view1
33+
```

0 commit comments

Comments
 (0)