[example] add a deep residual net example #1041
shuokay wants to merge 2 commits into apache:master from shuokay:master
|
Please verify the results on cifar first. |
|
many thanks for the PR. it looks like you are using a simplified resnet? how about refactoring the symbol definition into ps. you may want to choose a better name but i hope to reserve |
|
What are your results on cifar10? If they match the paper, we can merge this in; otherwise we had better find the reasons, to avoid confusing others. |
|
The result in https://github.com/shuokay/mxnet/commit/cd28e159d2509bf8799ad36a3219ecd5ac1b0a4f is "train accuracy around 82.7% and test accuracy 75.6%". |
|
our baseline algorithm on cifar10 gets 90% test accuracy, see https://github.com/dmlc/mxnet/tree/master/example/image-classification#cifar-10 so i think there should be room for improvement |
|
@winstywang @mli Sorry for disturbing you guys. I am busy with work these days, but I may be able to get an improved result this weekend. |
|
I pushed this commit just to set a checkpoint and will not push more commits until I get a reasonable result. |
|
@shuokay not at all, many thanks for your contributions, we are just trying to help improve the results |
|
@shuokay In the paper, I don't see a relu before element-wise addition. But in your implementation, there is one. Did I misread that? |
|
Updated the resnet-small example. There are differences from the paper:
Running this example by ps: I think |
|
@wangg12 you are right, I have updated the example code. |
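The ReLU placement discussed above can be illustrated with a small, framework-agnostic sketch. This is not the PR's MXNet code: `residual_branch` is a hypothetical stand-in for the conv/BN stack F(x), and the toy input values are assumptions chosen only to make the difference visible. It contrasts the ordering described in the paper, y = ReLU(F(x) + x), with the variant that applies an extra ReLU to F(x) before the element-wise addition.

```python
# Framework-agnostic sketch of a residual unit (NOT the PR's MXNet code).
# `residual_branch` is a hypothetical stand-in for the conv/BN stack F(x).

def relu(values):
    """Element-wise ReLU on a list of floats."""
    return [max(0.0, v) for v in values]

def residual_branch(x, scale=-2.0):
    """Hypothetical F(x): a plain linear scaling with no ReLU at its end."""
    return [scale * v for v in x]

def residual_unit_paper(x):
    """Ordering described in the paper: y = ReLU(F(x) + x)."""
    fx = residual_branch(x)
    return relu([f + s for f, s in zip(fx, x)])

def residual_unit_extra_relu(x):
    """Variant questioned in the review: ReLU on F(x) before the addition."""
    fx = relu(residual_branch(x))
    return relu([f + s for f, s in zip(fx, x)])

x = [1.0, -1.0, 0.5]
paper_out = residual_unit_paper(x)
extra_out = residual_unit_extra_relu(x)
print(paper_out)
print(extra_out)
```

With the extra ReLU, negative residuals are clipped before they can subtract from the shortcut, so the two variants generally produce different outputs for the same input.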
|
Closing for now. You are welcome to open a new PR when you get it working. |
Hi, as mentioned in #1022, I have completed a simple residual net. Because I don't have enough GPUs at the moment, I have only tested it on tinyimagenet. I hope this can help you guys brew your own residual net.
@piiswrong @antinucleon @winstywang please review this PR.