Paper: HDLTex: Hierarchical Deep Learning for Text Classification (arXiv:1709.08267)
This model classifies scientific papers according to the Web of Science nomenclature.
It is a fine-tuned model that predicts the 134 classes of the WOS-46985 dataset, published with https://arxiv.org/pdf/1709.08267.pdf.
The data was split 10/90 into validation and training sets (as in https://arxiv.org/pdf/1709.08267.pdf).
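A 10/90 validation/training split can be sketched as follows. This is only an illustration, not the script used for this model: the function name `split_validation_training`, the fixed seed, and the placeholder records are assumptions, and the record count of 46,985 is taken from the dataset name.

```python
import random


def split_validation_training(examples, validation_fraction=0.1, seed=0):
    """Shuffle the examples and return (validation, training) lists."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(examples)  # copy, so the caller's data is not reordered
    rng.shuffle(shuffled)
    n_validation = int(len(shuffled) * validation_fraction)
    return shuffled[:n_validation], shuffled[n_validation:]


# Placeholder records standing in for (abstract, class id) pairs.
examples = [(f"abstract {i}", i % 134) for i in range(46985)]
validation, training = split_validation_training(examples)
print(len(validation), len(training))
```

A purely random split like this does not guarantee that each of the 134 classes appears in both parts in the same proportion; a stratified split would be the usual alternative when that matters.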
Accuracy on the final layer was 83%, compared with 77% for the previous state of the art (https://arxiv.org/pdf/1709.08267.pdf). However, the previous state-of-the-art result was not evaluated on a held-out test set, so the real difference is probably larger.
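For readers unfamiliar with the metric, the sketch below shows how accuracy could be computed at the top level and at the final level of a two-level label hierarchy. It assumes that "final layer" refers to the most specific class of each paper; the `accuracy` helper and the label pairs are hypothetical placeholders, not data or results from this model.

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of positions where the predicted label equals the true label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


# Hypothetical (domain, area) pairs: domain is the top level, area the final level.
true_pairs = [
    ("CS", "Computer vision"),
    ("CS", "Machine learning"),
    ("Medical", "Asthma"),
    ("Psychology", "Depression"),
]
predicted_pairs = [
    ("CS", "Computer vision"),
    ("CS", "Computer vision"),
    ("Medical", "Asthma"),
    ("Psychology", "Schizophrenia"),
]

top_level_accuracy = accuracy(
    [domain for domain, _ in true_pairs],
    [domain for domain, _ in predicted_pairs],
)
final_level_accuracy = accuracy(
    [area for _, area in true_pairs],
    [area for _, area in predicted_pairs],
)
print(top_level_accuracy, final_level_accuracy)
```

In this toy data every domain is predicted correctly while half of the areas are wrong, which illustrates why final-level accuracy is the stricter of the two figures.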
The model is useful for annotating scientific text.