References

glo

Hierarchical classification module based on scikit-learn's interfaces and conventions. https://github.com/globality-corp/sklearn-hierarchical-classification.

BKL09

Steven Bird, Ewan Klein, and Edward Loper. Natural language processing with Python: analyzing text with the natural language toolkit. " O'Reilly Media, Inc.", 2009.

EP99

Theodoros Evgeniou and Massimiliano Pontil. Support vector machines: theory and applications. In Advanced Course on Artificial Intelligence, 249–257. Springer, 1999.

GL20

Eleonora Giunchiglia and Thomas Lukasiewicz. Coherent hierarchical multi-label classification networks. arXiv preprint arXiv:2010.10151, 2020.

MB18

Luca Masera and Enrico Blanzieri. AWX: an integrated approach to hierarchical-multilabel classification. In Michele Berlingerio, Francesco Bonchi, Thomas Gärtner, Neil Hurley, and Georgiana Ifrim, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2018, Dublin, Ireland, September 10-14, 2018, Proceedings, Part I, volume 11051 of Lecture Notes in Computer Science, 322–336. Springer, 2018. doi:10.1007/978-3-030-10925-7\_20.

PVG+11

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn: machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011.

SDCW19

Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. Distilbert, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR, 2019. URL: http://arxiv.org/abs/1910.01108, arXiv:1910.01108.

WCB18

Jonatas Wehrmann, Ricardo Cerri, and Rodrigo Barros. Hierarchical multi-label classification networks. In International Conference on Machine Learning, 5075–5084. PMLR, 2018.

Zha04

Tong Zhang. Solving large scale linear prediction problems using stochastic gradient descent algorithms. In Proceedings of the twenty-first international conference on Machine learning, 116. 2004.