References

glo: Hierarchical classification module based on scikit-learn's interfaces and conventions. https://github.com/globality-corp/sklearn-hierarchical-classification.
BKL09: Steven Bird, Ewan Klein, and Edward Loper. Natural language processing with Python: analyzing text with the natural language toolkit. O'Reilly Media, Inc., 2009.
EP99: Theodoros Evgeniou and Massimiliano Pontil. Support vector machines: theory and applications. In Advanced Course on Artificial Intelligence, 249–257. Springer, 1999.
GL20: Eleonora Giunchiglia and Thomas Lukasiewicz. Coherent hierarchical multi-label classification networks. arXiv preprint arXiv:2010.10151, 2020.
MB18: Luca Masera and Enrico Blanzieri. AWX: an integrated approach to hierarchical-multilabel classification. In Michele Berlingerio, Francesco Bonchi, Thomas Gärtner, Neil Hurley, and Georgiana Ifrim, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2018, Dublin, Ireland, September 10-14, 2018, Proceedings, Part I, volume 11051 of Lecture Notes in Computer Science, 322–336. Springer, 2018. doi:10.1007/978-3-030-10925-7_20.
PVG+11: F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn: machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011.
SDCW19: Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR, 2019. URL: http://arxiv.org/abs/1910.01108, arXiv:1910.01108.
WCB18: Jonatas Wehrmann, Ricardo Cerri, and Rodrigo Barros. Hierarchical multi-label classification networks. In International Conference on Machine Learning, 5075–5084. PMLR, 2018.
Zha04: Tong Zhang. Solving large scale linear prediction problems using stochastic gradient descent algorithms. In Proceedings of the Twenty-First International Conference on Machine Learning, 116. 2004.