mrm8488/bert-spanish-cased-finetuned-ner cover image

mrm8488/bert-spanish-cased-finetuned-ner

This paper presents a fine-tuned Spanish BERT model (BETO) for the Named Entity Recognition (NER) task. The model was trained on the CONLL Corpora ES dataset and achieved an F1 score of 90.17%. The authors also compared their model with other state-of-the-art models, including a multilingual BERT and a TinyBERT model, and demonstrated its effectiveness in identifying entities in Spanish text.

This paper presents a fine-tuned Spanish BERT model (BETO) for the Named Entity Recognition (NER) task. The model was trained on the CONLL Corpora ES dataset and achieved an F1 score of 90.17%. The authors also compared their model with other state-of-the-art models, including a multilingual BERT and a TinyBERT model, and demonstrated its effectiveness in identifying entities in Spanish text.

Public
$0.0005 / sec
Web inference not supported yet, please check API tab

Spanish BERT (BETO) + NER

This model is a fine-tuned on NER-C version of the Spanish BERT cased (BETO) for NER downstream task.

Details of the downstream task (NER) - Dataset

I preprocessed the dataset and split it as train / dev (80/20)

Dataset# Examples
Train8.7 K
Dev2.2 K
B-LOC
B-MISC
B-ORG
B-PER
I-LOC
I-MISC
I-ORG
I-PER
O

Metrics on evaluation set:

Metric# score
F190.17
Precision89.86
Recall90.47

Comparison:

Model# F1 scoreSize(MB)
bert-base-spanish-wwm-cased (BETO)88.43421
bert-spanish-cased-finetuned-ner (this one)90.17420
Best Multilingual BERT87.38681
TinyBERT-spanish-uncased-finetuned-ner70.0055

Created by Manuel Romero/@mrm8488

Made with in Spain