TASS2019

If you use this work, please cite the following references:

@article{GONZALEZ2020102262,
title = "Transformer based contextualization of pre-trained word embeddings for irony detection in Twitter",
journal = "Information Processing & Management",
volume = "57",
number = "4",
pages = "102262",
year = "2020",
issn = "0306-4573",
doi = "https://doi.org/10.1016/j.ipm.2020.102262",
url = "http://www.sciencedirect.com/science/article/pii/S0306457320300200",
author = "José Ángel González and Lluís-F. Hurtado and Ferran Pla",
keywords = "Irony detection, Twitter, Deep learning, Transformer encoders",
abstract = "Human communication using natural language, specially in social media, is influenced by the use of figurative language like irony. Recently, several workshops are intended to explore the task of irony detection in Twitter by using computational approaches. This paper describes a model for irony detection based on the contextualization of pre-trained Twitter word embeddings by means of the Transformer architecture. This approach is based on the same powerful architecture as BERT but, differently to it, our approach allows us to use in-domain embeddings. We performed an extensive evaluation on two corpora, one for the English language and another for the Spanish language. Our system was the first ranked system in the Spanish corpus and, to our knowledge, it has achieved the second-best result on the English corpus. These results support the correctness and adequacy of our proposal. We also studied and interpreted how the multi-head self-attention mechanisms are specialized on detecting irony by means of considering the polarity and relevance of individual words and even the relationships among words. This analysis is a first step towards understanding how the multi-head self-attention mechanisms of the Transformer architecture address the irony detection problem."
}

@article{DBLP:journals/jifs/GonzalezHP20,
  author    = {Jos{\'{e}}{-}{\'{A}}ngel Gonz{\'{a}}lez and
               Llu{\'{\i}}s{-}F. Hurtado and
               Ferran Pla},
  title     = {Self-attention for Twitter sentiment analysis in Spanish},
  journal   = {J. Intell. Fuzzy Syst.},
  volume    = {39},
  number    = {2},
  pages     = {2165--2175},
  year      = {2020},
  url       = {https://doi.org/10.3233/JIFS-179881},
  doi       = {10.3233/JIFS-179881},
  timestamp = {Thu, 10 Sep 2020 16:38:02 +0200},
  biburl    = {https://dblp.org/rec/journals/jifs/GonzalezHP20.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

TASS2019

Este es el modelo que mejor ha funcionado en comparación a los modelos del año pasado (puede ser que ajustando los modelos del año pasado funcionen mejor que este, pero en todas las ejecuciones que he hecho con este modelo y con casi cualquier hiper-parámetro, pasa los 50 de MF1:

SVM-BOW Acc: 53.01 | MF1: 42.89
SVM-BOC Acc: 54.04 | MF1: 39.16
SVM-SumaEmbeddings Acc: 54.91 | MF1: 43.21 (este no estaba en la tabla del año pasado)
Att-BLSTM Acc: Acc: 0.590361 | MF1: 0.488154
CNN Acc: 0.612737 | MF1: 0.476809
DAN (es-run1): Acc: 0.569707 | MF1: 0.482551
Transformer: Acc: 0.595525 | MF1: 0.522083

Mejor modelo (Transformer):

Acc: 0.595525
MF1: 0.522083
MP: 0.529196
MR: 0.521423

Conf Matrix

N 201 30 13 22
NEU 31 29 10 13
NONE 16 10 30 8
P 46 22 14 86

Classification Report

       precision    recall  f1-score   support

       N       0.68      0.76      0.72       266
     NEU       0.32      0.35      0.33        83
    NONE       0.45      0.47      0.46        64
       P       0.67      0.51      0.58       168

micro avg       0.60      0.60      0.60       581
macro avg       0.53      0.52      0.52       581
weighted avg       0.60      0.60      0.59       581

El modelo tiene una sola capa con 6 cabezales de atención. Lo que se muestra son los 6 cabezales para cada muestra (más amarillo más peso, más morado, menos peso).

Algunas cosas que he visto:

El primer cabezal reacciona siempre a los usuarios (token user) y lo que hace referencia a ellos (si no está el token, ni idea)
El 2º cabezal parece reaccionar a palabras de "tiempo" (hola, saludos, manyana, dias, directo, noche, ...), pero no termino de entenderlo
El 5º cabezal reacciona a palabras con polaridades extremas (genial, maravilloso, horrible, ...) (cuando no hay, ni idea)
El 3º cabezal reacciona siempre a las palabra "no", "ni" (en caso de que no estén, no lo entiendo, parece controlar la negación marca los segmentos negados)
El 6º cabezal reacciona a casi todo, menos a determinantes, preposiciones, conjunciones, etc. (generalmente a palabras con "significado")
Si no hay palabras que tienen mucha importancia según el cabezal (negaciones, tiempos, usuarios, etc.) todos parecen reaccionar a palabras con polaridad alta (bien positiva o negativa)
Para las clases NEU y NONE, las atenciones forman patrones complicados de entender, para las P y N suelen marcar palabras con polaridades altas y se entienden mejor las atenciones (aunque en algunos casos, también son complicados).

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
best-models		best-models
figures		figures
tass-corpus		tass-corpus
BuildVocabulary.py		BuildVocabulary.py
CorpusStatistics.py		CorpusStatistics.py
GenerateReports.html		GenerateReports.html
GenerateReports.ipynb		GenerateReports.ipynb
LICENSE		LICENSE
LayerNormalization.py		LayerNormalization.py
Losses.py		Losses.py
Metrics.py		Metrics.py
MultiHeadAttention.py		MultiHeadAttention.py
MyMasking.py		MyMasking.py
PositionalEncoding.py		PositionalEncoding.py
Preprocess.py		Preprocess.py
README.md		README.md
SHT-Test.py		SHT-Test.py
SHT-Train.py		SHT-Train.py
SelfAttention.py		SelfAttention.py
SentenceEncoderBlock.py		SentenceEncoderBlock.py
StringProcessing.py		StringProcessing.py
TransformerEncoder.py		TransformerEncoder.py
Utils.py		Utils.py
Visualization.py		Visualization.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TASS2019

Muestra 12 (Pred:N Truth:N)

Muestra 128 (Pred:NONE Truth:NEU)

Muestra 13 (Pred:NEU Truth:NEU)

Muestra 141 (Pred:N Truth:N)

Muestra 19 (Pred:N Truth:N)

Muestra 222 (Pred:N Truth:N)

Muestra 30 (Pred:N Truth:N)

Muestra 505 (Pred:N Truth:N)

Muestra 508 (Pred:P Truth:N)

Muestra 0 (Pred:N Truth:NONE)

Muestra 1 (Pred:N Truth:N)

Muestra 136 (Pred:N Truth:P)

Muestra 17 (Pred:P Truth:P)

About

Uh oh!

Releases

Packages

Languages

License

jogonba2/TE-TextClassification

Folders and files

Latest commit

History

Repository files navigation

TASS2019

Muestra 12 (Pred:N Truth:N)

Muestra 128 (Pred:NONE Truth:NEU)

Muestra 13 (Pred:NEU Truth:NEU)

Muestra 141 (Pred:N Truth:N)

Muestra 19 (Pred:N Truth:N)

Muestra 222 (Pred:N Truth:N)

Muestra 30 (Pred:N Truth:N)

Muestra 505 (Pred:N Truth:N)

Muestra 508 (Pred:P Truth:N)

Muestra 0 (Pred:N Truth:NONE)

Muestra 1 (Pred:N Truth:N)

Muestra 136 (Pred:N Truth:P)

Muestra 17 (Pred:P Truth:P)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages