Applying transformer models to a question answering task

Data used

The data comes from the Kaggle Tweet Sentiment Extraction competition: https://www.kaggle.com/c/tweet-sentiment-extraction

Problem

Given a tweet and its sentiment label, extract the phrase in the tweet that supports that sentiment.

input = {'sentence': 'soo sad I missed you here in san diego', 'sentiment': 'negative'}
expected output = {'target': 'soo sad'}

Evaluation

Word-level Jaccard similarity between the target phrase and the predicted phrase.


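    def jaccard(str1, str2):
        # Compare the two phrases as sets of lowercased, whitespace-separated words.
        a = set(str1.lower().split())
        b = set(str2.lower().split())
        c = a.intersection(b)
        # Size of the intersection divided by size of the union.
        return float(len(c)) / (len(a) + len(b) - len(c))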
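
As a quick check of the metric, the sketch below scores the example tweet's target against an exact prediction and against a prediction that is two words too long. The function is restated under the name `jaccard` only so that the snippet runs on its own; the prediction strings are made up for illustration.

```python
def jaccard(str1: str, str2: str) -> float:
    """Word-level Jaccard similarity, case-insensitive."""
    a = set(str1.lower().split())
    b = set(str2.lower().split())
    c = a.intersection(b)
    return float(len(c)) / (len(a) + len(b) - len(c))


target = "soo sad"
exact = jaccard(target, "soo sad")             # identical word sets
partial = jaccard(target, "soo sad I missed")  # 2 shared words, 4 distinct words in total
print(exact, partial)
```

Because the score works on word sets, word order and repeated words do not affect it, and extra predicted words are penalized through the union in the denominator.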

Approach

A straightforward way to approach this problem is to run a recurrent neural network over character-level embeddings and predict which characters of the input appear in the output.
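
One possible way to build training targets for such a character-level model is to give every character of the tweet a 0/1 label. The sketch below is an assumption about how that labeling could look, not the code used in the notebooks; `char_labels` is a hypothetical helper, and the recurrent network itself is omitted.

```python
from typing import List


def char_labels(text: str, selected_text: str) -> List[int]:
    """Return one 0/1 label per character of `text`; 1 marks the selected span."""
    start = text.find(selected_text)
    if start == -1:
        raise ValueError("selected_text does not occur in text")
    end = start + len(selected_text)
    return [1 if start <= i < end else 0 for i in range(len(text))]


text = "soo sad I missed you here in san diego"
labels = char_labels(text, "soo sad")

# Decoding: keep the characters whose label is 1.
recovered = "".join(ch for ch, y in zip(text, labels) if y == 1)
print(recovered)
```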

NER approach: Formulate the task as a named entity recognition task. Every word in the output is marked as an entity to be recognized in the input, so the model predicts words instead of characters.
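
The word-level labeling can be sketched as follows. This is a minimal illustration under the assumption that the selected phrase aligns with whole whitespace-separated words of the tweet; `word_tags` is a hypothetical helper, and the tagging model that would be trained on these labels is not shown.

```python
from typing import List, Tuple


def word_tags(text: str, selected_text: str) -> List[Tuple[str, int]]:
    """Pair each word of `text` with 1 if it belongs to the selected phrase, else 0."""
    words = text.split()
    target = selected_text.split()
    n = len(target)
    # Slide a window over the tweet to locate the phrase as a run of whole words.
    for i in range(len(words) - n + 1):
        if words[i:i + n] == target:
            return [(w, 1 if i <= j < i + n else 0) for j, w in enumerate(words)]
    raise ValueError("selected_text is not a run of whole words in text")


tags = word_tags("soo sad I missed you here in san diego", "soo sad")
predicted_phrase = " ".join(word for word, tag in tags if tag == 1)
print(tags[:3], predicted_phrase)
```

If a selected phrase starts or ends in the middle of a word, this helper raises an error; such rows would need a fallback such as character-level alignment.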

QA approach: Formulate the task as a question answering problem, with

sentiment as the question,
text as the context,
selected_text as the answer,

and apply BERT-based models such as RoBERTa and ALBERT, which were state-of-the-art at the time of writing.
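
The mapping above can be sketched as a conversion of one training row into a question/context/answer record. The field names below mirror the common SQuAD-style layout and are an assumption, as is the helper name `to_qa_example`; the notebooks may organize the data differently.

```python
from typing import Any, Dict


def to_qa_example(text: str, sentiment: str, selected_text: str) -> Dict[str, Any]:
    """Build a SQuAD-style record: sentiment = question, tweet = context."""
    answer_start = text.find(selected_text)  # character offset of the answer
    if answer_start == -1:
        raise ValueError("selected_text does not occur in text")
    return {
        "question": sentiment,
        "context": text,
        "answers": {"text": [selected_text], "answer_start": [answer_start]},
    }


example = to_qa_example(
    text="soo sad I missed you here in san diego",
    sentiment="negative",
    selected_text="soo sad",
)

# The answer can be recovered from the context using the stored offset.
start = example["answers"]["answer_start"][0]
answer = example["context"][start:start + len(example["answers"]["text"][0])]
print(example["question"], "->", answer)
```

An extractive QA model trained on such records predicts a start and an end position in the context, which is what makes this formulation a natural fit for span extraction.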

Judging by the notebooks on the competition page, BERT-based models seem to outperform the other approaches.

Models implemented

Apply the ALBERT model and record the cross-validation (CV) score
Apply the BERT model and record the cross-validation (CV) score
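
Recording a CV score could look roughly like the sketch below: split the rows into folds, fit on the remaining rows, and average the Jaccard score on each held-out fold. Everything here is an assumption for illustration. `fit_whole_tweet_baseline` is a hypothetical stand-in for training ALBERT or BERT, `k_fold_indices` and `cv_score` are hypothetical helpers, and all rows except the first are made-up toy data.

```python
from statistics import mean


def jaccard(str1, str2):
    a = set(str1.lower().split())
    b = set(str2.lower().split())
    c = a.intersection(b)
    return float(len(c)) / (len(a) + len(b) - len(c))


def k_fold_indices(n_samples, n_folds):
    """Split range(n_samples) into n_folds contiguous validation folds."""
    folds, start = [], 0
    for i in range(n_folds):
        size = n_samples // n_folds + (1 if i < n_samples % n_folds else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def fit_whole_tweet_baseline(train_rows):
    """Hypothetical stand-in for model training: always predicts the full tweet."""
    return lambda text, sentiment: text


def cv_score(rows, fit, n_folds):
    """Mean over folds of the mean validation Jaccard score."""
    fold_scores = []
    for val_idx in k_fold_indices(len(rows), n_folds):
        held_out = set(val_idx)
        train_rows = [r for i, r in enumerate(rows) if i not in held_out]
        predict = fit(train_rows)
        fold_scores.append(mean(
            jaccard(rows[i]["selected_text"], predict(rows[i]["text"], rows[i]["sentiment"]))
            for i in val_idx
        ))
    return mean(fold_scores)


rows = [
    {"text": "soo sad I missed you here in san diego", "sentiment": "negative", "selected_text": "soo sad"},
    {"text": "what a great day", "sentiment": "positive", "selected_text": "great"},      # toy row
    {"text": "heading to work", "sentiment": "neutral", "selected_text": "heading to work"},  # toy row
    {"text": "this is awful", "sentiment": "negative", "selected_text": "awful"},         # toy row
]
score = cv_score(rows, fit_whole_tweet_baseline, n_folds=2)
print(round(score, 4))
```

In practice the folds would usually be shuffled and stratified by sentiment, and `fit` would fine-tune the transformer on the training rows of each fold.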
