Training three different RNN models on a portion of Penn Treebank data to perform POS-tagging
Training three different RNN models on a portion of Penn Treebank data to perform POS-tagging
This repository contains a Python Notebook project I developed for the Natural Language Processing exam in the AI Master Course I’m attending.
The aim of the project is performing POS-tagging (i.e. sequence labelling) on a portion of the Penn Treebank data, made public by nltk.org and available on this page.
I performed this task through the help of GloVe embeddings, making use of the Gensim library to download the model.
This notebook provides a keras
implementation of three RNN models with the purpose of performing sequence labelling, in particular POS-tagging.
The main outline of the process is the following:
pandas.DataFrame
object.