Experiments on Basic Vietnamese NLP Problems and a Named Entity Recognition Tool
Vietnamese Named Entity Recognition Tool
Install using npm:
npm install -g @vntk/tagger
Two main features:
PER, ORG, LOC, DATE, TIME, …
Simply run the following command to predict new input from a file:
vntk-tagger predict [your_file_name.txt]
The output is written to a new file named your_file_name.txt.tags
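For scripting, the predict step can be wrapped from Node. The following is a minimal sketch, assuming vntk-tagger is installed globally and available on the PATH. The helper names tagsPathFor and predictFile are hypothetical and not part of the package, and the .tags content is returned as raw text because its layout is not described here.

```javascript
const { execFileSync } = require("child_process");
const fs = require("fs");

// Hypothetical helper: the CLI writes its output next to the input,
// using the input file name with ".tags" appended.
function tagsPathFor(inputPath) {
  return `${inputPath}.tags`;
}

// Hypothetical helper: run `vntk-tagger predict <file>` and return
// the raw text of the generated .tags file.
function predictFile(inputPath) {
  execFileSync("vntk-tagger", ["predict", inputPath], { stdio: "inherit" });
  return fs.readFileSync(tagsPathFor(inputPath), "utf8");
}

// Usage (requires the CLI to be installed):
// console.log(predictFile("your_file_name.txt"));
```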
Prepare your data from sources such as news, Q&A, comments, or chat logs.
Convert the raw data into enriched data by:
Command: vntk-tagger clean [your_data_file.txt]
Run node preprocess.js to clean and convert raw data to IOB format. The result can be fed to the trainer!
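To illustrate what IOB-tagged data looks like, here is a minimal sketch that turns tokens and labeled entity spans into one token-and-tag pair per line. It uses the common IOB2 variant, where the first token of every entity gets a B- prefix, later tokens get I-, and everything else is O. The function toIob, the span shape, and the tab-separated layout are assumptions for illustration; the exact format produced by preprocess.js may differ.

```javascript
// Hypothetical helper: convert tokens plus entity spans to IOB2 lines.
// Each span uses token indices: { start, end (exclusive), label }.
function toIob(tokens, spans) {
  // Every token starts as "O" (outside any entity).
  const tags = new Array(tokens.length).fill("O");
  for (const { start, end, label } of spans) {
    tags[start] = `B-${label}`; // first token of the entity
    for (let i = start + 1; i < end; i++) {
      tags[i] = `I-${label}`; // remaining tokens of the entity
    }
  }
  // One "token<TAB>tag" string per token.
  return tokens.map((token, i) => `${token}\t${tags[i]}`);
}

const tokens = ["Nguyễn", "Văn", "A", "sống", "ở", "Hà", "Nội"];
const spans = [
  { start: 0, end: 3, label: "PER" },
  { start: 5, end: 7, label: "LOC" },
];
console.log(toIob(tokens, spans).join("\n"));
```

Here the three name tokens are tagged B-PER, I-PER, I-PER, the two place tokens B-LOC, I-LOC, and the remaining tokens O.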
Run the following command to train on new NER data:
vntk-tagger train [your_training_file.txt]