项目作者: guojson

项目描述 :
Annotation tool for Chinese named entity recognition(中文命名实体识别工具,英文通用)
高级语言: Python
项目地址: git://github.com/guojson/ChineseNERAnno.git
创建时间: 2019-12-28T04:17:15Z
项目社区:https://github.com/guojson/ChineseNERAnno

开源协议:

下载


ChineseNERAnno

Annotation tool for Chinese named entity recognition(中文命名实体标注工具,英文通用)


  1. |-- ChineseNERAnno #the name of this tool
  2. |--configs # the configuration files for this tool
  3. |--data # the raw and annoted datas
  4. |--utils #
  5. |--anno_main.py # the main file for ChineseNERAnno

menu

  1. |--文件
  2. |--打开
  3. |--设置
  4. |--保存
  5. |--编辑
  6. |--控制面板
  7. |--检测
  8. |--检测长度
  9. |--拆分句子
  10. |--分割数据集
  11. |--转为实体
  12. |--转为纯文本
  13. |--定位
  14. |--查找和替换
  15. |--格式
  16. |--BMES
  17. |--BIOES
  18. |--分词

operation process

alt

main windows

alt

keyboard

控制面板主要用于显示预先定义的实体类别,便于实体划分。具体的实体类别可在“文件-设置”中设置,或在数据表category中设置。 \
alt alt

operation

  • run the main file
    1. python anno_main.py
  • open the file

    1. '文件-->打开'

    alt

  • label the entity

alt

  • export the entities with BIO format

alt \

  • the example for the export dataset:
    1. B-DRUG
    2. I-DRUG
    3. I-DRUG
    4. I-DRUG
    5. O
    6. O
    7. O
    8. O
    9. O
    10. O
    11. 广 O
    12. O
    13. O
    14. O
    15. O
    16. B-CLA
    17. I-CLA
    18. I-CLA

update

In the future, we will continue to add entity relationship annotation. Other functions continue to improve, please wait patiently

reference

  1. YEDDA: https://github.com/jiesutd/YEDDA.git

Cited

  1. @article{GUO2020105830,
  2. title = "Chinese agricultural diseases and pests named entity recognition with multi-scale local context features and self-attention mechanism",
  3. journal = "Computers and Electronics in Agriculture",
  4. volume = "179",
  5. pages = "105830",
  6. year = "2020",
  7. issn = "0168-1699",
  8. doi = "https://doi.org/10.1016/j.compag.2020.105830",
  9. }