项目作者: finalfusion

项目描述 :
Context-sensitive word embeddings with subwords. In Rust.
高级语言: Rust
项目地址: git://github.com/finalfusion/finalfrontier.git
创建时间: 2018-06-02T08:47:45Z
项目社区:https://github.com/finalfusion/finalfrontier

开源协议:Other

下载


Crate
Docs
Build Status

finalfrontier

Introduction

finalfrontier is a Rust program for training word embeddings.
finalfrontier currently has the following features:

  • Models:
    • skip-gram (Mikolov et al., 2013)
    • structured skip-gram (Ling et al., 2015)
    • directional skip-gram (Song et al., 2018)
    • dependency (Levy and Goldberg, 2014)
  • Output formats:
    • finalfusion
    • fastText
    • word2vec binary
    • word2vec text
    • GloVe text
  • Noise contrastive estimation (Gutmann and Hyvärinen, 2012)
  • Subword representations (Bojanowski et al., 2016)
  • Hogwild SGD (Recht et al., 2011)
  • Quantized embeddings through the finalfusion quantize
    command.

The trained embeddings can be stored in the versatile finalfusion
format, which can be read and used with the
finalfusion crate
and the
finalfusion
Python module.

The minimum required Rust version is currently 1.70.

Where to go from here