:school_satchel: 基于CNN实现的文本分类应用
文本分类应用,采用模型是Text-CNN
中文:
准训练结果:验证集91%左右
中文
x.shape -> (54568, 50)
y.shape -> (54568, 2)
len(vocabulary) -> 52822
len(vocabulary_inv) -> 52822
X_train.shape -> (43654, 50)
y_train.shape -> (43654, 2)
X_test.shape -> (10914, 50)
y_test.shape -> (10914, 2)
英文
x.shape -> (10662, 56)
y.shape -> (10662, 2)
len(vocabulary) -> 12766
len(vocabulary_inv) -> 12766
X_train.shape -> (8529, 56)
y_train.shape -> (8529, 2)
X_test.shape -> (2133, 56)
y_test.shape -> (2133, 2)