较小的Stanford NLP Models Jar文件

作者: biu~
发布时间: 2025-04-26 06:24:19 (2月前)
转自：

3 条回复

0#
回复此人
喜欢一个人丶 | 2019-08-31 10-32

<div class =“post-text”itemprop =“text”> <P> 如果只是在类路径和pos标记器的模型文件中包含解析器的模型文件，那么您应该没问题。 “引理”需要“pos”，因此您需要将其包含在注释器列表中。 </p> <P> 例如：“edu / stanford / nlp / models / lexparser / englishPCFG.ser.gz”和“edu / stanford / nlp / models / pos-tagger / english-left3words / english-left3words-distsim.tagger”应该是你的全部需要。 </p> <P> 您可以创建该目录结构并在类路径中包含这些文件，或者只使用其中的文件制作一个jar。你绝对可以切掉大部分罐子。 </p> <P> 最重要的是，如果您遗漏了某些内容，您的代码将因资源丢失而崩溃。所以你只需要继续添加文件，直到代码停止崩溃。你肯定不需要那个jar中的很多文件。 </p> </DIV>

编辑
1#
回复此人
老夫的少女心吖 | 2019-08-31 10-32

<div class =“post-text”itemprop =“text”> <P> 按照@StanfordNLPHelp提到的类似方法，我使用了maven-shade-plugin并减小了我最终编译的jar文件的大小。你需要改变“Package.MainClass”和 <code> includes </code> 标记或添加 <code> excludes </code> 标签 </p> <pre> <code> <plugin> <groupId>org.apache.maven.plugins</groupId> <artifactId>maven-shade-plugin</artifactId> <version>3.1.0</version> <executions> <execution> <phase>package</phase> <goals> <goal>shade</goal> </goals> <configuration> <transformers>  <transformer implementation="org.apache.maven.plugins.shade.resource.ManifestResourceTransformer"> <mainClass>Package.MainClass</mainClass> </transformer> </transformers> <minimizeJar>true</minimizeJar> <filters> <filter> <artifact>edu.stanford.nlp:stanford-corenlp</artifact> <includes> <include>**</include> </includes> </filter> <filter> <artifact>edu.stanford.nlp:stanford-corenlp:models</artifact> <includes> <include>edu/stanford/nlp/models/pos-tagger/**</include> </includes> </filter> </filters> </configuration> </execution> </executions> </plugin> </code> </pre> </DIV>

编辑

登录后才能参与评论