Java EncodingPrintWriter Class Code Examples


This article collects typical usage examples of the Java class edu.stanford.nlp.io.EncodingPrintWriter. If you are wondering what the EncodingPrintWriter class is for, how to use it, or where to find usage examples, the selected code samples below may help.



The EncodingPrintWriter class belongs to the edu.stanford.nlp.io package. A total of 13 code examples of the EncodingPrintWriter class are shown below, sorted by popularity by default.
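Before walking through the individual examples, here is a minimal, hypothetical sketch of the two call patterns that recur throughout them: printing to standard output and to standard error with an explicit character encoding. The class name EncodingPrintWriterDemo is made up for illustration; the sketch assumes the Stanford NLP library is on the classpath.

import edu.stanford.nlp.io.EncodingPrintWriter;

public class EncodingPrintWriterDemo {
  public static void main(String[] args) {
    // Print to stdout in an explicit encoding (second argument), as in Examples 2 and 6.
    EncodingPrintWriter.out.println("你好, world", "UTF-8");
    // Print diagnostics to stderr the same way, as in Examples 4, 7, and 13.
    EncodingPrintWriter.err.println("debug message", "UTF-8");
  }
}

Example 1 also shows a single-argument form of println that uses the writer's default encoding.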

Example 1: main

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/** Mainly for testing.  Usage:
 *  <code>ChineseUtils ascii spaceChar word*</code>
 *  <p>
 *  ascii and spaceChar are integers: 0 = leave, 1 = ascii, 2 = fullwidth.
 *  The words listed are then normalized and sent to stdout.
 *  If no words are given, the program reads from and normalizes stdin.
 *  Input is assumed to be in UTF-8.
 *
 *  @param args Command line arguments as above
 *  @throws IOException If any problems accessing command-line files
 */
public static void main(String[] args) throws IOException {
  if (args.length < 3) {
    System.err.println("usage: ChineseUtils ascii space midDot word*");
    System.err.println("  First 3 args are int flags; a filter or maps args as words; assumes UTF-8");
    return;
  }
  int i = Integer.parseInt(args[0]);
  int j = Integer.parseInt(args[1]);
  int midDot = Integer.parseInt(args[2]);
  if (args.length > 3) {
    for (int k = 3; k < args.length; k++) {
      EncodingPrintWriter.out.println(normalize(args[k], i, j, midDot));
    }
  } else {
    BufferedReader r =
      new BufferedReader(new InputStreamReader(System.in, "UTF-8"));
    String line;
    while ((line = r.readLine()) != null) {
      EncodingPrintWriter.out.println(normalize(line, i, j, midDot));
    }
  }
}
 
Developer ID: paulirwin, Project: Stanford.NER.Net, Lines of code: 34, Source file: ChineseUtils.java


Example 2: main

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
public static void main(String[] args) throws IOException {
  if (args.length < 1 ||
      ! (args[0].equals("-a2b") || args[0].equals("-b2a"))) {
    System.err.println("usage: java Buckwalter [-a2b|-b2a] words+ OR, as a filter, just [-a2b|-b2a]");
    return;
  }
  Properties p = StringUtils.argsToProperties(args);
  Buckwalter b;
  b = new Buckwalter(args[0].equals("-a2b"));
  if(p.containsKey("outputUnicodeValues"))
    b.outputUnicodeValues = true;
  int j = (p.containsKey("outputUnicodeValues") ? 2 : 1);
  if (j < args.length) {
    for (; j < args.length; j++) {
      EncodingPrintWriter.out.println(args[j] + " -> " + b.apply(args[j]), "utf-8");
    }
  } else {
    BufferedReader br = new BufferedReader(new InputStreamReader(System.in, "utf-8"));
    String line;
    while ((line = br.readLine()) != null) {
      EncodingPrintWriter.out.println(b.apply(line), "utf-8");
    }
  }
  if (DEBUG) {
    if ( ! b.unmappable.keySet().isEmpty()) {
      EncodingPrintWriter.err.println("Characters that could not be converted [passed through!]:", "utf-8");
      EncodingPrintWriter.err.println(b.unmappable.toString(), "utf-8");
    } else {
      EncodingPrintWriter.err.println("All characters successfully converted!", "utf-8");
    }
  }
}
 
Developer ID: FabianFriedrich, Project: Text2Process, Lines of code: 33, Source file: Buckwalter.java


Example 3: printDebug

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
public void printDebug() {
  if (DEBUG) {
    if ( ! b.unmappable.keySet().isEmpty()) {
      EncodingPrintWriter.err.println("Characters that could not be converted [passed through!]:", "utf-8");
      EncodingPrintWriter.err.println(b.unmappable.toString(), "utf-8");
    } else {
      EncodingPrintWriter.err.println("All characters successfully converted!", "utf-8");
    }
  }
}
 
Developer ID: FabianFriedrich, Project: Text2Process, Lines of code: 11, Source file: Buckwalter.java


Example 4: makeObjects

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/**
 * Build the set of dependencies for evaluation.  This set excludes
 * all dependencies for which the argument is a punctuation tag.
 */
@Override
protected
Set<?> makeObjects(Tree tree) {
  Set<Dependency<Label, Label, Object>> deps = new HashSet<Dependency<Label, Label, Object>>();
  for (Tree node : tree.subTreeList()) {
    if (DEBUG) EncodingPrintWriter.err.println("Considering " + node.label());
    // every child with a different head is an argument, as are ones with
    // the same head after the first one found
    if (node.isLeaf() || node.children().length < 2) {
      continue;
    }
    // System.err.println("XXX node is " + node + "; label type is " +
    //                         node.label().getClass().getName());
    String head = ((HasWord) node.label()).word();
    boolean seenHead = false;
    for (int cNum = 0; cNum < node.children().length; cNum++) {
      Tree child = node.children()[cNum];
      String arg = ((HasWord) child.label()).word();
      if (DEBUG) EncodingPrintWriter.err.println("Considering " + head + " --> " + arg);
      if (head.equals(arg) && !seenHead) {
        seenHead = true;
        if (DEBUG) EncodingPrintWriter.err.println("  ... is head");
      } else if (!punctFilter.accept(arg)) {
        deps.add(new UnnamedDependency(head, arg));
        if (DEBUG) EncodingPrintWriter.err.println("  ... added");
      } else if (DEBUG) {
        if (DEBUG) EncodingPrintWriter.err.println("  ... is punct dep");
      }
    }
  }
  if (DEBUG) {
    EncodingPrintWriter.err.println("Deps: " + deps);
  }
  return deps;
}
 
Developer ID: FabianFriedrich, Project: Text2Process, Lines of code: 40, Source file: DependencyEval.java


Example 5: accept

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/** Doesn't accept nodes that only cover an empty. */
public boolean accept(Tree t) {
  Tree[] kids = t.children();
  Label l = t.label();
  if ((l != null) && l.value() != null && // there appears to be a mistake in CTB3 where the label "-NONE-1" is used once
          // presumably it should be "-NONE-" and be spliced out here.
          (l.value().matches("-NONE-.*")) && !t.isLeaf() && kids.length == 1 && kids[0].isLeaf()) {
    // Delete empty/trace nodes (ones marked '-NONE-')
    if ( ! l.value().equals("-NONE-")) {
      EncodingPrintWriter.err.println("Deleting errant node " + l.value() + " as if -NONE-: " + t, ChineseTreebankLanguagePack.ENCODING);
    }
    return false;
  }
  return true;
}
 
Developer ID: FabianFriedrich, Project: Text2Process, Lines of code: 16, Source file: CTBErrorCorrectingTreeNormalizer.java


Example 6: main

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/**
 * The main() method tokenizes a file in the specified Encoding
 * and prints it to standard output in the specified Encoding.
 * Its arguments are (Infile, Encoding).
 */
public static void main(String[] args) throws IOException {

  String encoding = args[1];
  Reader in = new BufferedReader(new InputStreamReader(new FileInputStream(args[0]), encoding));

  Tokenizer<String> st = new CHTBTokenizer(in);

  while (st.hasNext()) {
    String s = st.next();
    EncodingPrintWriter.out.println(s, encoding);
    // EncodingPrintWriter.out.println("|" + s + "| (" + s.length() + ")",
    //				encoding);
  }
}
 
Developer ID: FabianFriedrich, Project: Text2Process, Lines of code: 20, Source file: CHTBTokenizer.java


Example 7: WordToSentenceProcessor

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/**
 * Flexibly set the set of acceptable sentence boundary tokens,
 * the set of tokens commonly following sentence boundaries, and also
 * the set of tokens that are sentences boundaries that should be
 * discarded.
 * This is private because it is a dangerous constructor. It's not clear what the semantics
 * should be if there are both boundary token sets, and patterns to match.
 */
private WordToSentenceProcessor(Set<String> boundaryTokens, Set<String> boundaryFollowers, Set<String> boundaryToDiscard, Pattern regionBeginPattern, Pattern regionEndPattern) {
  sentenceBoundaryTokens = boundaryTokens;
  sentenceBoundaryFollowers = boundaryFollowers;
  sentenceBoundaryToDiscard = boundaryToDiscard;
  sentenceRegionBeginPattern = regionBeginPattern;
  sentenceRegionEndPattern = regionEndPattern;
  if (DEBUG) {
    EncodingPrintWriter.err.println("WordToSentenceProcessor: boundaryTokens=" + boundaryTokens, "UTF-8");
    EncodingPrintWriter.err.println("  boundaryFollowers=" + boundaryFollowers, "UTF-8");
    EncodingPrintWriter.err.println("  boundaryToDiscard=" + boundaryToDiscard, "UTF-8");
  }
}
 
Developer ID: FabianFriedrich, Project: Text2Process, Lines of code: 21, Source file: WordToSentenceProcessor.java


Example 8: WordToSentenceProcessor

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/**
 * Flexibly set a pattern that matches acceptable sentence boundaries,
 * the set of tokens commonly following sentence boundaries, and also
 * the set of tokens that are sentence boundaries that should be discarded.
 * This is private because it is a dangerous constructor. It's not clear what the semantics
 * should be if there are both boundary token sets, and patterns to match.
 */
private WordToSentenceProcessor(String boundaryTokenRegex, Set<String> boundaryFollowers, Set<String> boundaryToDiscard, Pattern regionBeginPattern, Pattern regionEndPattern) {
  sentenceBoundaryTokenPattern = Pattern.compile(boundaryTokenRegex);
  sentenceBoundaryFollowers = boundaryFollowers;
  setSentenceBoundaryToDiscard(boundaryToDiscard);
  sentenceRegionBeginPattern = regionBeginPattern;
  sentenceRegionEndPattern = regionEndPattern;
  if (DEBUG) {
    EncodingPrintWriter.err.println("WordToSentenceProcessor: boundaryTokens=" + boundaryTokenRegex, "UTF-8");
    EncodingPrintWriter.err.println("  boundaryFollowers=" + boundaryFollowers, "UTF-8");
    EncodingPrintWriter.err.println("  boundaryToDiscard=" + boundaryToDiscard, "UTF-8");
  }
}
 
Developer ID: paulirwin, Project: Stanford.NER.Net, Lines of code: 20, Source file: WordToSentenceProcessor.java


Example 9: train

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/**
 * Trains this UWM on the Collection of trees.
 */
public void train(TaggedWord tw, int loc, double weight) {
  IntTaggedWord iTW = 
    new IntTaggedWord(tw.word(), tw.tag(), wordIndex, tagIndex);
  IntTaggedWord iT = new IntTaggedWord(nullWord, iTW.tag);
  IntTaggedWord iW = new IntTaggedWord(iTW.word, nullTag);
  seenCounter.incrementCount(iW, weight);
  IntTaggedWord i = NULL_ITW;
  
  if (treesRead > indexToStartUnkCounting) {
    // start doing this once some way through trees; 
    // treesRead is 1 based counting
    if (seenCounter.getCount(iW) < 1.5) {
      // it's an entirely unknown word
      int s = model.getSignatureIndex(iTW.word, loc, 
                                      wordIndex.get(iTW.word));
      if (DOCUMENT_UNKNOWNS) {
        String wStr = wordIndex.get(iTW.word);
        String tStr = tagIndex.get(iTW.tag);
        String sStr = wordIndex.get(s);
        EncodingPrintWriter.err.println("Unknown word/tag/sig:\t" +
                                        wStr + '\t' + tStr + '\t' + 
                                        sStr, "UTF-8");
      }
      IntTaggedWord iTS = new IntTaggedWord(s, iTW.tag);
      IntTaggedWord iS = new IntTaggedWord(s, nullTag);
      unSeenCounter.incrementCount(iTS, weight);
      unSeenCounter.incrementCount(iT, weight);
      unSeenCounter.incrementCount(iS, weight);
      unSeenCounter.incrementCount(i, weight);
      // rules.add(iTS);
      // sigs.add(iS);
    } // else {
      // if (seenCounter.getCount(iTW) < 2) {
      // it's a new tag for a known word
      // do nothing for now
      // }
      // }
  }
}
 
Developer ID: benblamey, Project: stanford-nlp, Lines of code: 43, Source file: EnglishUnknownWordModelTrainer.java


Example 10: run

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/**
 * Runs this session by reading a string, tagging it, and writing
 * back the result.  The input should be a single line (no embedded
 * newlines), which represents a whole sentence or document.
 */
@Override
public void run() {
  if (DEBUG) {System.err.println("Created new session");}

  try {
    String input = in.readLine();
    if (DEBUG) {
      EncodingPrintWriter.err.println("Receiving: \"" + input + '\"', charset);
    }
    if (! (input == null)) {
      String output = tagger.apply(input);
      if (DEBUG) {
        EncodingPrintWriter.err.println("Sending: \"" + output + '\"', charset);
      }
      out.print(output);
      out.flush();
    }
    close();
  } catch (IOException e) {
    System.err.println("MaxentTaggerServer:Session: couldn't read input or error running POS tagger");
    e.printStackTrace(System.err);
  } catch (NullPointerException npe) {
    System.err.println("MaxentTaggerServer:Session: connection closed by peer");
    npe.printStackTrace(System.err);
  }
}
 
Developer ID: jaimeguzman, Project: data_mining, Lines of code: 32, Source file: MaxentTaggerServer.java


Example 11: train

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/**
 * Trains this lexicon on the Collection of trees.
 */
public void train(TaggedWord tw, int loc, double weight) {
  IntTaggedWord iTW = 
    new IntTaggedWord(tw.word(), tw.tag(), wordIndex, tagIndex);
  IntTaggedWord iT = new IntTaggedWord(nullWord, iTW.tag);
  IntTaggedWord iW = new IntTaggedWord(iTW.word, nullTag);
  seenCounter.incrementCount(iW, weight);
  IntTaggedWord i = NULL_ITW;
  
  if (treesRead > indexToStartUnkCounting) {
    // start doing this once some way through trees; 
    // treesRead is 1 based counting
    if (seenCounter.getCount(iW) < 2) {
      // it's an entirely unknown word
      int s = model.getSignatureIndex(iTW.word, loc, 
                                      wordIndex.get(iTW.word));
      if (DOCUMENT_UNKNOWNS) {
        String wStr = wordIndex.get(iTW.word);
        String tStr = tagIndex.get(iTW.tag);
        String sStr = wordIndex.get(s);
        EncodingPrintWriter.err.println("Unknown word/tag/sig:\t" +
                                        wStr + '\t' + tStr + '\t' + 
                                        sStr, "UTF-8");
      }
      IntTaggedWord iTS = new IntTaggedWord(s, iTW.tag);
      IntTaggedWord iS = new IntTaggedWord(s, nullTag);
      unSeenCounter.incrementCount(iTS, weight);
      unSeenCounter.incrementCount(iT, weight);
      unSeenCounter.incrementCount(iS, weight);
      unSeenCounter.incrementCount(i, weight);
    } // else {
  }
}
 
Developer ID: amark-india, Project: eventspotter, Lines of code: 36, Source file: ArabicUnknownWordModelTrainer.java


Example 12: printlnErr

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
private static void printlnErr(String s) {
  EncodingPrintWriter.err.println(s, ChineseTreebankLanguagePack.ENCODING);
}
 
Developer ID: FabianFriedrich, Project: Text2Process, Lines of code: 4, Source file: ChineseTreebankParserParams.java


Example 13: writeTagsAndErrors

import edu.stanford.nlp.io.EncodingPrintWriter; // import the required package/class
/** Write the tagging and note any errors (if pf != null) and accumulate
 *  global statistics.
 *
 *  @param finalTags Chosen tags for sentence
 *  @param pf File to write tagged output to (can be null, then no output;
 *               at present it is non-null iff the debug property is set)
 */
protected void writeTagsAndErrors(String[] finalTags, PrintFile pf, boolean verboseResults) {
  StringWriter sw = new StringWriter(200);
  for (int i = 0; i < correctTags.length; i++) {
    sw.write(toNice(sent.get(i)));
    sw.write(tagSeparator);
    sw.write(finalTags[i]);
    sw.write(' ');
    if (pf != null) {
      pf.print(toNice(sent.get(i)));
      pf.print(tagSeparator);
      pf.print(finalTags[i]);
    }
    if ((correctTags[i]).equals(finalTags[i])) {
      numRight++;
    } else {
      numWrong++;
      if (pf != null) pf.print('|' + correctTags[i]);
      if (verboseResults) {
        EncodingPrintWriter.err.println((maxentTagger.dict.isUnknown(sent.get(i)) ? "Unk" : "") + "Word: " + sent.get(i) + "; correct: " + correctTags[i] + "; guessed: " + finalTags[i], encoding);
      }

      if (maxentTagger.dict.isUnknown(sent.get(i))) {
        numWrongUnknown++;
        if (pf != null) pf.print("*");
      }// if
    }// else
    if (pf != null) pf.print(' ');
  }// for
  if (pf != null) pf.println();

  if (verboseResults) {
    PrintWriter pw;
    try {
      pw = new PrintWriter(new OutputStreamWriter(System.out, encoding), true);
    } catch (UnsupportedEncodingException uee) {
      pw = new PrintWriter(new OutputStreamWriter(System.out), true);
    }
    pw.println(sw);
  }
}
 
Developer ID: benblamey, Project: stanford-nlp, Lines of code: 48, Source file: TestSentence.java



Note: The edu.stanford.nlp.io.EncodingPrintWriter class examples in this article were collected from open-source projects hosted on GitHub, MSDocs, and similar platforms. The code snippets were selected from projects contributed by open-source developers, and copyright remains with the original authors. Follow each project's license when redistributing or using this code; do not repost without permission.

