Class TextDocumentStatistics

java.lang.Object
com.kohlschutter.boilerpipe.document.TextDocumentStatistics

public final class TextDocumentStatistics extends Object
Provides shallow statistics (word and block counts) on a given TextDocument.
  • Field Details

    • numWords

      private int numWords
    • numBlocks

      private int numBlocks
  • Constructor Details

    • TextDocumentStatistics

      public TextDocumentStatistics(TextDocument doc, boolean contentOnly)
      Computes statistics on a given TextDocument.
      Parameters:
      doc - The TextDocument.
      contentOnly - if true, then only blocks marked as content are counted.
  • Method Details

    • avgNumWords

      public float avgNumWords()
      Returns the average number of words at block-level (= overall number of words divided by the number of blocks).
      Returns:
      The average number of words per block.
    • getNumWords

      public int getNumWords()
      Returns the overall number of words in all blocks.
      Returns:
      The total number of words across all blocks.
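
The arithmetic behind avgNumWords() and getNumWords() can be illustrated with a minimal, self-contained sketch. It does not use the boilerpipe classes: BlockStatsSketch and its nested Block type are hypothetical stand-ins, and the sketch assumes that contentOnly restricts counting to blocks flagged as content. How the real class reads blocks from a TextDocument is not shown on this page.

```java
import java.util.Arrays;
import java.util.List;

public class BlockStatsSketch {
    /** Hypothetical stand-in for a text block: a word count plus a content flag. */
    static final class Block {
        final int numWords;
        final boolean content;

        Block(int numWords, boolean content) {
            this.numWords = numWords;
            this.content = content;
        }
    }

    private int numWords;
    private int numBlocks;

    /** Sums words and counts blocks; skips non-content blocks when contentOnly is true (assumed semantics). */
    BlockStatsSketch(List<Block> blocks, boolean contentOnly) {
        for (Block b : blocks) {
            if (contentOnly && !b.content) {
                continue;
            }
            numWords += b.numWords;
            numBlocks++;
        }
    }

    /** Overall number of words divided by the number of counted blocks. */
    float avgNumWords() {
        // Float division: with zero counted blocks this sketch yields NaN rather than throwing.
        return numWords / (float) numBlocks;
    }

    /** Overall number of words in all counted blocks. */
    int getNumWords() {
        return numWords;
    }

    public static void main(String[] args) {
        List<Block> blocks = Arrays.asList(
                new Block(10, true),
                new Block(2, false),
                new Block(20, true));

        BlockStatsSketch all = new BlockStatsSketch(blocks, false);
        BlockStatsSketch contentOnly = new BlockStatsSketch(blocks, true);

        System.out.println(all.getNumWords());         // prints 32
        System.out.println(contentOnly.getNumWords()); // prints 30
        System.out.println(contentOnly.avgNumWords()); // prints 15.0
    }
}
```

In this sketch the average is a plain float quotient, so callers that may pass an empty block list would need to check for NaN themselves.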