Uses of Interface
com.kohlschutter.boilerpipe.BoilerpipeExtractor
-
Packages that use BoilerpipeExtractor Package Description com.kohlschutter.boilerpipe.extractors Some standard extractors (i.e., completely piped BoilerpipeFilters)com.kohlschutter.boilerpipe.sax Classes related to parsing and producing HTML from/to Boilerpipe TextDocuments. -
-
Uses of BoilerpipeExtractor in com.kohlschutter.boilerpipe.extractors
Classes in com.kohlschutter.boilerpipe.extractors that implement BoilerpipeExtractor Modifier and Type Class Description classArticleExtractorA full-text extractor which is tuned towards news articles.classArticleSentencesExtractorA full-text extractor which is tuned towards extracting sentences from news articles.classCanolaExtractorclassDefaultExtractorA quite generic full-text extractor.classExtractorBaseThe base class of Extractors.classKeepEverythingExtractorMarks everything as content.classKeepEverythingWithMinKWordsExtractorA full-text extractor which extracts the largest text component of a page.classLargestContentExtractorA full-text extractor which extracts the largest text component of a page.classNumWordsRulesExtractorA quite generic full-text extractor solely based upon the number of words per block (the current, the previous and the next block). -
Uses of BoilerpipeExtractor in com.kohlschutter.boilerpipe.sax
Methods in com.kohlschutter.boilerpipe.sax with parameters of type BoilerpipeExtractor Modifier and Type Method Description java.lang.StringHTMLHighlighter. process(java.net.URL url, BoilerpipeExtractor extractor)Fetches the givenURLusingHTMLFetcherand processes the retrieved HTML using the specifiedBoilerpipeExtractor.java.util.List<Image>ImageExtractor. process(java.net.URL url, BoilerpipeExtractor extractor)Fetches the givenURLusingHTMLFetcherand processes the retrieved HTML using the specifiedBoilerpipeExtractor.
-