Class TikaConfiguration


  • @UriParams
    public class TikaConfiguration
    extends Object
    • Constructor Detail

      • TikaConfiguration

        public TikaConfiguration()
    • Method Detail

      • setOperation

        public void setOperation​(TikaOperation operation)
        Tika Operation - parse or detect
      • setOperation

        public void setOperation​(String operation)
      • setTikaParseOutputFormat

        public void setTikaParseOutputFormat​(TikaParseOutputFormat tikaParseOutputFormat)
        Tika Output Format. Supported output formats.
        • xml: Returns Parsed Content as XML.
        • html: Returns Parsed Content as HTML.
        • text: Returns Parsed Content as Text.
        • textMain: Uses the boilerpipe library to automatically extract the main content from a web page.
      • getTikaParseOutputEncoding

        public String getTikaParseOutputEncoding()
      • setTikaParseOutputEncoding

        public void setTikaParseOutputEncoding​(String tikaParseOutputEncoding)
        Tika Parse Output Encoding - Used to specify the character encoding of the parsed output. Defaults to Charset.defaultCharset().
      • getTikaConfig

        public org.apache.tika.config.TikaConfig getTikaConfig()
      • setTikaConfig

        public void setTikaConfig​(org.apache.tika.config.TikaConfig tikaConfig)
        To use a custom Tika config.
      • getTikaConfigUri

        public String getTikaConfigUri()
      • setTikaConfigUri

        public void setTikaConfigUri​(String tikaConfigUri)
        Tika Config Uri: The URI of tika-config.xml file to use.