Class TikaMetadataExtractor

java.lang.Object
org.craftercms.search.opensearch.impl.tika.TikaMetadataExtractor
All Implemented Interfaces:
MetadataExtractor<org.apache.tika.metadata.Metadata>

public class TikaMetadataExtractor extends Object implements MetadataExtractor<org.apache.tika.metadata.Metadata>
Implementation of MetadataExtractor that uses Apache Tika to parse binary files.
Author:
joseross
  • Field Details

    • supportedMimeTypes

      protected String[] supportedMimeTypes
      The list of mime types that can be handled by this extractor
    • mapping

      protected final Map<String,Object> mapping
      The mapping of Apache Tika properties to extract
  • Constructor Details

    • TikaMetadataExtractor

      public TikaMetadataExtractor(Map<String,Object> mapping)
  • Method Details

    • setSupportedMimeTypes

      public void setSupportedMimeTypes(String[] supportedMimeTypes)
    • isSupported

      protected boolean isSupported(org.apache.tika.metadata.Metadata metadata)
      Indicates if the given metadata can be handled by this extractor
      Parameters:
      metadata - the metadata to check
      Returns:
      true if the metadata is supported
    • extract

      public void extract(org.springframework.core.io.Resource resource, org.apache.tika.metadata.Metadata metadata, Map<String,Object> properties)
      Takes relevant metadata from the source object to the properties parameter
      Specified by:
      extract in interface MetadataExtractor<org.apache.tika.metadata.Metadata>
      Parameters:
      resource - the original file containing the metadata (in case any extra operation is needed)
      metadata - the object that provides the metadata
      properties - the properties that will be indexed