org.apache.solr.handler.dataimport
Class TikaEntityProcessor
java.lang.Object
org.apache.solr.handler.dataimport.EntityProcessor
org.apache.solr.handler.dataimport.EntityProcessorBase
org.apache.solr.handler.dataimport.TikaEntityProcessor
public class TikaEntityProcessor
- extends EntityProcessorBase
An implementation of EntityProcessor
which reads data from rich docs using Apache Tika
- Since:
- solr 3.1
- Version:
- $Id: TikaEntityProcessor.java 1146911 2011-07-14 22:21:58Z sarowe $
Fields inherited from class org.apache.solr.handler.dataimport.EntityProcessorBase |
ABORT, CACHE_KEY, CACHE_LOOKUP, cachePk, cacheVariableName, cacheWithWhereClause, context, CONTINUE, dataSourceRowCache, entityName, isFirstInit, ON_ERROR, onError, query, rowIterator, simpleCache, SKIP, SKIP_DOC, TRANSFORM_ROW, TRANSFORMER, transformers |
Method Summary |
protected void |
firstInit(Context context)
first time init call. |
Map<String,Object> |
nextRow()
For a simple implementation, this is the only method that the sub-class should implement. |
Methods inherited from class org.apache.solr.handler.dataimport.EntityProcessorBase |
cacheInit, destroy, getAllNonCachedRows, getFromRowCacheTransformed, getIdCacheData, getNext, getSimpleCacheData, init, nextDeletedRowKey, nextModifiedParentRowKey, nextModifiedRowKey |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TikaEntityProcessor
public TikaEntityProcessor()
firstInit
protected void firstInit(Context context)
- Description copied from class:
EntityProcessorBase
- first time init call. do one-time operations here
- Overrides:
firstInit
in class EntityProcessorBase
nextRow
public Map<String,Object> nextRow()
- Description copied from class:
EntityProcessorBase
- For a simple implementation, this is the only method that the sub-class should implement. This is intended to
stream rows one-by-one. Return null to signal end of rows
- Overrides:
nextRow
in class EntityProcessorBase
- Returns:
- a row where the key is the name of the field and value can be any Object or a Collection of objects. Return
null to signal end of rows
Copyright © 2000-2011 Apache Software Foundation. All Rights Reserved.