Class ContentSource
java.lang.Object
org.apache.lucene.benchmark.byTask.feeds.ContentItemsSource
org.apache.lucene.benchmark.byTask.feeds.ContentSource
- All Implemented Interfaces:
Closeable,AutoCloseable
- Direct Known Subclasses:
DirContentSource,EnwikiContentSource,LineDocSource,LongToEnglishContentSource,ReutersContentSource,SingleDocSource,TrecContentSource
Represents content from a specified source, such as TREC, Reuters etc. A
ContentSource is
responsible for creating DocData objects for its documents to be consumed by DocMaker. It also keeps track of various statistics, such as how many documents were generated,
size in bytes etc.
For supported configuration parameters see ContentItemsSource.
-
Field Summary
Fields inherited from class org.apache.lucene.benchmark.byTask.feeds.ContentItemsSource
encoding, forever, logStep, verbose -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionabstract DocDatagetNextDocData(DocData docData) Returns the nextDocDatafrom the content source.Methods inherited from class org.apache.lucene.benchmark.byTask.feeds.ContentItemsSource
addBytes, addItem, close, collectFiles, getBytesCount, getConfig, getItemsCount, getTotalBytesCount, getTotalItemsCount, printStatistics, resetInputs, setConfig, shouldLog
-
Constructor Details
-
ContentSource
public ContentSource()
-
-
Method Details
-
getNextDocData
Returns the nextDocDatafrom the content source. Implementations must account for multi-threading, as multiple threads can call this method simultaneously.- Throws:
NoMoreDataExceptionIOException
-