Indexing documents
Leave feedback
Documents can be added to the search network using the add method of the Indexer class.
The first parameter of the add method is an array of documents to be added. Currently, documents created from a stream or from a structure are supported.
The second parameter of the add method is an array of optional passwords for opening added documents.
The third parameter of the add method is the indexing options object.
The following code example demonstrates adding documents from a stream.
InputStream[] streams = new InputStream[filePaths.length];
Document[] documents = new Document[filePaths.length];
String[] passwords = new String[filePaths.length];
for (int i = 0; i < filePaths.length; i++) {
String filePath = filePaths[i];
String extension = FilenameUtils.getExtension(filePath);
InputStream stream = new FileInputStream(filePath);
streams[i] = stream;
Document document = Document.createFromStream(
filePath,
new Date(),
extension,
stream);
documents[i] = document;
}
Indexer indexer = node.getIndexer();
IndexingOptions options = new IndexingOptions();
options.setUseRawTextExtraction(false);
options.getImageIndexingOptions().setEnabledForSeparateImages(true);
options.getImageIndexingOptions().setEnabledForEmbeddedImages(true);
options.getImageIndexingOptions().setEnabledForContainerItemImages(true);
options.getOcrIndexingOptions().setEnabledForSeparateImages(true);
options.getOcrIndexingOptions().setEnabledForEmbeddedImages(true);
options.getOcrIndexingOptions().setEnabledForContainerItemImages(true);
indexer.add(documents, passwords, options);
for (InputStream stream : streams) {
stream.close();
}
You may easily run the code from documentation articles and see the features in action in our GitHub examples:
Along with full featured .NET library we provide simple, but powerful free Apps.
You are welcome to search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our free online Free Online Document Search App.
Was this page helpful?
Any additional feedback you'd like to share with us?
Please tell us how we can improve this page.
Thank you for your feedback!
We value your opinion. Your feedback will help us improve our documentation.