GroupDocs.Search Overview

Discover the powerful text search capabilities of the Java Java library.

Illustration search

Indexing and Search Operations in Java

With GroupDocs.Search for Java, you can collect, store, and analyze data efficiently to create detailed indexes for faster, more accurate searches.

Optimize Search by Merging Indexes

Easily combine multiple indexes with GroupDocs.Search for Java to streamline searches. Reduce the impact of smaller delta indexes by consolidating them into a single, high-performance index.

Support for Multilingual Keyboard Layouts

Search across different languages and keyboard layouts with GroupDocs.Search for Java. It supports 88 languages and 164 keyboard configurations for unmatched versatility.

Morphological Search Capabilities

Find different word forms like singular/plural nouns or verb variations using GroupDocs.Search for Java. Customize search options for English and other languages.

Platform independence

GroupDocs.Search for Java is compatible with major operating systems and package managers.

Amazon
Docker
Azure
Eclipse
IntelliJ
Windows
Linux
Maven

Supported file formats

Work with a wide range of file formats using GroupDocs.Search for Java. View the full list.

  • Portable: PDF
  • Word: DOC, DOCX, DOCM, DOT, DOTX, DOTM
  • Excel: XLS, XLSX, XLSM, XLT, XLTX, XLTM, XLSB, XLA, XLAM, CSV, TSV
  • PowerPoint: PPT, PPTX, POT, POTX, PPS, PPSX, PPTM, PPSM, POTM
  • OpenDocument: ODT, ODP, ODS, OTT, OTS
  • Text: TXT, RTF

Media Formats

  • Popular image formats: BMP, JP2, PNG, EMF, WMF, JPG, PSD
  • Multi-page images: GIF, WEBP, TIFF
  • Audio: MP3, WAV
  • Video: AVI, MOV, QT, FLV, ASF

Other

  • Email: PST, OST, MSG, EML, EMLX
  • Microsoft Visio: VSD, VSS
  • Web: XML, HTM, HTML, XHTML, MHT, MHTML
  • Others: TORRENT, ZIP, DCM, DJVU, EPUB, FB2

Features of GroupDocs.Search for Java

Manage document content effectively with advanced search capabilities supporting formats like PDF, DOCX, XLSX, PPTX, and more.

Feature icon

Customizable Search Parameters

Refine searches using date ranges and case sensitivity filters.

Feature icon

Enhanced Spell Check

Search efficiently with spell check, wildcards, and by ignoring special characters.

Feature icon

Filtered Search Results

Apply filters to focus search results based on specific document types or criteria.

Feature icon

Import and Export Index Data

Easily import data for indexing or export results to files for further use.

Feature icon

Skip Unneeded Files

Optimize indexing by excluding specific files or words.

Feature icon

HTML and URL Processing

Extract HTML content to files and generate URLs for navigation through search results.

Feature icon

Fast Search in Large Indexes

Speed up search operations by dividing large indexes into manageable chunks.

Feature icon

Stream-Based Indexing

Index data directly from streams or data structures.

Feature icon

Handle Misspelled Queries

Detect misspellings and suggest alternative words for better search accuracy.

Feature icon

Comprehensive Archive Support

Index nested archives and retrieve detailed lists of files within ZIP files.

Feature icon

Space-Saving Indexing

Compact indexes to save disk space and process password-protected files.

Feature icon

Custom Synonym Support

Expand the synonym dictionary to enhance search accuracy with tailored options.

Code samples

Try out GroupDocs.Search for Java features with these code examples.

Boost Search Accuracy with Fuzzy Matching

Explore the flexibility of GroupDocs.Search for Java for managing content with advanced fuzzy search capabilities. Learn more.

How to process search result

// Create an index
Index index = new Index("C:/IndexFolder");
index.add("C:/DocumentFolder");

// Set up search options
SearchOptions options = new SearchOptions();
options.getFuzzySearch().setEnabled(true);
options.getFuzzySearch().setFuzzyAlgorithm(new TableDiscreteFunction(3));

// Search for documents containing the word 'water' or the phrase 'Lorem ipsum'
String query = "water OR \"Lorem ipsum\"";
SearchResult result = index.search(query, options);

// Process search result
System.out.println("Documents: " + result.getDocumentCount());
System.out.println("Total occurrences: " + result.getOccurrenceCount());
for (int i = 0; i < result.getDocumentCount(); i++) {
    FoundDocument document = result.getFoundDocument(i);
    System.out.println("Document: " + document.getDocumentInfo().getFilePath());
    System.out.println("Occurrences " + document.getOccurrenceCount());
    }

Refine Results with Regular Expressions

Use regular expressions in GroupDocs.Search for Java to create precise and detailed search results. Discover advanced techniques.

How to search using regular expressions

// Create an index
Index index = new Index("C:/IndexFolder");
index.add("c:/DocumentFolder");

// Search for the phrase in text form

// The first caret character at the beginning indicates that this is a regular expression search query
String query = "^^(.)\\1{1,}";
// Search for two or more identical characters at the beginning of a word
SearchResult result = index.search(query);

Ready to get started?

Download GroupDocs.Search for free or get a trial license for full access!

Useful resources

Explore documentation, code samples, and community support to enhance your experience.

Temporary license tips

1
Sign up with your work email.
Free mail services are not allowed.
2
Use Get a temporary license button on the second step.
 English