Catalogue

The Catalogue tool provides an interface for exploring the contents of a larger, multi-document corpus, as well as for creating a subset (or workset) based on the search criteria. It functions somewhat like a library database or an online store, allowing you to filter documents.

Use it with a Jane Austen corpus or with your own corpus.

Overview

By default the following facets are shown:

  • Title: the document's title (or its filename if no better title was found)
  • Author: the document's author, if specified
  • Language: the document's automatically detected language
  • Terms: all of the terms in the document
Catalogue with the Works of Jane Austen. You can also use Catalogue with your own corpus.

Options

You can filter documents by typing a query into the search box and hitting enter (see Search for more advanced searching capabilities). Note that by default the query includes the full-text, the title and the author. You can use the author or title prefix to only look at one of those metadata fields (e.g. title:love).

Additional Information

We hope to soon offer a better interface for users to select other facets.

The current interface may not work properly with corpora containing more than 1,000 documents.

Please contact us if you're interested in creating a custom Catalogue interface for your text collection, like this:

http://voyant-tools.org/catalogue/docsouth/?facet=facet.keyword,facet.author

See Also