#LancsBox: Lancaster University corpus toolbox

User guide [pdf] FAQ [pdf]

Guide de l’utilisateur [pdf]

マニュアル [pdf]


Overview and data

#LancBox is a new-generation corpus analysis tool. Version 5 has been designed primarily for 64-bit operating systems (Windows 64-bit, Mac and Linux) that allow the tool’s best performance. #LancsBox also operates on older 32-bit systems, but its performance is somewhat limited. Downloading and running it is very easy. It is done in three simple steps: 1) download, 2) install and 3) run.

From #LancsBox, v. 5 we have added an installer, which helps to install #LancsBox onto your machine. We recommend default settings for the installer. More details about #LancsBox installation
can be found here.■ #LancsBox is safe to install on your computer.
■ Allow #LancsBox to run by adjusting security settings.
Follow these instructions.

See what's new in #LancsBox v. 6.

Starting with #LancsBox [pdf]

[download video]

Load data into #LancsBox [pdf]

Data can be loaded and imported into #LancsBox on the ‘Corpora’ tab. This tab opens automatically when you run #LancsBox. #LancsBox works with corpora in different formats (.txt, .xml, .doc, .docx, .pdf, .odt, .xls, .xlsx and many others) and with wordlists (.cvs). There are two options for loading corpora and wordlists: i) load data and ii) download corpora and wordlists that are distributed with #LancsBox.

[download video]

Search in #LancsBox [pdf]

Throughout the tool, #LancsBox offers powerful searches at different levels of corpus annotation using i) simple searches, ii) wildcard searches, iii) smart searches and iv) regex searches.

[download video]

KWIC [pdf]

The KWIC tool generates a list of all instances of a search term in a corpus in the form of a concordance. It can be used, for example, to:

  • Find the frequency of a word or phrase in a corpus.
  • Find frequencies of different word classes such as nouns, verbs, adjectives.
  • Find complex linguistic structures such as the passives, split infinitives etc. using ‘smart searches’.
  • Sort, filter and randomise concordance lines.

[download video]

Whelk [pdf]

The Whelk tool provides information about how the search term is distributed across corpus files. It can be used, for example, to:

  • Find absolute and relative frequencies of the search term in corpus files.
  • Filter the results according to different criteria.
  • Sort files according to absolute and relative frequencies of the search term.

[download video]

Words [pdf]

The Words tool allows in-depth analysis of frequencies of types, lemmas and POS categories as well as comparison of corpora using the keywords technique. It can be used, for example, to:

  • Compute frequency and dispersion measures for types, lemmas and POS tags.
  • Visualize frequency and dispersion in corpora.
  • Compare corpora using the keyword technique.
  • Visualize keywords.

[download video]

[download video]

GraphColl [pdf]

The GraphColl tool identifies collocations and displays them in a table and as a collocation graph or network. It can be used, for example, to

  • Find the collocates of a word or phrase.
  • Find colligations (co-occurrence of grammatical categories).
  • Visualise collocations and colligations.
  • Identify shared collocates of words or phrases.
  • Summarise discourse in terms of its ‘aboutness’.

[download video]

Text [pdf]

The Text tool enables an in-depth insight into the context in which a word or phrase is used. It can be used, for example, to

  • View a search term in full context.
  • Preview a text.
  • Preview a corpus as a run-on text.
  • Check different levels of annotation of a text/corpus.

[download video]

Ngrams [pdf]

The Ngrams tool allows in-depth analysis of frequencies of ngram types, lemmas and POS categories as well as comparison of corpora using the key ngram technique. It can be used, for example, to:

  • Compute frequency and dispersion measures for ngram types, lemmas and POS tags.
  • Visualize frequency and dispersion in corpora.
  • Compare corpora using the key ngram technique.
  • Visualize key ngrams.

Wizard [pdf]

The Wizard tool combines the power of all tools in #LancsBox, searches corpora and produces research reports for print (docx) and web (htlm). It can be used, for example, to:

  • Carry out simple or complex research.
  • Produce a draft report.
  • Download all relevant data.

#LancsBox feedback

If you have a question about #LancsBox functionalities, please read the manual or watch video tutorials to see if you can find the answers there. If you are experiencing problems with #LancsBox, try to find the answer in the troubleshooting chart.

Feedback form