Dashboard
Available corpora
Browse full list of (English) corpora here.
Among others, …
Subcorpora
You can create subcorpora for pre-loaded and self-compiled corpora based on
- all available metadata categories (e.g. timestamps, topics, filenames)
- concordance searches
Queries
You run queries from the Concordance
view.
There are two options:
- basic searches: basic
- advanced searches: more involved and powerful (e.g. searching for constructions based on lemmatized forms or word classes)
Basic queries
Advanced (CQL) queries
![]()
Helpful: manual and CQL builder
.
Extracting parts of your query matches using within
:
Options:
- query metadata within CQL syntax (e.g.
[word="bank"] within <doc topic="recreation" />
)
- perform ‘text type’1 filtering using the dropdown menus, which is also available for simple queries (see above).
Concordance view
Collocations
![]()
Additional measures (e.g. log likelihood) and other options are available in the advanced settings.
Word sketches
Word sketch difference: between two words/phrases
Word sketch difference: between two subcorpora
![]()
Visualizations
![]()
Annotating data
for metadata: see Figure 1 above
for concordance lines:
Exporting data
Almost everything can be exported:
- your entire annotated corpora
- results from queries/concordances
- results from collocations
- results from word sketches
I recommend exporting data in .xlsx
format, since this seems to be best supported by SkE.1
Compiling a corpus: dead authors’ minds
See Section Compiling corpora above.
Sharing corpora: the toy corpus of Gutenberg books that I created for this tutorial is named qw-gutenberg
and it should be accessible by all LMUlers.
Studying syntactic constructions: the N BE that
Select pre-loaded corpus: Gutenberg English 2020
![]()
Query inspired by: Schmid, Hans-Jörg, and Annette Mantlik. 2015. ‘Entrenchment in Historical Corpora? Reconstructing Dead Authors’ Minds from Their Usage Profiles’. Anglia 133 (4): 583—623.
Search for target construction
Get frequency distribution of nouns in target construction:
Distribution across all authors in SkE:
![]()
Plot in exported Excel file:
Individual analysis on Samuel Pepys’ works:
Results for Samuel Pepys:
![]()
Comparing collocational profiles
corpus: enTenTen20
method: for the lemma bankn, get word sketch differences between texts with recreation
and business
as topics
Results:
- Investigating the frequency increase of whatever:
Results:
Plotting the exported version in Excel: