Skip to content

Check syntactic information

Click on the segment text in search results to open a syntactic information panel.

All transcript data in TCSE is annotated with linguistic analysis using spaCy 3.8 (en_core_web_lg). The following information is available for each token:

Field Description
Surface The original word form as it appears in the text
Lemma The base/dictionary form of the word
POS Part-of-speech tag (coarse: NOUN, VERB, ADJ, etc.)
Tag Fine-grained POS tag (NN, VBZ, JJ, etc.)
Dep Syntactic dependency relation (nsubj, dobj, prep, etc.)
Morph Morphological features (Number, Tense, Voice, etc.)
Entity Named entity type, if applicable (PERSON, ORG, GPE, etc.)
Noun Chunk Whether the token is part of a noun chunk
Frequency Frequency of the word in the corpus

Where to click

Syntactic information panel