The linguistics of the last century was dominated by F. de Saussure’s grandiose program of synchronic linguistics, the methodological level of which was substantially elevated by the Chomskyan revolution. However, the more recent development has confirmed the necessity to work with basic ingredients of the European structuralist approach, based on syntactic dependency, so that the concept of constituent seems to lose its prominent position.
Různé přístupy к teoretickému popisu jazyka je třeba porovnávat jak co do empirického rozsahu popisného rámce, tak co do kvalit a stupně dodržování jejich principů. Důležité je diskutoval věcně o možnosti spojení jednotlivých dílčích řešení do bezrozporných celků a o možnostech, jak co nejekonomičtěji zachytit jádro (centrum) jazykového systému.
The Prague Dependency Treebank 3.5 is the 2018 edition of the core Prague Dependency Treebank (PDT). It contains all PDT annotation made at the Institute of Formal and Applied Linguistics under various projects between 1996 and 2018 on the original texts, i.e., all annotation from PDT 1.0, PDT 2.0, PDT 2.5, PDT 3.0, PDiT 1.0 and PDiT 2.0, plus corrections, new structure of basic documentation and new list of authors covering all previous editions. The Prague Dependency Treebank 3.5 (PDT 3.5) contains the same texts as the previous versions since 2.0; there are 49,431 annotated sentences (832,823 words) on all layers, from tectogrammatical annotation to syntax to morphology. There are additional annotated sentences for syntax and morphology; the totals for the lower layers of annotation are: 87,913 sentences with 1,502,976 words at the analytical layer (surface dependency syntax) and 115,844 sentences with 1,956,693 words at the morphological layer of annotation (these totals include the annotation with the higher layers annotated as well). Closely linked to the tectogrammatical layer is the annotation of sentence information structure, multiword expressions, coreference, bridging relations and discourse relations.
In the Prague Dependency Treebank, a part of the texts from the Czech National Corpus is being annotated on several layers, including the underlying (tectogrammatical) representations. The usefulness of such a treebank is briefly characterized and a large set of topics is discussed for which further monographical research appears to be necessary. The future discussion and elaboration of these topics can be carried out much more effectively with the use of the annotated corpus, and the results thus gained may then serve to an enrichment of the descriptive framework and of the annotation procedure.