
The most valuable part of the search engine in the national corpus of the Kazakh language is meta-markup, which characterizes the text as a whole


The National Corpus of the Kazakh Language adopts a system of meta-markup, so that every text in the corpus is accompanied by meta-information. Meta-markup here means a set of features of an extralinguistic nature (external annotation), together with technical (service) information related to the text. A review of the literature on corpus linguistics shows that there are several types of markup. Of these, the article considers extralinguistic markup, or meta-markup, which provides information about the text data.
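As a rough illustration of how meta-markup that characterizes a text as a whole can support searching, the following minimal Python sketch attaches one extralinguistic record to each text and selects a subcorpus by those features. The field names (title, author, year, genre), the function select_subcorpus, and the sample values are all hypothetical placeholders; they are not taken from the actual scheme of the National Corpus of the Kazakh Language.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetaMarkup:
    """Extralinguistic description of one text (all fields are hypothetical)."""
    title: str
    author: str
    year: int
    genre: str


@dataclass(frozen=True)
class CorpusText:
    meta: MetaMarkup  # describes the text as a whole
    body: str         # the text itself


def select_subcorpus(texts, **criteria):
    """Return the texts whose meta-markup equals every given field value."""
    return [
        t for t in texts
        if all(getattr(t.meta, field) == value
               for field, value in criteria.items())
    ]


# Placeholder data, invented purely for the example.
corpus = [
    CorpusText(MetaMarkup("Sample A", "Author 1", 1990, "fiction"), "text A"),
    CorpusText(MetaMarkup("Sample B", "Author 2", 2005, "news"), "text B"),
    CorpusText(MetaMarkup("Sample C", "Author 1", 2005, "fiction"), "text C"),
]

fiction = select_subcorpus(corpus, genre="fiction")
fiction_2005 = select_subcorpus(corpus, genre="fiction", year=2005)
```

Because the record describes the whole text rather than individual words, a query restricted by these features narrows the set of texts before any linguistic search is run over their contents.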

About the Author

A. K. Zhubanov
Institute of Linguistics named after A. Baitursynov

Chief Researcher of the Institute of Linguistics named after A. Baitursynov, Doctor of Philology, Professor.




For citations:

Zhubanov A.K. The most valuable part of the search engine in the national corpus of the Kazakh language is meta-markup, which characterizes the text as a whole. Tiltanym. 2016;(2):3-9. (In Kazakh)


This work is licensed under a Creative Commons Attribution 4.0 License.

ISSN 2411-6076 (Print)
ISSN 2709-135X (Online)