Year: 2025
Pages: 328–341
UDC: 811.512.141’322
Number: Volume 2, no. 3
Type: scientific article
DOI: https://doi.org/10.31833/sifk/2025.2.3.39
Topic: SCIENTIFIC COLLECTIONS AND FUNDS OF THE IHLL UFRC RAS
Authors: Sirazitdinov, Zinnur A.
The article examines the structure and functionality of the Machine fund of the Bashkir language (MFBL), which was established at the Institute of History, Language and Literature of the UFIC RAS. The MFBL is an integrated system designed to search for linguistic information. It includes several databases. Work on the creation of the MFBL began in 2006. At the moment, the fund consists of ten major sections: general dictionary; lexicographic databases; grammatical bases; experimental phonetic bases; catalogues of handwritten and old printed books; dialectological base; corpus databases containing texts of prose, journalistic, and folklore works in the Bashkir language. In total, the fund contains 75 different linguistic databases. The MFBL system was developed based on the ORACLE database management system. The inclusion of linguistic data in a relational database requires careful analysis and separation of information into its component parts, which makes it possible to efficiently and quickly obtain generalized characteristics. The machine fund of the language has not only scientific, but also practical significance. It is a tool for optimizing and improving the quality of educational materials, including the preparation of language examples for textbooks and teaching aids. The Ministry of Education of the Republic of Bashkortostan actively promotes the use of the Machine Fund among teachers of the Bashkir language. The use of Machine Resources by editors, journalists, and translators certainly contributes to improving the level of Bashkir language proficiency. The machine fund of the language also has significant socio-economic value. Due to the fact that a large number of dictionary and grammar materials are available on the Internet, there is no need for the expensive process of republishing and distributing these materials on paper; Automatic search in the foundation’s databases makes it possible to find philological information faster. This, in turn, accelerates the creation of new linguistic developments and didactic materials. Due to the availability of Bashkir language material on the Internet, residents of the republic can be satisfied with the current language and national policy.
the Machine fund of the Bashkir language, database, types of linguistic databases, corpus linguistics, Bashkir language corpus projects, applied linguistics
Andryushchenko, V.M., 1985, “Machine fund of the Russian language: statement of the problem and practical steps”, Questions of Linguistics, no. 2, pp. 54–64. (In Russ.)
Bochkarev, V.V., 2019, “Machine fund of the Yakut language”, Electronic writing of the peoples of the Russian Federation: experience, problems and prospects: materials of the II International scientifi c conference, Ufa, pp. 59–61.
Bulgakov, R.M., 2001. Catalog of Arabic-script books of the National Museum of the Republic of Bashkortostan. Ufa, 127 p. (In Russ.)
Bulgakov, R.M., 2002, Description of Eastern Manuscripts of the Institute of History, Language and Literature. Ufa: Gilem. Part 1. Turkic Manuscripts. Issue 1. Works of the 12th–18th centuries. 128 p. (In Russ.)
Buskunbaeva, L.A., Sirazitdinov, Z.A., Ishmukhametova, A.Sh., Ibragimova, A.D., Migranova, L.G., 2012, “Corpus of texts from periodicals in the Bashkir language”, Current problems of dialectology of the languages of the peoples of Russia (“Materials of the XII regional conference”), Ufa, pp. 139–141. (In Russ.)
Buskunbaeva, L.A., Sirazitdinov, Z.A., Ishmukhametova, A.Sh. 2017, “Composition and structure of the corpus of journalism of the Bashkir language”, Electronic writing of the peoples of the Russian Federation: experience, problems and prospects: materials of the International scientific conference, Syktyvkar, pp. 39– 43. (In Russ.)
Buskunbaeva, L.A., Sirazitdinov, Z.A., 2020, “Development of an audio corpus of the eastern dialect of the Bashkir language: problems and prospects” Bulletin of the Ufa Scientific Center of the Russian Academy of Sciences, no. 2, pp. 90–97. (In Russ.)
Volodina, N.I., 2008, “Chuvash Republic”, Multilingualism in Russia: regional aspects, Mezhregionalnii tsentr bibliotechnogo sotrudnichestva, Moscow, pp. 22–30. (In Russ.)
The Second All-Union Conference on the creation of the Ministry of Foreign Affairs of the Russian Federation, 1988, ed. Yu.N. Karaulov. (Conference materials), 230 p. (In Russ.)
Galiullin, K.R., Obnosova, N.A., Tuhvatullina, A.A., Sharipzyanova, L.S. 1994, “Machine fund of the Tatar language: features of formation and functioning”, Problems of lexicology and terminology of the Tatar language, Issue 2. Kazan, pp. 127–134. (In Russ.)
Gerd, A.S., 1986, “Russian morphology and the machine fund of the Russian language”, Questions of linguistics, no. 6, pp. 90–96. (In Russ.)
Dialectological atlas of the Bashkir language, 2005. Gilem, Ufa, 243 p. (In Russ.)
Esipova, A.V., 1992, “Creation of a machine fund of the Shor language”, Languages, spiritual culture and history of the Turks: traditions and modernity. Proceedings of the international conference in 3 volumes. Vol. 1. Kazan, pp. 244–247. (In Russ.)
Zhubanov, A.K., 2009, Database “Til – kazyna” of the Kazakh word and its theoretical foundations. Arys, Almaty, 304 p. (In Kazkh.)
Zhubanov, A.H., Uskombaev, S.A., 1988, “On the creation of a machine fund of the Kazakh language”, Materials of the working meeting “Machine funds of the languages of the peoples of the USSR, November 13–22, 1988. Tbilisi, pp. 16–17. (In Russ.)
Machine Funds of the Languages of the Peoples of the USSR: Materials of the Working Meeting (Tallin, December 19–22, 1988), Institute of Language and Literature of the Academy of Sciences of the Estonian SSR, Tallin, 1988, 21 p. (In Russ.)
Nadergulov, M.Kh., 2024, “Bashkir literary criticism: past, present and future”, Proceedings of the UFRS RAS. Series: History. Philology. Culture, vol. 1, no. 1, pp. 107–114. (In Russ.)
Piotrovsky, R.G., Shcherba, A.M., Guzev, V.G., 1988. “On the Creation of a Machine Fund for Turkic Languages”, Soviet Turkology, no. 2, pp. 92–101. (In Russ.)
Sirazitdinov, Z.A., 2006, Modeling the grammar of the Bashkir language. Inflectional system. Gilem, Ufa, 160 p. (In Russ.)
Sirazitdinov, Z.A., 2013, “On lemmatization in the corpus of Bashkir language”, Actual problems of dialectology of the languages of the peoples of Russia: Proceedings of the XIII international conference, Ufa, pp. 240–242. (In Russ.)
Sirazitdinov, Z.A., 2014, “On modeling the inflectional system of agglutinative languages by paired combinations (using the Bashkir language as an example)”, Actual problems of modern Mongolian and Altaic studies. Proceedings of the International scientifi c conference dedicated to the 75th anniversary of the
birth and 55th anniversary of the scientific and pedagogical activity of Professor V.I. Rassadin, Kalmitskii gosudarstvennii universitet, Elista, pp. 139–143. (In Russ.)
Sirazitdinov, Z.A., Buskunbaeva, L.A., Ishmukhametova, A.Sh., 2015, “About linguistic corpus of the Bashkir language”, Turkic languages Processing: Turklang–2015, Kazan, pp. 269–276. (In Russ.)
Sirazitdinov, Z.A., Buskunbaeva, L.A., Ishmukhametova, A.Sh., 2019, “On the processing of sound materials for the dialectological audio corpus of the Bashkir language”, Turkologia (Kazakhstan, Turkestan), no. 4 (96), pp. 35–45. (In Russ.)
Sirazitdinov, Z.A., Buskunbaeva, L.A., Ishmuhametova, A.Sh., Ibragimova, A.D., 2013, Information systems and databases of the Bashkir language. Knizhnaya palata RB, Ufa, 116 p. (In Russ.)
Sirazitdinov, Z.A., Polyanin, A.I. 2014, “On the experience of developing an integrated corpus system based on the ORACLE SUBD”, Proceedings of the Kazan School of Computer and Cognitive Linguistics, Fan, Kazan, pp. 85–88. (In Russ.)
Sirazitdinov, Z.A., Polyanin A.I., Ibragimova, A.D., Ishmuhametova, A.Sh., 2013, “Corpus of the Bashkir language: principles of development”, Problems of Oriental Studies, no. 4 (62), pp. 65–72. (In Russ.)
Khusainova, G.R., 2024. “Bashkir folklore studies in the system of modern humanitarian science (experience of collecting, publishing, research)”, Proceedings of the UFRS RAS. Series: History. Philology. Culture, vol. 1, no. 1, pp. 89–96. (In Russ.)
Shamsutdinova, G.G., Ishmukhametova, A.Sh., Buskunbaeva, L.A., 2017, “Structure and composition of the database of riddles in the subcorpus of texts of aphoristic genres of Bashkir folklore”, Bulletin of the Kalmyk Institute for Humanitarian Research of the Russian Academy of Sciences, vol. 10, no. 4 (32), pp. 146– 153. (In Russ.)
Shevelev, O.G., 2004, “Representation of a set of texts in a relational database for the purposes of linguistic analysis”, Bulletin of Tomsk State University, no. 284, pp. 222–226. (In Russ.)
The ARTFL Project: A textual database (http://humanities.uchicago.edu/orgs/ ARTFL/artfl.flyer.html). (In Russ.)