<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-id journal-id-type="elibrary">80301</journal-id>
      <journal-title-group>
        <journal-title>Terra Linguistica</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Terra Linguistica</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2782-5450</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">2</article-id>
      <article-id pub-id-type="doi">10.18721/JHSS.14302</article-id>
      <title-group>
        <article-title>Problems of article structure formalization of the “Dictionary of the russian language of the 18th century” prior to electronic edition</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Проблемы формализации структуры словарной статьи «Словаря русского языка XVIII века» при подготовке электронного издания</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Egorov</surname>
            <given-names>Igor</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Molkov</surname>
            <given-names>Georgiy</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>georgiymolkov@gmail.com</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">The Institute for Linguistic Studies of the Russian Academy of Sciences</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2023-09-29">
        <day>29</day>
        <month>09</month>
        <year>2023</year>
      </pub-date>
      <volume>14</volume>
      <issue>3</issue>
      <fpage>19</fpage>
      <lpage>27</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://human.spbstu.ru/userfiles/files/articles/2023/3/19-27.pdf"/>
      <abstract xml:lang="en">
        <p>This article discusses the results of manual processing and classification of the structural elements of the “Dictionary of the Russian language of the 18th century” obtained from its 22 issues published so far. The purpose of this work is to accommodate the manifold repertory of metalinguistic techniques and design features characteristic of the aforementioned dictionary to a unified data structure frame, which could significantly facilitate the preparation of its database-driven digital version. The core difficulty of the task discovered during our analysis of the dictionary structure is the fact that there is no obvious way to determine the limit of deviations from the printed version acceptable for the digital edition. In our taxonomy we distinguish two types of structures, namely generic and unique ones. They can be formally represented by a three-level system of components: (1) the basic ones, (2) those subordinate to the basic ones, and (3) simple items of two types: (3.1) primary elements or (3.2) complex typical structures (component blocks). In this system, the generic structures are preserved entirely and without exception, whereas the unique ones can be included or left out by an additional decision. The particular features of the paper version which do not affect the data  structure are preserved in cases where the scope of a certain element is narrowed or expanded within one component block. In fact, if any atypical or rare use of an element exceeded the boundaries of a block and it were nevertheless decided to preserve this element, it would be necessary to expand the data structure with a new component useless outside this idiosyncratic case. In such situations, we propose to eliminate the unclaimed elements of dictionary entries from the original text in order to adjust it to the standard metalanguage of the dictionary.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>electronic lexicography</kwd>
        <kwd>normal data model</kwd>
        <kwd>database</kwd>
        <kwd>formalization</kwd>
        <kwd>Dictionary of the Russian language of the 18th century</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
