Scientific and technical texts in the framework of information 4.0: content analysis and text synthesis
The paper discusses issues of formalization of linguistic knowledge in the framework providing strategic value for information. This is focused in the concept of Information 4.0. This concept reveals that information can be presented and visualized as a cloud of “information molecules”. The information molecules can be accessed directly, assembled, and changed by different users in various formats, integrated with data, other information systems, and products. Information 4.0 examines the form, production, interaction and curation of information components. The underlying assumption of the paper is that information and knowledge are represented and transferred as a text. Thereafter the text structure is the result of information transfer and the starting point of information mining and extraction. The paper reflects that Information 4.0 demands new approaches in how we work on content. New research directions and applied tasks to be solved are seen in advancing from procedures formalizing text semantics to procedures providing machine-readable automatic information structuring and text production. The methodological focus is not on the ready text structures but on linguistic tools and operations explaining how to produce texts according to given models and content. Thereafter in the framework of Information 4.0 the focus is not on the basic model of the typical publication structure IMRED, but on the notion of structured content authoring and productivist approach to the text. Based on a corpus-assisted analysis of current papers of Russian authors in social sciences and humanities the research shows typical faults in the authors’ key words.