Materials of DIALOGUE 2018

10:30-12:00 / Section 1

Document Analysis

Elena Mikhalkova, Yuri Karyakin, Nadezhda Ganzherli

Classifying social network pages by interests of their followers: optimization in small scale datasets

Mikhail Bulygin, Serge Sharoff

Using machine translation for automatic genre classification in Arabic

Vasily Konovalov, Zhargal Tumunbayarova

Learning word embeddings for low resource languages: the case of buryat

10:30-12:00 / Section 2

Corpus Linguistics

Irina Levontina, Alexey Shmelev

The Russian aby: corpus-driven research (synchrony and diachrony)

Boris Iomdin, Anna Ambartsumian, Vasilisa Andriyanets, Ivan Levin

Lexical variation: word knowledge and polysemy in Russian everyday life lexicon

Vladimir Belikov

Obscenization of everyday and literary Russian texts based on the General Internet Corpus of Russian (18+)

12:00-12:30

Coffee-break

12:30 - 14:00 / Section 1

Document Analysis

Boris Galitsky

Discovering and assessing heated arguments at the discourse level

Nikolay Skachkov, Konstantin Vorontsov

Improving topic models with segmental structure of texts

Mariia Seleznova, Anton Belyy

Quality evaluation and improvement for hierarchical topic modeling

12:30 - 14:00 / Section 2

Corpus Linguistics

Alexander Piperski

Corpus size and the robustness of measures of corpus distance

Natalia Lukashevich, Edward Klyshinsky, Irina Kobozeva

Creating a corpus of syntactic co-occurrences for Russian

Daniil Skorinkin, Frank Fischer, German Palchikov

Building a corpus for the quantitative research of russian drama: composition, structure, case studies

14:00-15:00

Break

15:00-16:30 / Section 1

Deep Learning for Document Analysis

Marina Dubova, Anton Belyy

Framework for russian plagiarism detection using sentence embedding similarity and negative sampling

Victor Bulatov, Vasiliy Alekseev, Konstantin Vorontsov

Intra-text coherence as a measure of topic models' interpretability

Elena Tutubalina, Zulfat Miftahutdinov

Leveraging deep neural networks and semantic similarity measures for medical concept normalisation in social media posts

15:00-16:30 / Section 2

Formal Models of Language

Ekaterina Lyutikova, Sergei Tatevosov

Re-interpreting events: notes on one linguistic innovation in Russian

Anton Zimmerling

Two dialects of Russian grammar: corpus data and formal models

Daniel Tiskin

The interpretation of Russian pronouns in counteridentity contexts: a corpus study

16:30-17:00

Coffee-break

17:00 - 19:00 / Round table

BigData vs. SmallData: how to solve the issue of training data insufficiency

17:00 - 19:00 / Section 2

Formal Models of Language

Leonid Iomdin

Once again on microsyntactic constructions formed with functional words: tо i delo ‘every now and then’

Olga Pekelis

Speech act conjunction: the scale of speech act use and its manifestation in grammar

Elena Uryson

Syntax of Russian adverbial prepositions

Natalia Slioussar

Gender, declension and stem-final consonants: an experimental study of gender agreement in Russian