Annotation

This site describes the process of creating Russian Language text corpus necessary for testing algorithms of topic model. Wikinews collection licensed by Creative Commons Attribution 2.5 Generic used as a source of texts for corpus. Next, the stage of text's preprocessing and mark-up described.

Download