Blog posts

2022

Automatically transcribe Russian to IPA

2 minute read

Published:

One of the main obstacles for learners of Russian is mapping Cyrillic characters to the correct phonemes. Converting Russian words into the International Phonetic Alphabet (IPA) may help, but is this task of grapheme-to-phoneme conversion as simple as it seems?

2021

My voice… but make it synthetic 🤖

5 minute read

Published:

The goal of speech synthesis, also known as text-to-speech, is to convert written texts into spoken utterances. Speech synthesis finds application in many areas of our everyday life, ranging from announcements at train stations to voice assistants in call centers.

2020

Handling imbalanced data with resampling

5 minute read

Published:

It’s not uncommon for data scientists to work with imbalanced data, i.e. data where the classes are not uniformly distributed and one or two classes make up the vast majority. In fact, most classification data is imbalanced. To name but a few examples: medical data used to diagnose a condition, fraud detection data, customer churn data, etc.

Explain your ML model: no more black boxes 🎁

7 minute read

Published:

1. What's in a black box?

The more companies rely on machine learning and big data in their work, the more they care about the interpretability of their models. This is understandable: asking questions and looking for explanations is human.
We want to know not only "What's the prediction?" but also "Why?". Interpreting ML models is therefore important, and it helps us to: