Vladimir Orshulevich is an NLP research engineer at Unum. He used to work as a Research Engineer in SberDevices.
Since multimodality became popular, lots of engineer are trying to make domain-universal search. The search engines that will find in image by textual query, the HTML file by piece of audio and so on. So here is our (Unum) approach with a bias towards GPU accelerating inference (for underlying models) and passion to distribute everything.