Andrey Voynov

I'm a Senior Research Scientist at Google DeeepMind.

I work on new capabilities of the foundation video and image generative models. Before joining DeepMind I worked in Google Research in Creative Camera. I also host visiting student researchers and collaborate with academic groups. I have math background and defeated my PhD from Moscow State University in 2014. Before joining Google in 2022 I worked as a Research Scientist in Yandex Research and also participated in its autonomouse car developement.

Email / Scholar / 𝕏 / Github

Research

I'm interested in computer vision and deep learning, in particular, images generative models and unsupervised learning. My current research is mostly focused on new capabilities of visual generative models for creativity. My math research was in the intersection of convex geometry and functional analysis.

	PALP: Prompt Aligned Personalization of Text-to-Image Models Moab Arar, Andrey Voynov, Amir Hertz, Omri Avrahami, Shlomi Fruchter, Yael Pritch, Daniel Cohen-Or, Ariel Shamir SIGGRAPH-Asia, 2024 project page / arXiv A diffusion model personalization is performed with the prior knowledge of the target prompt to be used for.
	ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi, Or Patashnik, Andrey Voynov, Hadar Averbuch-Elor, Daniel Cohen-Or, ECCV, 2024 project page / demo / arXiv Euler forward method is used to enable more accurate diffusion inversion.
	Curved Diffusion: A Generative Model With Optical Geometry Control Andrey Voynov, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen-Or, ECCV, 2024 project page / arXiv Diffusion model with extra camera curvature conditioning implemented with either Riemannian metric tensor, or per-pixel coordinates conditioning.
	Style aligned image generation via shared attention Amir Hertz, Andrey Voynov, Shlomi Fruchter, Daniel Cohen-Or CVPR, 2024 (Oral Presentation) project page / code / arXiv Cross-batch shared self-attention makes a diffusion model generate images with aligned styles.
	Concept decomposition for visual exploration and inspiration Yael Vinker, Andrey Voynov, Daniel Cohen-Or, Ariel Shamir SIGGRAPH-Asia, Journal track, 2023 (Best Paper Award) project page / code / arXiv Diffusion model personalization forms a binary tree of a visual concept decomposition.
	Sketch-guided text-to-image diffusion models Andrey Voynov, Kfir Aberman, Daniel Cohen-Or SIGGRAPH, 2023 project page / arXiv Small MLP performs gradient guidance over intermediate diffusion features for sketch-to-image generation.
	P+: Extended Textual Conditioning in Text-to-Image Generation Andrey Voynov, Qingyan Chu, Daniel Cohen-Or, Kfir Aberman arXiv, 2023 project page / arXiv Different prompts are injected to different cross-attention layers that majorly improves textual inversion and allows appearance mixing.
	When, Why, and Which Pretrained GANs Are Useful? Timofey Grigoryev, Andrey Voynov, Artem Babenko ICLR, 2022 arXiv / code Recall is what important for GAN initialization, and Imagenet-pretrained StyleGAN is a good choice.
	Label-efficient semantic segmentation with diffusion models Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko ICLR, 2022 arXiv / code Diffusion model intermediate features are used for few-shot segmentation.
	Object segmentation without labels with large-scale generative models Andrey Voynov, Stanislav Morozov, Artem Babenko ICML, 2021 arXiv / code Background-segmentation latent direction of BigBiGAN produces synthetic data for unsupervised foreground segmentation learning.
	Navigating the GAN parameter space for semantic image editing Anton Cherepkov, Andrey Voynov, Artem Babenko CVPR, 2021 arXiv / code Finding StyleGAN weights shifts that induces interpretable images editing.
	On Self-Supervised Image Representations for GAN Evaluation Stanislav Morozov, Andrey Voynov, Artem Babenko ICLR, 2021 (Spotlight) paper / code Self-supervised pretrained backbones are shown to be better features extractors for GANs evaluation.
	Unsupervised Discovery of Interpretable Directions in the GAN Latent Space Andrey Voynov, Artem Babenko ICML, 2020 arXiv / code An unsupervised method to find interpretable directions in a GAN latent space.
	RPGAN: GANs Interpretability via Random Routing Andrey Voynov, Artem Babenko arXiv, 2019 arXiv / code A GAN with a generator composed of a sequence of randomly-chosen layers.
Math Papers	My math research was primarly focused on functional analysis, random matrices semigroups, and convex geometry. I had a pleasure to have Vladimir Protasov as my PhD advisor. In all the papers below the authors order is alphabetical.
	Matrix semigroups with constant spectral radius Vladimir Protasov, Andrey Voynov Linear Algebra and its Applications, (513, 376-408) 2017
	Compact noncontraction semigroups of affine operators Vladimir Protasov, Andrey Voynov Sbornik: Mathematics, 206 (7), 921, 2015
	On the structure of self-affine convex bodies Andrey Voynov Sbornik: Mathematics, 204 (8), 1122, 921, 2013
	Shortest positive products of nonnegative matrices Andrey Voynov Linear Algebra and its Applications, 439 (6), 1627-1634, 2013
	Sets of nonnegative matrices without positive products Vladimir Protasov, Andrey Voynov Linear Algebra and its Applications, 437 (3), 749-765, 2012
	A counterexample to Valette’s conjecture Andrey Voynov Proceedings of the Steklov Institute of Mathematics, 275 (1), 290-292, 2011
	Self-affine polytopes. Applications to functional equations and matrix theory Andrey Voynov Sbornik: Mathematics, 202 (10), 1413, 2011
	On compact sets with a certain affine invariant Andrey Voynov Mathematical Notes, 90, 32-36, 2011

This page template is based on Jon Barron's public academic website source code.

Research

Math Papers