Andrey Voynov

I'm a Senior Research Scientist at Google DeeepMind.

I work on new capabilities of the foundation video and image generative models. Before joining DeepMind I worked in Google Research in Creative Camera. I also host visiting student researchers and collaborate with academic groups. I have math background and defeated my PhD from Moscow State University in 2014. Before joining Google in 2022 I worked as a Research Scientist in Yandex Research and also participated in its autonomouse car developement.

Email  /  Scholar  /  𝕏  /  Github

profile photo

Research

I'm interested in computer vision and deep learning, in particular, images generative models and unsupervised learning. My current research is mostly focused on new capabilities of visual generative models for creativity. My math research was in the intersection of convex geometry and functional analysis.


clean-usnob Navigating with Annealing Guidance Scale in Diffusion Space
Shai Yehezkel, Omer Dahary, Andrey Voynov, Daniel Cohen-Or,
SIGGRAPH-Asia, 2025
project page / arXiv

A very small MLP dynamically adjusts CFG over denoising steps for better generation.

clean-usnob PALP: Prompt Aligned Personalization of Text-to-Image Models
Moab Arar, Andrey Voynov, Amir Hertz, Omri Avrahami, Shlomi Fruchter, Yael Pritch, Daniel Cohen-Or, Ariel Shamir
SIGGRAPH-Asia, 2024
project page / arXiv

A diffusion model personalization is performed with the prior knowledge of the target prompt to be used for.

clean-usnob ReNoise: Real Image Inversion Through Iterative Noising
Daniel Garibi, Or Patashnik, Andrey Voynov, Hadar Averbuch-Elor, Daniel Cohen-Or,
ECCV, 2024
project page / demo / arXiv

Euler forward method is used to enable more accurate diffusion inversion.

clean-usnob Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen-Or,
ECCV, 2024
project page / arXiv

Diffusion model with extra camera curvature conditioning implemented with either Riemannian metric tensor, or per-pixel coordinates conditioning.

clean-usnob Style aligned image generation via shared attention
Amir Hertz*, Andrey Voynov*, Shlomi Fruchter, Daniel Cohen-Or
CVPR, 2024   (Oral Presentation)
project page / code / arXiv

Cross-batch shared self-attention makes a diffusion model generate images with aligned styles.

clean-usnob Concept decomposition for visual exploration and inspiration
Yael Vinker, Andrey Voynov, Daniel Cohen-Or, Ariel Shamir
SIGGRAPH-Asia, Journal track, 2023   (Best Paper Award)
project page / code / arXiv

Diffusion model personalization forms a binary tree of a visual concept decomposition.

clean-usnob Sketch-guided text-to-image diffusion models
Andrey Voynov, Kfir Aberman, Daniel Cohen-Or
SIGGRAPH, 2023
project page / arXiv

Small MLP performs gradient guidance over intermediate diffusion features for sketch-to-image generation.

clean-usnob P+: Extended Textual Conditioning in Text-to-Image Generation
Andrey Voynov, Qingyan Chu, Daniel Cohen-Or, Kfir Aberman
arXiv, 2023
project page / arXiv

Different prompts are injected to different cross-attention layers that majorly improves textual inversion and allows appearance mixing.

clean-usnob When, Why, and Which Pretrained GANs Are Useful?
Timofey Grigoryev*, Andrey Voynov*, Artem Babenko
ICLR, 2022
arXiv / code

Recall is what important for GAN initialization, and Imagenet-pretrained StyleGAN is a good choice.

clean-usnob Label-efficient semantic segmentation with diffusion models
Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko
ICLR, 2022
arXiv / code

Diffusion model intermediate features are used for few-shot segmentation.

clean-usnob Object segmentation without labels with large-scale generative models
Andrey Voynov, Stanislav Morozov, Artem Babenko
ICML, 2021
arXiv / code

Background-segmentation latent direction of BigBiGAN produces synthetic data for unsupervised foreground segmentation learning.

clean-usnob Navigating the GAN parameter space for semantic image editing
Anton Cherepkov, Andrey Voynov, Artem Babenko
CVPR, 2021
arXiv / code

Finding StyleGAN weights shifts that induces interpretable images editing.

clean-usnob On Self-Supervised Image Representations for GAN Evaluation
Stanislav Morozov, Andrey Voynov, Artem Babenko
ICLR, 2021   (Spotlight)
paper / code

Self-supervised pretrained backbones are shown to be better features extractors for GANs evaluation.

clean-usnob Unsupervised Discovery of Interpretable Directions in the GAN Latent Space
Andrey Voynov, Artem Babenko
ICML, 2020
arXiv / code

An unsupervised method to find interpretable directions in a GAN latent space.

clean-usnob RPGAN: GANs Interpretability via Random Routing
Andrey Voynov, Artem Babenko
arXiv, 2019
arXiv / code

A GAN with a generator composed of a sequence of randomly-chosen layers.

Math Papers

My math research was primarly focused on functional analysis, random matrices semigroups, and convex geometry. I had a pleasure to have Vladimir Protasov as my PhD advisor. In all the papers below the authors order is alphabetical.

Matrix semigroups with constant spectral radius
Vladimir Protasov, Andrey Voynov
Linear Algebra and its Applications, (513, 376-408) 2017
Compact noncontraction semigroups of affine operators
Vladimir Protasov, Andrey Voynov
Sbornik: Mathematics, 206 (7), 921, 2015
On the structure of self-affine convex bodies
Andrey Voynov
Sbornik: Mathematics, 204 (8), 1122, 921, 2013
Shortest positive products of nonnegative matrices
Andrey Voynov
Linear Algebra and its Applications, 439 (6), 1627-1634, 2013
Sets of nonnegative matrices without positive products
Vladimir Protasov, Andrey Voynov
Linear Algebra and its Applications, 437 (3), 749-765, 2012
A counterexample to Valette’s conjecture
Andrey Voynov
Proceedings of the Steklov Institute of Mathematics, 275 (1), 290-292, 2011
Self-affine polytopes. Applications to functional equations and matrix theory
Andrey Voynov
Sbornik: Mathematics, 202 (10), 1413, 2011
On compact sets with a certain affine invariant
Andrey Voynov
Mathematical Notes, 90, 32-36, 2011

This page template is based on Jon Barron's public academic website source code.