Andrey Voynov
I'm a researcher at Google in Tel Aviv, focusing on creative imagenery.
I work in Creative Camera focusing on new capabilities of generative models. I also host visiting student researchers and collaborate with academic groups.
I have math background and defeated my PhD from Moscow State University in 2014. Before joining Google in 2022
I worked as a Research Scientist in Yandex Research and also participated in its autonomouse car developement.
Email /
Scholar /
𝕏 /
Github
|
|
Research
I'm interested in computer vision and deep learning, in particular, images generative models and unsupervised learning. My current research is mostly focused on new capabilities of visual generative models for creativity.
My math research was in the intersection of convex geometry and functional analysis.
|
|
PALP: Prompt Aligned Personalization of Text-to-Image Models
Moab Arar,
Andrey Voynov,
Amir Hertz,
Omri Avrahami,
Shlomi Fruchter,
Yael Pritch,
Daniel Cohen-Or,
Ariel Shamir
SIGGRAPH-Asia, 2024
project page
/
arXiv
A diffusion model personalization is performed with the prior knowledge of the target prompt to be used for.
|
|
ReNoise: Real Image Inversion Through Iterative Noising
Daniel Garibi,
Or Patashnik,
Andrey Voynov,
Hadar Averbuch-Elor,
Daniel Cohen-Or,
ECCV, 2024
project page
/
demo
/
arXiv
Euler forward method is used to enable more accurate diffusion inversion.
|
|
Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov,
Amir Hertz,
Moab Arar,
Shlomi Fruchter,
Daniel Cohen-Or,
ECCV, 2024
project page
/
arXiv
Diffusion model with extra camera curvature conditioning implemented with either Riemannian metric tensor, or per-pixel coordinates conditioning.
|
|
Style aligned image generation via shared attention
Amir Hertz*,
Andrey Voynov*,
Shlomi Fruchter,
Daniel Cohen-Or
CVPR, 2024   (Oral Presentation)
project page
/
code
/
arXiv
Cross-batch shared self-attention makes a diffusion model generate images with aligned styles.
|
|
Concept decomposition for visual exploration and inspiration
Yael Vinker,
Andrey Voynov,
Daniel Cohen-Or,
Ariel Shamir
SIGGRAPH-Asia, Journal track, 2023   (Best Paper Award)
project page
/
code
/
arXiv
Diffusion model personalization forms a binary tree of a visual concept decomposition.
|
|
Sketch-guided text-to-image diffusion models
Andrey Voynov,
Kfir Aberman,
Daniel Cohen-Or
SIGGRAPH, 2023
project page
/
arXiv
Small MLP performs gradient guidance over intermediate diffusion features for sketch-to-image generation.
|
|
P+: Extended Textual Conditioning in Text-to-Image Generation
Andrey Voynov,
Qingyan Chu,
Daniel Cohen-Or,
Kfir Aberman
arXiv, 2023
project page
/
arXiv
Different prompts are injected to different cross-attention layers that majorly improves textual inversion and allows appearance mixing.
|
|
When, Why, and Which Pretrained GANs Are Useful?
Timofey Grigoryev*,
Andrey Voynov*,
Artem Babenko
ICLR, 2022
arXiv
/
code
Recall is what important for GAN initialization, and Imagenet-pretrained StyleGAN is a good choice.
|
|
Label-efficient semantic segmentation with diffusion models
Dmitry Baranchuk,
Ivan Rubachev,
Andrey Voynov,
Valentin Khrulkov,
Artem Babenko
ICLR, 2022
arXiv
/
code
Diffusion model intermediate features are used for few-shot segmentation.
|
|
Object segmentation without labels with large-scale generative models
Andrey Voynov,
Stanislav Morozov,
Artem Babenko
ICML, 2021
arXiv
/
code
Background-segmentation latent direction of BigBiGAN produces synthetic data for unsupervised foreground segmentation learning.
|
|
Navigating the GAN parameter space for semantic image editing
Anton Cherepkov,
Andrey Voynov,
Artem Babenko
CVPR, 2021
arXiv
/
code
Finding StyleGAN weights shifts that induces interpretable images editing.
|
|
On Self-Supervised Image Representations for GAN Evaluation
Stanislav Morozov,
Andrey Voynov,
Artem Babenko
ICLR, 2021   (Spotlight)
paper
/
code
Self-supervised pretrained backbones are shown to be better features extractors for GANs evaluation.
|
|
Unsupervised Discovery of Interpretable Directions in the GAN Latent Space
Andrey Voynov,
Artem Babenko
ICML, 2020
arXiv
/
code
An unsupervised method to find interpretable directions in a GAN latent space.
|
|
RPGAN: GANs Interpretability via Random Routing
Andrey Voynov,
Artem Babenko
arXiv, 2019
arXiv
/
code
A GAN with a generator composed of a sequence of randomly-chosen layers.
|
Math Papers
|
My math research was primarly focused on functional analysis, random matrices semigroups, and convex geometry.
I had a pleasure to have Vladimir Protasov as my PhD advisor.
In all the papers below the authors order is alphabetical.
|
|
Matrix semigroups with constant spectral radius
Vladimir Protasov,
Andrey Voynov
Linear Algebra and its Applications, (513, 376-408) 2017
|
|
Compact noncontraction semigroups of affine operators
Vladimir Protasov,
Andrey Voynov
Sbornik: Mathematics, 206 (7), 921, 2015
|
|
On the structure of self-affine convex bodies
Andrey Voynov
Sbornik: Mathematics, 204 (8), 1122, 921, 2013
|
|
Shortest positive products of nonnegative matrices
Andrey Voynov
Linear Algebra and its Applications, 439 (6), 1627-1634, 2013
|
|
Sets of nonnegative matrices without positive products
Vladimir Protasov,
Andrey Voynov
Linear Algebra and its Applications, 437 (3), 749-765, 2012
|
|
A counterexample to Valette’s conjecture
Andrey Voynov
Proceedings of the Steklov Institute of Mathematics, 275 (1), 290-292, 2011
|
|
Self-affine polytopes. Applications to functional equations and matrix theory
Andrey Voynov
Sbornik: Mathematics, 202 (10), 1413, 2011
|
|
On compact sets with a certain affine invariant
Andrey Voynov
Mathematical Notes, 90, 32-36, 2011
|
This page template is based on Jon Barron's public academic website source code.
|
|