A series of AI-generated selfies created using the Facetune app, with the prompt used to create the selfie written below. From left to right, the images are in the style of M.C. Escher, the Top Gun movie, and Jean-Michel Basquiat. | Image: The Verge
Like any novel technological development, text-to-image AI models are slowly moving from the R&D stage to the phase of âmaking weird stuff for social media.â
Case in point is the phenomenon of the AI-generated selfie, which splices your likeness with different artistic styles and themes using machine learning. Think of it like the next Snapchat filter: a quick way to make your face look goofier than normal. Or goofy in a different way.
There are a few methods to make your own AI-generated portrait right now, like the free web tool DrawAnyone. Just head to the site and upload some selfies, let the system process your likeness, and then pick an output style for your new portrait. A fair warning: youâll probably have to wait a few hours for the site to work through your pictures, and youâll only have access to a few free image generations before youâll be asked to pay for more.
The output, though, is impressive â and in my experience, uncannily realistic. Using AI to generate selfies feels like coming face-to-face with endless doppelgängers. The pictures look like you but not wholly like you. Sometimes they get your chin wrong, or your eyes. Or maybe you do really look like that? Who knows when an algorithm is drawing your likeness.
Below, you can see a few examples generated by DrawAnyone based on my own pictures and using (from top to bottom) the siteâs âoccult,â âsoldier,â âroyal,â and âStudio Ghibliâ styles. (Itâs not clear how exactly these categories are defined, as the âStudio Ghibliâ setting looks nothing like Studio Ghibli and more like a generic digital âanimeâ painting.)
Image: The Verge
DrawAnyoneâs âoccultâ portrait.
Image: The Verge
DrawAnyoneâs âsoldierâ portrait.
Image: The Verge
DrawAnyoneâs âroyalâ portrait.
Image: The Verge
DrawAnyoneâs âStudio Ghibliâ portrait.
DrawAnyone is the creation of software engineer Bonnie Pham and her boyfriend, and is based on open-source text-to-image AI model Stable Diffusion. âNeither of us have a background in AI but weâve managed to piece things together as we go,â Pham told The Verge over email.
Pham says she and her partner created the site using a fork of Stable Diffusion named DreamBooth (which was itself originally a method created by Google researchers to fine-tune AI art generators) before making âa lot of modifications to make it faster and to generate better results.â She notes that the pair didnât upload any additional data to train the modelâs different styles â all the output was already latent in Stable Diffusionâs system.
The use of Stable Diffusion shows how the projectâs open-source nature allows others to quickly build on its capabilities, a dynamic that its creators often tout and that is fueling the fast development of AI art. Indeed, DrawAnyone isnât the only project using Stable Diffusion in this way. App developer Lightricks, maker of the popular selfie-editing app Facetune, is also using the model to generate AI portraits. As with DrawAnyone, the feature in Facetune requires users to upload a number of selfies for the system to learn their likeness, before users then pick an output style to generate new images.
(A quick aside: both Lightricks and Pham assured The Verge that all selfies uploaded to their systems are only used to train the AI model that generates usersâ images; theyâre not used for any other data training purpose.)
Whatâs particularly interesting about Facetuneâs implementation, though, is that it shows you the prompt â or text description â used to create the output. This is notable, as it reveals exactly how these tools rely on the work of real artists to generate imagery.
Below, you can see two examples from the Facetune app based on the styles of M.C. Escher and Jean-Michel Basquiat. Compared to DrawAnyone, Iâd say the system is much less realistic and more cartoonish. Itâs able to pick up on my general features (Iâm white, male, with scruffy hair and a mustache) but it doesnât often create images that look like me.
Image: The Verge
A preset example from Facetuneâs AI selfie feature based on M.C. Escher.
Image: The Verge
A preset example from Facetuneâs AI selfie feature based on Jean-Michel Basquiat.
Other examples from Facetuneâs preset descriptions include âintricate color portrait of me in the style of Tom Bagshaw, soft smooth skin, 8k Octane beautifully detailed renderâ and âme as a gorgeous princess, professionally retouched, muted colors, soft lighting, realistic, smooth face, fully body shot, torso, dress, perfect eyes, sharp focus on eyes, 8k, high definition, insanely detailed, intricate, elegant, art by J. Scott Campbell and Artgerm.â
To be clear: Tom Bagshaw, J. Scott Campbell, and Stanley âArtgermâ Lau are all living artists. Their work was scraped from the web without their consent by the creators of Stable Diffusion in order to train this AI model. Now, Facetuneâs developer Lightricks is explicitly utilizing styles that artists honed over decades to create selfies that sell app subscriptions. The current consensus is that this is fine from a legal perspective, but itâs easy to see why many artists are angry about how their work is used to power these systems.
In the meantime, though, tools like DrawAnyone and Facetuneâs AI selfie mode will continue to proliferate. Pham says that she thinks these tools âwill become a popular way for people to share avatars of themselves across social mediaâ and a âfun alternative for people to show off another side of their personality.â