In this short blog post, I want to share my experience using two remarkable AI technologies: Automatic 1111 GUI and Eleven Labs speech tech. These tools can help you create stunning visual and audio content with minimal effort and cost.
Automatic 1111 GUI is a browser interface for Stable Diffusion, a deep learning model that can generate realistic images from text descriptions. You can use it to create anything from surreal paintings to photorealistic portraits, just by typing what you want to see. You can also edit existing images by sketching, inpainting, outpainting or upscaling them. Automatic 1111 GUI lets you choose from different models, adjust various parameters, and merge checkpoints. I have been using it to create all my recent blog post images.
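For anyone who would rather script image generation than type prompts into the browser, here is a minimal sketch in Python. It assumes the web UI is running locally on its default port and was started with its `--api` option, which exposes a `/sdapi/v1/txt2img` endpoint. The function names, the example values and the output file name are my own illustrations, not part of the tool, so treat this as a starting point rather than a definitive recipe.

```python
import base64
import json
import urllib.request

# Assumption: the web UI runs locally on the default port with the API enabled.
API_URL = "http://127.0.0.1:7860/sdapi/v1/txt2img"


def build_txt2img_request(prompt, negative_prompt="", steps=20, width=512, height=512):
    """Build (but do not send) a text-to-image request for the local web UI."""
    payload = {
        "prompt": prompt,                    # what you want to see
        "negative_prompt": negative_prompt,  # what you want to avoid
        "steps": steps,                      # number of sampling steps
        "width": width,
        "height": height,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate_image(prompt, out_path="output.png"):
    """Send the request and save the first returned image to out_path."""
    with urllib.request.urlopen(build_txt2img_request(prompt)) as response:
        result = json.load(response)
    # The API returns each image as a base64-encoded string.
    with open(out_path, "wb") as f:
        f.write(base64.b64decode(result["images"][0]))
    return out_path
```

With the web UI running, calling `generate_image("a surreal painting of a lighthouse at sunset")` should write the generated picture to `output.png`. Building the request is kept separate from sending it so the payload can be inspected without a running server.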
Eleven Labs is a voice technology research company that develops compelling AI speech software for publishers and creators. Their Prime Voice AI platform can convert text to speech in a wide range of voices and emotions with impressive fidelity and context awareness. You can use it to voice news articles, newsletters, blogs, audiobooks, videos or games. You can also clone voices from surprisingly small samples or create entirely new synthetic voices from scratch. The potential for authentic storytelling is significant, although it does not yet support the nuance of, say, Stephen Fry reading Harry Potter. However, with a little effort and the creation of custom voices, I would imagine we are not that far away.
It’s truly amazing how these technologies can work together to create uber-personalization for an audience. I used Automatic 1111 GUI to generate the images on my blog. It’s crazy what you can do with it. For example, these images come from a model I trained on some pictures of myself, and if I ever wanted to know what I might look like as a woman, well, now I know! The important thing to realize here is that they are not “photoshopped”; they are totally computer generated.
I used Eleven Labs speech tech to clone my own voice from a 15-second recording. It’s a bit bizarre to hear yourself say something you have never said! But the potential for this is enormous. As a dyslexic person who does not find reading a pleasure, I can see a host of use cases. For example, in the games industry, you could have a game's narration spoken out loud in your own voice. Many AAA games have voice actors for large amounts of text, but the quests are still written and require the player to read. I have also experimented with having my blog read in my own voice, and the result was a personalized audio experience that sounded surprisingly like me. It is, of course, possible to bring Automatic 1111 GUI and Eleven Labs tech together, along with some animation, to create a talking, computer-generated version of yourself.
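Having a blog post read aloud in a cloned voice can also be scripted. The sketch below reflects my understanding of the Eleven Labs REST API at the time of writing: a POST to `/v1/text-to-speech/{voice_id}` with the API key in an `xi-api-key` header, which returns the audio as MP3 data. The voice ID, API key, function names and the `voice_settings` values shown are placeholders and assumptions, so check the official documentation before relying on them.

```python
import json
import urllib.request

# Assumption: public text-to-speech endpoint; the voice ID is appended to it.
API_BASE = "https://api.elevenlabs.io/v1/text-to-speech"


def build_tts_request(text, voice_id, api_key, stability=0.5, similarity_boost=0.75):
    """Build (but do not send) a text-to-speech request for one voice."""
    payload = {
        "text": text,
        # Illustrative values; tune them to taste for your own voice.
        "voice_settings": {
            "stability": stability,
            "similarity_boost": similarity_boost,
        },
    }
    return urllib.request.Request(
        f"{API_BASE}/{voice_id}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "xi-api-key": api_key,
            "Content-Type": "application/json",
            "Accept": "audio/mpeg",
        },
        method="POST",
    )


def speak_to_file(text, voice_id, api_key, out_path="post.mp3"):
    """Send the request and save the returned audio bytes to out_path."""
    with urllib.request.urlopen(build_tts_request(text, voice_id, api_key)) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

Given a valid key and the ID of your cloned voice, `speak_to_file(blog_text, voice_id, api_key)` should save the narration as `post.mp3`, where `blog_text` is simply the text of the post as a string.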
Eleven Labs and Automatic 1111 GUI have the potential to be game-changers in the B2B, B2C, and B2B4C spaces, transforming industries in ways we’ve yet to fully imagine. As someone who began my career around the same time as the internet was emerging, I’ve witnessed first-hand the transformative power of technology. AI is already revolutionizing the way we work and create, with machine learning driving ever-faster improvements.
Businesses of all sizes need to embrace these new technologies and transition to an AI-powered future. Just as everyone was building a website in the early days of the internet, I believe that soon everyone will be shifting to an AI-powered agent. These agents can work 24/7, with the potential to deliver a 10x productivity boost for businesses of all sizes. And for those working in marketing who want to create engaging content without spending too much time or money on production, these technologies are a must-have.
At Curious Cognition, we help businesses understand how AI technologies can be applied to create game-changing products and services and support a 10x productivity increase. If you’re interested in trying Automatic 1111 GUI or Eleven Labs, visit their websites, or contact me to schedule a call for an overview of what’s possible.
In my next article, I’ll be sharing my experiments with an open-source project called Auto-GPT, as well as two products, LangChain and Pinecone, which make the creation of intelligent agents possible today. Stay tuned for more insights on how AI is changing the game!