AI THAT CAN MAKE ANYONE DANCE
The introduction of Generative Adversarial Networks (GANs) into the wild world of AI is the perfect example of one of those game-changers.
GANs function by approximating generative models. At a high level, GANs is an approach to machine learning involving two algorithms – a “generator” and a “discriminator” – that are pitted against each other to improve the abilities of the overall system.
Let’s dive into exactly how this works:
- What is the “Do as I do” AI Project?
- How Does This AI Dancing Process Work?
- Differentiating Between AI, Deep Learning & Machine Learning
- Where Did GANS Originate?
- Future Applications of GANs
What is the “Do as I do” AI Project?
Venturing into a new era of GANs, researchers from UC Berkeley published an amazing work on motion retargeting that can make fake videos showing amateurs dancing like professional dancers.
They used custom GANs long with other video generation and smoothening techniques to transpose dance moves to amateurs with no advanced skills from a source video.
Video source: Caroline Chan
This is just the beginning of what’s possible with Generative Adversarial Networks (GANs).
How Does This AI Dancing Process Work?
In the research paper, the authors state:
“With our framework, we create a variety of videos, enabling untrained amateurs to spin and twirl like ballerinas, perform martial arts kicks or dance as vibrantly as pop stars.”
AI dancing is the process of taking a video of a professional dancer (Dancer A). Deep learning video generation detects the pose of dancer A and transposes this onto a source video of an amateur dancer (Dancer B).
Essentially, making anyone a pro dancer on video through machine learning and deep learning applications.
Differentiating Between AI, Deep Learning & Machine Learning
Artificial Intelligence (AI)
Artificial intelligence is the general category that all of these terms belong to. It’s a reference to any intelligence performed by a machine that results in an optimal or suboptimal resolution to a problem.
Deep Learning: The Human Connection
Inspired by the function and structure of the brain (specifically the networks of interconnected neurons) – deep learning features algorithms that mimic the biological structure of the brain.
Machine Learning: Cognition Signs
Machine learning is the use of algorithms to interpret data, draw conclusions from it, then go on to make predictions or determinations about something in the world.
Where Did GANS Originate?
What started as a night out with fellow students in 2014 ended with the development of a whole new avenue that would shape the future of artificial intelligence.
While Ian Goodfellow celebrated the recent graduation of a fellow doctoral graduate, he was asked by a peer to help him with a project that would enable a computer to create photos independently.
At that time, technology that used neural networks did exist – where algorithms were modeled similarly to neurons in the brain – to create “generative” machines able to produce plausible data independently. But the results of this process were still very primitive and prone to errors like blurred faces or missing ears.
To find a solution for the project the other students were working on, massive amounts of number crunching would be required, which would not work.
Goodfellow returned home and pondered on this challenge which led to an idea. He wondered what would happen if you pitted two neural network systems against each other, essentially competing to solve the problem.
So, he gave it a shot and coded his software.
It worked the very first time. And the concept of Generative Adversarial Networks (GANs) was born.
Taking these methods a step further, the “do as I do” motion transfer system performs video generation while preserving intricate details like the subjects’ faces.
Recent advancements in image generation and general image mapping frameworks have laid the groundwork and enabled the authors to create a mapping from pose to target subject.
Many more purposes exist for GANs, especially in image generation because of their remarkable ability to replicate detailed, high-quality images.
Future Applications of GANs
Room for Improvement
Upon observation, you will notice that the subject dancer isn’t perfect. You will catch some waviness and choppy motions.
Nonetheless, “Everyone Can Dance” is a shining example of what modern AI is capable of. Some fine-tuning and a little time behind this development will render methods indistinguishable from the real subjects.
Causes for Concern
Obvious scenarios render the concern of GANs and motion transfer technology being used to implicate people in false scenarios.
In recent years, Reddit cracked down on deepfake pornography and the political sphere is concerned about false information spreading via fake videos. Furthermore, down to a very fundamental level, endless scenarios exist as to how these methods can cause interpersonal conflicts.
Future Developments
Many applications of machine learning have been facilitated by the emergence of deep learning methods. Collectively, these improvements are advancing the AI field.
The technological ability to break down tasks into ways that utilize various machine assists will facilitate future developments, including exciting developments like:
- Driverless cars
- Better preventative healthcare
- Robot driven surgical procedures
- AI Enabled Surgical Safety Tools
Researchers are even asking questions about how GANS and motion re-targeting can be exploited to solve some healthcare problems.
For instance: Can this technology be transferred to perform motion re-targeting from an expert surgeon to a novice surgeon?
We believe the technologies needed to codify an expert surgeon’s proficiency into an AI system are here and we will see them in action soon. Such tech will empower novice surgeons and provide care at the same level as a skilled expert with years of experience.
Needless to say the AI spectrum is evolving at lightning speed and there’s no telling how far this is going to take us and how fast.
Remember: this is only the beginning.
“Deep Learning is a superpower. With it, you can make a computer see, synthesize novel art, translate languages, render a medical diagnosis, or build pieces of a car that can drive itself. If that isn’t a superpower, I don’t know what is.” – Andrew Ng, Founder of deeplearning.ai
RediMinds are working on numerous exciting AI projects right now, helping businesses and healthcare provides change the face of their industries.
Stay up to date with our latest projects and exciting AI news by subscribing to our monthly newsletter.