Soumya Prakash Pradhan

The Artificial Intelligence (AI) race is rapidly transforming every industry, and Alibaba has recently introduced new AI tools to support merchants on its platforms. These tools boast impressive features that are truly mind-blowing.

One of the remarkable capabilities of these AI tools is their ability to make images talk, sing, and come to life from any audio file with astonishing accuracy.

Additionally, they are equipped with image and video generation capabilities for promotional purposes.

Not only do these tools promise to save merchants a significant amount of money, but they also aim to enhance their productivity and efficiency on the e-commerce platform.

Alibaba's Emote Portrait Alive is a prime example of these advancements. This innovative technology uses artificial intelligence to animate portraits, infusing them with dynamic and nuanced emotional expressions. Users can now generate a range of emotions and showcase their creativity with ease.

According to reports, Alibaba provides a Python SDK that allows developers to seamlessly integrate Emote Portrait Alive functionalities into their applications.

The above video demonstration of the Alibaba AI tools showcases its ability to generate various facial expressions and head poses, surpassing previous methods used by studios like D-ID or HeyGen.

The research paper by Alibaba highlights the use of two attention mechanisms: Reference-Attention and Audio-Attention.

Reference-Attention ensures that the animated face retains the likeness of the person in the reference image, maintaining their identity.

On the other hand, Audio-Attention helps synchronise the movements of the animated face with the sounds or words in the audio clips.

For further details about these AI tools, you can explore their research paper here.

scrollToTop