TY - JOUR
T1 - Visual Computing Unified Application Using Deep Learning and Computer Vision Techniques
AU - Sowmya, B. J.
AU - Meeradevi
AU - Seema, S.
AU - Dayananda, P.
AU - Supreeth, S.
AU - Shruthi, G.
AU - Rohith, S.
N1 - Publisher Copyright:
© 2024 by the authors of this article. Published under CC-BY.
PY - 2024
Y1 - 2024
N2 - Vision Studio aims to utilize a diverse range of modern deep learning and computer vision principles and techniques to provide a broad array of functionalities in image and video processing. Deep learning is a distinct class of machine learning algorithms that utilize multiple layers to gradually extract more advanced features from raw input. This is beneficial when using a matrix as input for pixels in a photo or frames in a video. Computer vision is a field of artificial intelligence that teaches computers to interpret and comprehend the visual domain. The main functions implemented include deepfake creation, digital aging (de-aging), image animation, and deepfake detection. Deepfake creation allows users to utilize deep learning methods, particularly autoencoders, to overlay source images onto a target video. This creates a video of the source person imitating or saying things that the target person does. Digital aging utilizes generative adversarial networks (GANs) to digitally simulate the aging process of an individual. Image animation utilizes first-order motion models to create highly realistic animations from a source image and driving video. Deepfake detection is achieved by using advanced and highly efficient convolutional neural networks (CNNs), primarily employing the EfficientNet family of models.
AB - Vision Studio aims to utilize a diverse range of modern deep learning and computer vision principles and techniques to provide a broad array of functionalities in image and video processing. Deep learning is a distinct class of machine learning algorithms that utilize multiple layers to gradually extract more advanced features from raw input. This is beneficial when using a matrix as input for pixels in a photo or frames in a video. Computer vision is a field of artificial intelligence that teaches computers to interpret and comprehend the visual domain. The main functions implemented include deepfake creation, digital aging (de-aging), image animation, and deepfake detection. Deepfake creation allows users to utilize deep learning methods, particularly autoencoders, to overlay source images onto a target video. This creates a video of the source person imitating or saying things that the target person does. Digital aging utilizes generative adversarial networks (GANs) to digitally simulate the aging process of an individual. Image animation utilizes first-order motion models to create highly realistic animations from a source image and driving video. Deepfake detection is achieved by using advanced and highly efficient convolutional neural networks (CNNs), primarily employing the EfficientNet family of models.
UR - https://www.scopus.com/pages/publications/85186548865
UR - https://www.scopus.com/pages/publications/85186548865#tab=citedBy
U2 - 10.3991/ijim.v18i01.42673
DO - 10.3991/ijim.v18i01.42673
M3 - Article
AN - SCOPUS:85186548865
SN - 1865-7923
VL - 18
SP - 59
EP - 74
JO - International Journal of Interactive Mobile Technologies
JF - International Journal of Interactive Mobile Technologies
IS - 1
ER -