Visualizing Audio Spectrogram

Object-Aware Image Augmentation for Audio-Visual Zero-Shot Learning

Abstract: Audio-visual zero-shot learning (ZSL) leverages both video and audio information for model training, aiming to classify new video categories that were not seen during the training. However, ...

Indian Defence Review

Underwater Camera Records First-Ever Wild Fish Sounds Across 46 Species

Fish have been known to make sounds for over two millennia, yet much of this underwater world has remained acoustically ...

12don MSN

2025 in visual storytelling

Explore some favorite visual stories of designers, developers and art directors from The Washington Post’s Design, Graphics and Opinions teams.

IEEE

On Explainable Closed-Set Source Device Identification Using Log-Mel Spectrograms From Videos’ Audio: A Grad-CAM Approach

Abstract: Source Device Identification (SDI) is pivotal in multimedia forensics, as it entails the recognition of the device that captured a specific image or video. This paper introduces an ...

Scientific Research Publishing

Multimodal Digital Phenotyping for Bipolar Disorder: Robust Mood-State Classification and Early Relapse Risk Monitoring ()

Bipolar Disorder, Digital Phenotyping, Multimodal Learning, Face/Voice/Phone, Mood Classification, Relapse Prediction, T-SNE, Ablation Share and Cite: de Filippis, R. and Al Foysal, A. (2025) ...

GitHub

Show inaccessible results

Object-Aware Image Augmentation for Audio-Visual Zero-Shot Learning

Underwater Camera Records First-Ever Wild Fish Sounds Across 46 Species

2025 in visual storytelling

On Explainable Closed-Set Source Device Identification Using Log-Mel Spectrograms From Videos’ Audio: A Grad-CAM Approach

Multimodal Digital Phenotyping for Bipolar Disorder: Robust Mood-State Classification and Early Relapse Risk Monitoring ()

Audio-Visual Instance Segmentation

Deepfake Scams Are Exploding: Essential Detection Tips and AI Scam Prevention You Need Now

aliahmad552/music-genre-classification