Innovative Framework
Exploring cross-modal alignment and robust transfer through advanced neural network methodologies and training techniques.
Phase One
Features space analysis using orthogonal probing and gradient-based attribution to understand cross-modal and unimodal decisions in CLIP-style models.
Phase Two
Disentangled fusion training with adversarial decoders and contrastive learning objectives to enhance modality invariance and similarity across domains.