INtegrated Discriminative

Related Articles from SNS

MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models

arXiv:2512.05530v2. Abstract: Recently, multimodal large language models (MLLMs) have been widely applied to reasoning tasks. However, they suffer from limited multi-rationale semantic modeling, insufficient logical robustness, and susceptibility to misleading cues.

arXiv CS 7d ago

Learning Emotion-discriminative Representations for Zero-Shot Cross-lingual Speech Emotion Recognition

Abstract: Zero-shot cross-lingual speech emotion recognition (SER) remains challenging due to distribution mismatches across languages and the lack of emotion annotations in the target language. Under such conditions, models trained solely on source-language data frequently suffer from degraded generalization when evaluated on unseen target languages. To address this limitation, we propose an emotion-discriminative representation learning method that integrates supervised...

arXiv CS 5d ago

Aqua Boundary-Saliency Attention Module for Lightweight Underwater Salient Instance Segmentation Detection Transformer

arXiv:2606.08002v1. Abstract: Underwater instance segmentation integrates pixel-level mask prediction and instance-level discrimination for marine resource exploration, ecological monitoring, and underwater robotic perception. Recent prompt-based and auxiliary-modality methods improve mask quality, but their reliance on large foundation models, prompt generation, or extra modality estimation complicates efficient deployment. This work introduces Lightweight Underwater...

arXiv CS 1d ago

Single-cell multimodal profiling of pan-cancer cell lines uncovers gene regulatory principles underlying intrinsic cell states and environmental features

Cancer arises from extensive genetic and epigenetic alterations that reshape chromatin, transcriptional regulation, and malignant cell states. To systematically chart cancer-intrinsic regulatory programs, we constructed a pan-cancer single-cell transcriptomic and epigenomic atlas encompassing 60 human cell lines representing 16 tissue origins and 20 cancer types, comprising 240,957 single-nucleus RNA-seq and 223,347 single-nucleus ATAC-seq profiles. Integrative analyses revealed extensive...

bioRxiv 7d ago

Self-supervised Feature Disentanglement and Augmentation Network for One-class Face Anti-spoofing

arXiv:2503.22929v3. Abstract: Face anti-spoofing (FAS) techniques aim to enhance the security of facial identity authentication by distinguishing authentic live faces from deceptive attempts. While two-class FAS methods risk overfitting to training attacks to achieve better performance, one-class FAS approaches handle unseen attacks well but are less robust to domain information entangled within the liveness features. To address this, we propose an Unsupervised Feature...

arXiv CS 5d ago

Geometric Second-Order Feature Correlation Learning for Self-Supervised Speech Emotion Recognition

Abstract: Self-supervised learning (SSL) yields powerful, context-rich representations for speech emotion recognition (SER), yet aggregating these representations into holistic descriptors remains a bottleneck. Conventional first-order aggregation implicitly assumes feature independence, which overlooks the latent Riemannian geometry and discards higher-order relationships essential to the representational power of the backbone. To address this problem, this paper proposes...

arXiv CS 2d ago

Preference-Aware Rubric Learning for Personalized Evaluation

Abstract: As Large Language Models (LLMs) evolve from general-purpose assistants to user-centric agents, personalization has become central to aligning model behavior with individual preferences, making the evaluation of personalized alignment a critical bottleneck. Existing evaluation methods, ranging from automatic metrics to LLM-as-a-judge approaches, fail to capture subjective, user-specific preferences embedded in long-term interaction histories.

arXiv CS 9d ago

VidMsg: A Benchmark for Implicit Message Inference in Short Videos

Abstract: Understanding short online videos involves more than identifying visible objects and actions; video makers often include an underlying message or purpose in the clip. We introduce VidMsg, a benchmark for evaluating implicit message understanding in short, internet-native video clips. VidMsg contains 400 YouTube-derived clips across 9 practical topic areas and 52 fine-grained target messages, covering domains such as career and finance, education, health and...

arXiv CS 7d ago

SSRLive: Live Streaming Recommendation with Dynamic Semantic ID

Abstract: Live streaming has emerged as one of the fastest-growing forms of online media, enabling instant content broadcasting and real-time engagement between users and streamers. Despite the effectiveness of existing recommendation algorithms in this domain, they often suffer from limited utilization of computational resources, with low FLOPs that hinder further performance enhancement. Generative recommendation techniques, which have gained traction in various...

arXiv CS 2d ago

SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning

arXiv:2606.04493v1. Abstract: Correspondence pruning aims to identify inliers from an initial set of correspondences. Most existing Graph Neural Network (GNN)-based methods rely on geometric features mapped from coarse Euclidean coordinates, which struggle to capture the subtle geometric consistencies presented by inliers. While Mamba-based methods possess global receptive fields and long sequence modeling capabilities, they tend to accumulate substantial inconsistent...

arXiv CS 6d ago