Home Knowledge Base Personalized Visual Extraction and

Personalized Visual Extraction and

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Teach Multimodal Recommendation Model to See via Personalized Visual Extraction and Adaptive Learning

Announce Type: new Abstract: Multimodal sequential recommendation (MSR) incorporates textual and visual information to improve recommendation quality. However, recent studies and our empirical analysis show that visual features are often underutilized, thereby contributing far less than textual signals. We attribute this issue to two factors: insufficient visual representation learning (pretrained encoders fail to capture preference-relevant cues) and unbalanced visual-text optimization...

arXiv CS 1d ago

UniVerse: A Unified Modulation Framework for Segmentation-Free,Disentangled Multi-Concept Personalization

Announce Type: replace Abstract: Personalized visual understanding has advanced significantly, yet existing approaches struggle to localize and extract specific concepts when input images contain multiple objects. Many prior methods rely heavily on segmentation-based supervision or exhibit poor compositional generalization, limiting their ability to accurately disentangle and manipulate individual concepts. In this work, we propose UniVerse, a Unified Modulation Framework for...

arXiv CS 7d ago

MaskClaw: Edge-Side Personalized Privacy Arbitration for GUI Agents with Behavior-Driven Skill Evolution

arXiv:2605.28646v2 Announce Type: replace Abstract: GUI agents rely on screenshots to infer intent and operate across applications, but these screenshots often contain private messages, medical records, payment credentials, and workplace-specific workflows. Privacy decisions in this setting depend on task, recipient, application state, and user role, yet static PII detectors miss these boundaries and cloud-side VLM reasoning can upload the raw screen before deciding what should be protected....

arXiv CS 9d ago

Personalize Your Large Vision-language Models With In-context Prompt Tuning

arXiv:2605.31513v1 Announce Type: new Abstract: Large vision-language models (LVLMs) have demonstrated strong general multimodal capability and are increasingly deployed in downstream systems. This trend has driven growing interest in LVLM personalization, which aims to enable models to quickly and effectively learn out-of-distribution multimodal concepts to meet user-specific needs. However, many existing methods rely on inference-time training, which reduces efficiency.

arXiv CS 9d ago

Human-Like Neural Nets by Catapulting

Human-like Neural Nets by Catapulting Speculative proposal to create artificial neural nets with human-like performance by high-learning-rate/regularization training of overparameterized NNs to trigger catapulting/grokking. Over-parameterization as a route to true generalization would resolve many outstanding mysteries of artificial versus natural intelligence. There are many mysteries about deep learning and human intelligence, but we could describe the biggest anomaly this way: why are...

Hacker News 3d ago

How Turkey Hacked the Hair Transplant Industry

The astounding growth of the hair-transplant industry in Turkey is not just a medical tourism success story; it’s also a tale of “hacked” medical equipment and algorithmic craftsmanship. From a biological and evolutionary perspective, human hair is often viewed as an unremarkable mass of keratin that still plays some important functions—protecting our scalps from the sun’s harmful ultraviolet rays and regulating our body temperatures—but, for the most part, is no longer essential to our...

Wired 10d ago

Ask HN: What are tools you have made for yourself since the advent of AI?

I've made a number of ceramic molds for slumping fused glass into bowls. As well as wooden templates for ceramic mugs. I've devised a few carrying tools to move glass frit paintings from my studio down to my barn where the kilns sit without spilling the glass.

Hacker News 1d ago

PHAF-Personalized Hand Avatars in a Flash

arXiv:2606.03420v1 Announce Type: new Abstract: We present PHAF-Personalized Hand Avatars in a Flash, a personalized photo-realistic hand avatar which provides high quality multi-view renders from just two images (dorsal and palmar views).Unlike slow optimization-based techniques, PHAF generates fast personalized textures for real-time deployment on edge devices. Our approach combines semantic guided mesh alignment and densified texture extraction to transfer high-frequency details efficiently.

arXiv CS 7d ago

Personalized 3D Myocardial Infarct Geometry Reconstruction from Cine MRI for Cardiac Digital Twins

arXiv:2606.01808v1 Announce Type: new Abstract: Accurate 3D geometric characterization of myocardial infarction (MI) is essential for building cardiac digital twins (CDTs) to precisely simulate infarct-related electrophysiology. Late gadolinium enhancement magnetic resonance imaging (LGE MRI) is the clinical reference for locating MI, yet its reliance on contrast agents restricts use in renally impaired patients and limits longitudinal follow-ups. As an alternative, contrast-free cine MRI...

arXiv CS 8d ago

DaVinci Resolve 21

DaVinci Resolve 21 introduces the Photo page, bringing Hollywood's most advanced color tools to still photography! A new generation of AI tools let you search media by content, read slate data, perform de-aging, blemish removal and more. The Edit and Cut pages have improved keyframing and greater graphic format support.

Hacker News 7d ago