Multi-channel Optical Vision Model

arXiv Physics Wednesday 10 June 2026, 03:49 UTC By Ali Momeni, Guillaume Noetinger, Tim Tuuva, Romain Fleury 1 min read

Key Points

Announce Type: new Abstract: Spatial multiplexing is one of the natural strengths of optics, yet in optical neural networks, it is often used mainly as parallel throughput. Here, we show that spatial multiplexing in an optical neural network can be used not only to process multiple inputs in parallel, but also to define a trainable representational coordinate of the model. In three implemented scenarios, parallel-input processing, class-code readout and channel-mixed feature interaction,...

arXiv:2606.10253v1 Announce Type: new Abstract: Spatial multiplexing is one of the natural strengths of optics, yet in optical neural networks, it is often used mainly as parallel throughput. Here, we show that spatial multiplexing in an optical neural network can be used not only to process multiple inputs in parallel, but also to define a trainable representational coordinate of the model. In three implemented scenarios, parallel-input processing, class-code readout and channel-mixed feature interaction, spatial channels act as independent learners, structured code dimensions, and interacting feature groups. The programmable free-space optical processor is trained through an online physical-forward/surrogate-backward scheme, where measured optical outputs define the forward pass while a differentiable surrogate estimates gradients and is continually fine-tuned during training from newly acquired optical data. We demonstrate these channel roles in image classification and regression tasks using multi-layer architectures with more than one million trainable optical phase parameters. We further implement a hybrid optical-electronic vision-language model, in which the optical neural network provides visual tokens to a digital transformer decoder for controlled image-captioning tasks. These results establish spatially multiplexed optical channels as a programmable feature and readout space for hybrid optical vision models.

Optical Vision Model arXiv:2606.10253v1 (ORG)

Originally published by arXiv Physics Read original →

Multi-channel Optical Vision Model

Related Stories

Scientists discover 5 million-year-old whale graveyard stretching for hundreds of miles in the Indian Ocean

Plan for hundreds of new spaces to ease Ben Nevis parking woes

Plan for hundreds of new spaces to ease Ben Nevis parking woes

Low-copper paints matched high-copper rivals, while silicone performed best against fouling