Vision Transformer Model

Multi-step chestnut physical characteristics classification model based on vision transformation using a single-view RGB image

Chestnut classification is essential for improving postharvest processing efficiency and supporting large-scale commercialization; however, conventional manual sorting is labor intensive, inconsistent ...

Nature

A vision–language foundation model for precision oncology

Clinical decision-making is driven by multimodal data, including clinical notes and pathological characteristics. Artificial intelligence approaches that can effectively integrate multimodal data hold ...

EurekAlert!

Vision transformers with hierarchical attention

In the last decade, convolutional neural networks (CNNs) have been the go-to architecture in computer vision, owing to their powerful capability in learning representations from images/videos.

VentureBeat

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and ...

Design News

Cognex Advances Machine Vision with Nvidia Jetson

Cognex’s Nvidia-powered In-Sight 6900 Vision Controller offers engineers high-performance edge AI machine vision.

Forbes

Recent Advancements In Computer Vision: Transforming Perception And Applications

Computer vision continues to be one of the most dynamic and impactful fields in artificial intelligence. Thanks to breakthroughs in deep learning, architecture design and data efficiency, machines are ...

Semiconductor Engineering

Vision-Language-Action Models Arrive

The AI model type capturing the most attention across robotics and autonomous vehicles right now is the vision-language-action model, or VLA. At embedded AI conferences this year, particularly the ...

Phys.org

Super transformer aims to bring order to biology's data under one AI model

A new vision for unified AI A KAUST-led vision for artificial intelligence (AI) could help bridge that gap. An AI system that combines multiple biological data modalities into a single model has been ...

EurekAlert!

AI-powered vision model accurately estimates occluded fruit size in vertical farming systems

Accurately estimating fruit size directly on plants is essential for precision agriculture, enabling data-driven crop management and improving yield prediction. Traditional fruit detection and ...

VentureBeat

Beyond transformers: Nvidia’s MambaVision aims to unlock faster, cheaper enterprise computer vision

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Transformer-based large language models ...

Interesting Engineering

US: Los Alamos lab’s new tool detects hallucinations in machine vision models

Los Alamos researchers developed PAS, a real-time tool that helps detect false image claims in machine vision models.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results