Cohere for AI Releases Open-Source Aya Vision Models for Computer Vision-Based Tasks

Cohere For AI, Cohere's open research division, released new state-of-the-art (SOTA) vision models on Tuesday. Dubbed Aya Vision, the artificial intelligence (AI) models are available in two parameter sizes. The company's latest frontier models address the inconsistent performance of existing large language models (LLMs) across different languages, especially on multimodal tasks. Aya Vision models can generate outputs in 23 languages and can perform both text-based and image-based tasks; however, they cannot generate images. Cohere has made the models available via open-source repositories as well as through WhatsApp.

Cohere Releases Aya Vision AI Models

In a blog post, the AI firm detailed the new vision models. Aya Vision is available in 8B and 32B parameter sizes. These models can generate text, translate text and images across 23 languages, analyse images and answer queries about them, as well as caption images. Both models can be accessed via Cohere’s Hugging Face page and on Kaggle.

Additionally, general users can try out the models via a dedicated WhatsApp chat account. The company says the Aya Vision models are useful when people come across images or artworks they would like to learn more about.

Based on the company's internal testing, the Aya Vision 8B model outperforms the Qwen2.5-VL 7B, Gemini Flash 1.5 8B, and Llama 3.2 11B Vision models on the AyaVisionBench and m-WildVision benchmarks. Notably, the AyaVisionBench benchmark was also developed by Cohere, and its details have been made publicly available.

As for the Aya Vision 32B model, the company claimed that it outperforms the Llama 3.2 90B Vision and Qwen2-VL 72B models on the same benchmarks.

To achieve this frontier performance, Cohere said it developed several algorithmic innovations: the Aya Vision models were trained on synthetic annotations, multilingual data was scaled up through translation and rephrasing, and multiple multimodal models were merged in separate steps. The developers observed that each step significantly improved performance.

Notably, developers can access the open weights of the Aya Vision models from Kaggle and Hugging Face. However, the models are released under a Creative Commons Attribution-NonCommercial 4.0 licence, which allows academic and research use but prohibits commercial use.



