LLMs
-

Microsoft’s Differential Transformer, a new architecture for LLMs
Researchers from Microsoft and Tsinghua University introduced the Differential Transformer, which is an…
-

Alibaba released Qwen2.5 with more than 100 open-source AI models
Alibaba Cloud recently announced the release of over 100 open-sourced Qwen 2.5 multimodal…
-

LLaMA-Omni lets you speak to LLMs and get instant responses
LLaMA-Omni is an open-source AI tool designed for real-time voice interaction with large…
-

Transfusion, a multi-modal model for text and image generation
Transfusion is a multi-modal AI tool designed to handle both text and images…
-

Kotaemon is a RAG UI that lets you chat with your documents
Kotaemon is an open-source RAG (Retrieval Augmented Generation) platform that can interact with…
-

MindSearch, an open-source AI-powered search engine
MindSearch is an open-source LLM-based search engine that mimics the human cognitive processes…
-

The AI Scientist model is capable of independent scientific exploration
The AI Scientist is a comprehensive framework designed to automate the entire research…
-

Llama 3.1 from Meta, its most capable models to date
On July 23, 2024 Meta launched Llama 3.1, a collection of open-source foundation…
-

Anthropic unveils Claude 3.5 Sonnet, its most performant AI model yet
Anthropic, a leading AI research company, has released Claude 3.5 Sonnet, setting new…
-

MoRA, a high-rank strategy for enhanced fine-tuning of LLMs
MoRA (Model Rank Adaptation) is a new method designed to improve the fine-tuning…