LLMs
-

Meta’s Llama 4, advanced multimodal models with long context
Meta released Llama 4, a new suite of AI models which offers advanced…
-

Gemma 3 matches 98% DeepSeek-R1 and runs on a single GPU or TPU
Gemma 3, Google’s latest AI model, offers multi-modal capabilities and achieves 98% of…
-

Baidu released two advanced LLMs, ERNIE 4.5 and ERNIE X1
Chinese technology giant Baidu is challenging leading AI models with its most recent…
-

SWE-RL enhances LLMs coding capabilities
Meta introduces SWE-RL, marking the first time reinforcement learning has been used to…
-

DeepSeek-R1 revolutionizes the AI landscape
The Chinese AI startup DeepSeek has made a breakthrough in AI with the…
-

Cosmos simulates physical worlds for training AI systems
NVIDIA has released the Cosmos World Foundation Model Platform, an advanced AI toolkit…
-

IBM’s Docling converts PDFs into other digital formats
Docling is an open-source, easy to use Python library designed to convert PDF…
-

Hunyuan-Large, the largest open-source Mixture of Experts model from Tencent
Tencent released Hunyuan-Large, the largest open-source Transformer-based Mixture of Experts model to date,…
-

Model Swarms enables collaboration between multiple LLMs
Model Swarms is a collaborative search algorithm inspired by the collective behavior of…
-

LightRAG, a lightweight and efficient RAG
LightRAG is a new Retrieval-Augmented Generation method that generates faster and more contextually…