LLMs

Meta’s Llama 4, advanced multimodal models with long context

Meta released Llama 4, a new suite of AI models which offers advanced…

April 10, 2025
Gemma 3 matches 98% DeepSeek-R1 and runs on a single GPU or TPU

Gemma 3, Google’s latest AI model, offers multi-modal capabilities and achieves 98% of…

March 26, 2025
Baidu released two advanced LLMs, ERNIE 4.5 and ERNIE X1

Chinese technology giant Baidu is challenging leading AI models with its most recent…

March 21, 2025
SWE-RL enhances LLMs coding capabilities

Meta introduces SWE-RL, marking the first time reinforcement learning has been used to…

March 18, 2025
DeepSeek-R1 revolutionizes the AI landscape

The Chinese AI startup DeepSeek has made a breakthrough in AI with the…

February 21, 2025
Cosmos simulates physical worlds for training AI systems

NVIDIA has released the Cosmos World Foundation Model Platform, an advanced AI toolkit…

February 3, 2025
IBM’s Docling converts PDFs into other digital formats

Docling is an open-source, easy to use Python library designed to convert PDF…

December 2, 2024
Hunyuan-Large, the largest open-source Mixture of Experts model from Tencent

Tencent released Hunyuan-Large, the largest open-source Transformer-based Mixture of Experts model to date,…

November 26, 2024
Model Swarms enables collaboration between multiple LLMs

Model Swarms is a collaborative search algorithm inspired by the collective behavior of…

November 13, 2024
LightRAG, a lightweight and efficient RAG

LightRAG is a new Retrieval-Augmented Generation method that generates faster and more contextually…

October 28, 2024