LLMs
-

Multimodal foundation models: the future of AI assistants
Researchers from Google AI and Hugging Face present a comprehensive survey of multimodal…
-

Falcon 180B, the most powerful open-source large language model (LLM), is now available
The Technology Innovation Institute (TII) launched Falcon 180B, a scaled-up version of…
-

Recursive summarization: store and retrieve long-term dialogue memory in large language models
A new research paper suggests that large language models (LLMs) can be taught…
-

Meta’s SeamlessM4T can translate and transcribe speech and text across nearly 100 languages
Meta launched SeamlessM4T (Massively Multilingual & Multimodal Machine Translation), a new AI model…
-

AgentBench: a new tool to evaluate LLMs as agents in interactive environments
AgentBench is a multi-dimensional benchmark that tests how well large language models (LLMs)…
-

How to train GPT-style models faster and cheaper with FlashAttention-2
FlashAttention-2 is a new method for attention computation in Transformers that outperforms the…
-

LeanDojo: a toolkit for developing and evaluating ReProver and other LLM-based theorem provers
LeanDojo is an open-source Lean playground that provides resources, data, models, and benchmarks for…
-

FinGPT: an open-source large language model for the finance sector
FinGPT is a large language model (LLM) fine-tuned on financial data…
-

MosaicML launches MPT-30B: a new open-source model that outperforms GPT-3
MosaicML, a company that provides a platform for training and deploying large language…
-

ChipGPT: a new approach for hardware design using LLMs
ChipGPT is a new scalable framework that automates the hardware design process by…