LLMs
-
How to train GPT-style models faster and cheaper with FlashAttention-2
FlashAttention-2 is a new method for attention computation in Transformers that outperforms the…
-
LeanDojo: a toolkit for developing and evaluating ReProver and other LLM-based theorem provers
LeanDojo is an open-source Lean playground that provides resources, data, models, and benchmarks for…
-
FinGPT: an open-source large language model for the finance sector
FinGPT is a large language model (LLM) that is fine-tuned with financial data…
-
MosaicML launches MPT-30B: a new open-source model that outperforms GPT-3
MosaicML, a company that provides a platform for training and deploying large language…
-
ChipGPT: a new approach for hardware design using LLMs
ChipGPT is a new scalable framework that automates the hardware design process by…
-
Gorilla: a Large Language Model connected with over 1,600 APIs
Gorilla is a fine-tuned LLaMA-based model that can interact with more than 1,600…
-
OlaGPT boosts LLMs with human-like cognitive modules and intelligent components
OlaGPT is a new framework that aims to enhance the problem-solving abilities of…
-
Meta AI releases MEGABYTE, a novel AI architecture to predict million-byte sequences without tokenization
Meta AI releases MEGABYTE (Multiscale Encoder-Generator BYte Transformer), a new powerful AI model that can…
-
Bot or Human? How to detect a ChatGPT Bot with one simple question
FLAIR (Finding Large language model Authenticity via a single Inquiry and Response) is…
-
Stability AI launched StableVicuna, the first open-source chatbot based on human feedback
Stability AI has introduced StableVicuna, the first large-scale open-source chatbot that has been…