Large Language Models (LLMs)

47 Posts

Think D̶i̶f̶f̶e̶r̶e̶n̶t̶ Small: Apple releases OpenELM, a family of smaller large language models.
Large Language Models (LLMs)

Think D̶i̶f̶f̶e̶r̶e̶n̶t̶ Small: Apple releases OpenELM, a family of smaller large language models.

Apple is thinking small — very small — with a new family of open large language models.
Benchmarks that rank large language models’ performance of industry tasks
Large Language Models (LLMs)

Benchmarks for Industry: Vals AI evaluates large language models on industry-specific tasks.

How well do large language models respond to professional-level queries in various industry domains? A new company aims to find out.
Tuning LLMs for Better RAG: Meta’s RA-DIT boosts language model output by optimizing text retrieval
Large Language Models (LLMs)

Tuning LLMs for Better RAG: Meta’s RA-DIT boosts language model output by optimizing text retrieval

Retrieval-augmented generation (RAG) enables large language models to generate better output by retrieving documents that are relevant to a user’s prompt. Fine-tuning further improves RAG performance.
Hallucination Creates Security Holes: Researcher exposes risks in AI-generated code
Large Language Models (LLMs)

Hallucination Creates Security Holes: Researcher exposes risks in AI-generated code

Language models can generate code that erroneously points to software packages, creating vulnerabilities that attackers can exploit.
More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback
Large Language Models (LLMs)

More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback

Large language models sometimes generate false statements. New work makes them more likely to produce factual output.
The Inflection AI logo merging with the Microsoft logo
Large Language Models (LLMs)

Microsoft Absorbs Inflection: Microsoft pays Inflection AI $650 Million, hires most of its staff

Microsoft took over most of the once high-flying chatbot startup Inflection AI in an unusual deal.
Cutting the Cost of Pretrained Models: FrugalGPT, a method to cut AI costs and maintain quality
Large Language Models (LLMs)

Cutting the Cost of Pretrained Models: FrugalGPT, a method to cut AI costs and maintain quality

Research aims to help users select large language models that minimize expenses while maintaining quality.
Some Models Pose Security Risk: Security flaws exposed in Hugging Face's repository and security features
Large Language Models (LLMs)

Some Models Pose Security Risk: Security flaws exposed in Hugging Face's repository and security features

Security researchers sounded the alarm about holes in Hugging Face’s platform.
Conversational Robots: RFM-1, a model that enables robots to understand and act on human commands
Large Language Models (LLMs)

Conversational Robots: RFM-1, a model that enables robots to understand and act on human commands

Robots equipped with large language models are asking their human overseers for help.
Schooling Language Models in Math: GOAT (Good at Arithmetic Tasks), a method to boost large language models' arithmetic abilities
Large Language Models (LLMs)

Schooling Language Models in Math: GOAT (Good at Arithmetic Tasks), a method to boost large language models' arithmetic abilities

Large language models are not good at math. Researchers devised a way to make them better. Tiedong Liu and Bryan Kian Hsiang Low at the National University of Singapore proposed a method to fine-tune large language models for arithmetic tasks.
Google Releases Open Source LLMs: All we know about Google's Gemma-7B and Gemma-2B models
Large Language Models (LLMs)

Google Releases Open Source LLMs: All we know about Google's Gemma-7B and Gemma-2B models

Google asserted its open source bona fides with new models. Google released weights for Gemma-7B, an 8.5 billion-parameter large language model intended to run GPUs, and Gemma-2B, a 2.5 billion-parameter version intended for deployment on CPUs and edge devices.
Mistral AI Extends Its Portfolio: Mistral enhances AI landscape in Europe with Microsoft partnership and new language models.
Large Language Models (LLMs)

Mistral AI Extends Its Portfolio: Mistral enhances AI landscape in Europe with Microsoft partnership and new language models.

European AI champion Mistral AI unveiled new large language models and formed an alliance with Microsoft. 
Swiss Army LLM
Large Language Models (LLMs)

Swiss Army LLM

The combination of  language models that are equipped for retrieval augmented generation can retrieve text from a database to improve their output. Further work extends this capability to retrieve information from any application that comes with an API. 
Generated Video Gets Real(er): OpenAI's Sora, a new player in text-to-video generation
Large Language Models (LLMs)

Generated Video Gets Real(er): OpenAI's Sora, a new player in text-to-video generation

OpenAI’s new video generator raises the bar for detail and realism in generated videos — but the company released few details about how it built the system.
LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals
Large Language Models (LLMs)

LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals

Most people understand that others’ mental states can differ from their own. For instance, if your friend leaves a smartphone on a table and you privately put it in your pocket, you understand that your friend continues to believe it was on the table.
Load More

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox