Large Language Models (LLMs)

47 Posts

Think D̶i̶f̶f̶e̶r̶e̶n̶t̶ Small: Apple releases OpenELM, a family of smaller large language models.

Apple is thinking small — very small — with a new family of open large language models.

Benchmarks that rank large language models’ performance of industry tasks

Benchmarks for Industry: Vals AI evaluates large language models on industry-specific tasks.

How well do large language models respond to professional-level queries in various industry domains? A new company aims to find out.

Large Language Models (LLMs)

Tuning LLMs for Better RAG: Meta’s RA-DIT boosts language model output by optimizing text retrieval

Retrieval-augmented generation (RAG) enables large language models to generate better output by retrieving documents that are relevant to a user’s prompt. Fine-tuning further improves RAG performance.

Large Language Models (LLMs)

Hallucination Creates Security Holes: Researcher exposes risks in AI-generated code

Language models can generate code that erroneously points to software packages, creating vulnerabilities that attackers can exploit.

Large Language Models (LLMs)

More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback

Large language models sometimes generate false statements. New work makes them more likely to produce factual output.

Large Language Models (LLMs)

Microsoft Absorbs Inflection: Microsoft pays Inflection AI $650 Million, hires most of its staff

Microsoft took over most of the once high-flying chatbot startup Inflection AI in an unusual deal.

Large Language Models (LLMs)

Cutting the Cost of Pretrained Models: FrugalGPT, a method to cut AI costs and maintain quality

Research aims to help users select large language models that minimize expenses while maintaining quality.

Large Language Models (LLMs)

Some Models Pose Security Risk: Security flaws exposed in Hugging Face's repository and security features

Security researchers sounded the alarm about holes in Hugging Face’s platform.

Large Language Models (LLMs)

Conversational Robots: RFM-1, a model that enables robots to understand and act on human commands

Robots equipped with large language models are asking their human overseers for help.

Large Language Models (LLMs)

Schooling Language Models in Math: GOAT (Good at Arithmetic Tasks), a method to boost large language models' arithmetic abilities

Large language models are not good at math. Researchers devised a way to make them better. Tiedong Liu and Bryan Kian Hsiang Low at the National University of Singapore proposed a method to fine-tune large language models for arithmetic tasks.

Large Language Models (LLMs)

Google Releases Open Source LLMs: All we know about Google's Gemma-7B and Gemma-2B models

Google asserted its open source bona fides with new models. Google released weights for Gemma-7B, an 8.5 billion-parameter large language model intended to run GPUs, and Gemma-2B, a 2.5 billion-parameter version intended for deployment on CPUs and edge devices.

Large Language Models (LLMs)

Mistral AI Extends Its Portfolio: Mistral enhances AI landscape in Europe with Microsoft partnership and new language models.

European AI champion Mistral AI unveiled new large language models and formed an alliance with Microsoft.

Large Language Models (LLMs)

Swiss Army LLM

The combination of language models that are equipped for retrieval augmented generation can retrieve text from a database to improve their output. Further work extends this capability to retrieve information from any application that comes with an API.

Large Language Models (LLMs)

Generated Video Gets Real(er): OpenAI's Sora, a new player in text-to-video generation

OpenAI’s new video generator raises the bar for detail and realism in generated videos — but the company released few details about how it built the system.

Large Language Models (LLMs)

LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals

Most people understand that others’ mental states can differ from their own. For instance, if your friend leaves a smartphone on a table and you privately put it in your pocket, you understand that your friend continues to believe it was on the table.

Large Language Models (LLMs)

Think D̶i̶f̶f̶e̶r̶e̶n̶t̶ Small: Apple releases OpenELM, a family of smaller large language models.

Benchmarks for Industry: Vals AI evaluates large language models on industry-specific tasks.

Tuning LLMs for Better RAG: Meta’s RA-DIT boosts language model output by optimizing text retrieval

Hallucination Creates Security Holes: Researcher exposes risks in AI-generated code

More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback

Microsoft Absorbs Inflection: Microsoft pays Inflection AI $650 Million, hires most of its staff

Cutting the Cost of Pretrained Models: FrugalGPT, a method to cut AI costs and maintain quality

Some Models Pose Security Risk: Security flaws exposed in Hugging Face's repository and security features

Conversational Robots: RFM-1, a model that enables robots to understand and act on human commands

Schooling Language Models in Math: GOAT (Good at Arithmetic Tasks), a method to boost large language models' arithmetic abilities

Google Releases Open Source LLMs: All we know about Google's Gemma-7B and Gemma-2B models

Mistral AI Extends Its Portfolio: Mistral enhances AI landscape in Europe with Microsoft partnership and new language models.

Swiss Army LLM

Generated Video Gets Real(er): OpenAI's Sora, a new player in text-to-video generation

LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals

Subscribe to The Batch