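Tiny startup Arcee AI builds a 400B-parameter open source large language model from scratch, aiming to surpass Meta's Llama

TechCrunch

About 1 month ago

AI-generated summary

Arcee AI, a startup with 30 employees, has released a 400B-parameter open source foundation model called Trinity, aiming to challenge incumbents such as Meta's Llama with its comprehensive capabilities and permissive Apache license.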

Tiny startup Arcee AI built a 400B open source LLM from scratch to best Meta’s Llama

Many in the industry think the winners of the AI model market have already been decided: Big Tech will own it (Google, Meta, Microsoft, a bit of Amazon) along with their model makers of choice, largely OpenAI and Anthropic.

But tiny 30-person startup Arcee AI disagrees. The company just released a truly and permanently open (Apache license) general-purpose foundation model called Trinity, and Arcee claims that at 400B parameters, it is among the largest open-source foundation models ever trained and released by a U.S. company.

Arcee says Trinity is comparable to Meta’s Llama 4 Maverick 400B and Z.ai GLM-4.5, a high-performing open-source model from China’s Tsinghua University, according to benchmark tests conducted using base models (with very little post-training).

Like other state-of-the-art (SOTA) models, Trinity is geared for coding and multi-step processes like agents. Still, despite its size, it’s not a true SOTA competitor yet because it currently supports only text.

More modes are in the works — a vision model is currently in development, and a speech-to-text version is on the roadmap, CTO Lucas Atkins told TechCrunch (pictured above, on the left). In comparison, Meta’s Llama 4 Maverick is already multi-modal, supporting text and images.

But before adding more AI modes to its roster, Arcee says, it wanted a base LLM that would impress its main target customers: developers and academics. The team particularly wants to woo U.S. companies of all sizes away from choosing open models from China.

“Ultimately, the winners of this game, and the only way to really win over the usage, is to have the best open-weight model,” Atkins said. “To win the hearts and minds of developers, you have to give them the best.”

The benchmarks show that the Trinity base model, currently in preview while more post-training takes place, is largely holding its own and, in some cases, slightly besting Llama on tests of coding and math, common sense, knowledge and reasoning.

The progress Arcee has made so far to become a competitive AI lab is impressive. The large Trinity model follows two previous small models released in December: the 26B-parameter Trinity Mini, a fully post-trained reasoning model for tasks ranging from web apps to agents, and the 6B-parameter Trinity Nano, an experimental model designed to push the boundaries of models that are tiny yet chatty.

The kicker is, Arcee trained them all in six months for $20 million total, using 2,048 Nvidia Blackwell B300 GPUs. That money came out of the roughly $50 million the company has raised so far, said founder and CEO Mark McQuade (pictured above, on the right).

That kind of cash was “a lot for us,” said Atkins, who led the model building effort. Still, he acknowledged that it pales in comparison to how much bigger labs are spending right now.

The six-month timeline “was very calculated,” said Atkins, whose career before LLMs involved building voice agents for cars. “We are a younger startup that’s extremely hungry. We have a tremendous amount of talent and bright young researchers who, when given the opportunity to spend this amount of money and train a model of this size, we trusted that they’d rise to the occasion. And they certainly did, with many sleepless nights, many long hours.”

McQuade, previously an early employee at open-source model marketplace HuggingFace, says Arcee didn’t start out wanting to become a new U.S. AI Lab: The company was originally doing model customization for large enterprise clients like SK Telecom.

“We were only doing post-training. So we would take the great work of others: We would take a Llama model, we would take a Mistral model, we would take a Qwen model that was open source, and we would post-train it to make it better” for a company’s intended use, he said, including doing the reinforcement learning.

But as their client list grew, Atkins said, having their own model was becoming a necessity, and McQuade was worried about relying on other companies. At the same time, many of the best open models were coming from China, which U.S. enterprises were leery of, or were barred from using.

It was a nerve-wracking decision. “I think there’s less than 20 companies in the world that have ever pre-trained and released their own model” at the size and level that Arcee was gunning for, McQuade said.

The company started small at first, trying its hand at a tiny, 4.5B model created in partnership with training company DatologyAI. The project’s success then encouraged bigger endeavors.

But if the U.S. already has Llama, why does it need another open weight model? Atkins says by choosing the open source Apache license, the startup is committed to always keeping its models open. This comes after Meta CEO Mark Zuckerberg last year indicated his company might not always make all of its most advanced models open source.

“Llama can be looked at as not truly open source as it uses a Meta-controlled license with commercial and usage caveats,” he says. This has caused some open source organizations to claim that Llama isn’t open source compliant at all.

“Arcee exists because the U.S. needs a permanently open, Apache-licensed, frontier-grade alternative that can actually compete at today’s frontier,” McQuade said.

All Trinity models, large and small, can be downloaded for free. The largest version will be released in three flavors. Trinity Large Preview is a lightly post-trained instruct model, meaning it’s been trained to follow human instructions, not just predict the next word, which gears it for general chat usage. Trinity Large Base is the base model without post-training.

The third is TrueBase, a model without any instruct data or post-training, so enterprises or researchers who want to customize it won’t have to unroll any data, rules, or assumptions.

Arcee AI will eventually offer a hosted version of its general release model at what it says will be competitive API pricing. That release is up to six weeks away as the startup continues to improve the model’s reasoning training.

API pricing for Trinity-Mini is $0.045 / $0.15, and there is a rate-limited free tier available, too. Meanwhile, the company still sells post-training and customization options.
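To get a rough sense of what those figures mean for a workload, here is a minimal Python sketch. The article does not state the units, so the sketch assumes the two numbers are U.S. dollars per million input tokens and per million output tokens, a common way to quote API pricing. The `TokenPricing` class and the sample token counts are hypothetical and are not part of any Arcee API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPricing:
    """Per-token-type prices, ASSUMED to be USD per one million tokens."""

    input_usd_per_million: float
    output_usd_per_million: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        """Return the estimated cost in USD for the given token counts."""
        total = (
            input_tokens * self.input_usd_per_million
            + output_tokens * self.output_usd_per_million
        )
        return total / 1_000_000


# Figures quoted in the article; the unit interpretation is an assumption.
TRINITY_MINI_ASSUMED = TokenPricing(
    input_usd_per_million=0.045,
    output_usd_per_million=0.15,
)

if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    estimate = TRINITY_MINI_ASSUMED.cost(2_000_000, 500_000)
    print(f"Estimated cost: ${estimate:.4f}")
```

Under that assumption, the hypothetical workload of 2 million input tokens and 500,000 output tokens would cost roughly 16.5 cents. If the published units turn out to differ, only the two constants and the divisor need to change.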
