OpenAI gpt-oss
|

OpenAI Breaks Six-Year Silence with Revolutionary Open-Weight Models gpt-oss-120b and gpt-oss-20b

OpenAI just dropped two game-changing open-weight language models that can run locally on your laptop, and they perform at the level of o4-mini. After six years of keeping its advanced models behind closed doors, the company released gpt-oss-120b and gpt-oss-20b on August 5, 2025, marking its first open-weight release since GPT-2 in 2019.

CEO Sam Altman called it “a big deal” and “a state-of-the-art open-weights reasoning model, with strong real-world performance comparable to o4-mini, that you can run locally on your own computer.” The smaller version can even run on a smartphone, making advanced AI accessible to anyone with basic hardware.

What makes gpt-oss models by OpenAI different

The gpt-oss series represents a significant shift in OpenAI’s strategy. Available under the Apache 2.0 license, these models outperform similarly sized open models on reasoning tasks, demonstrate strong tool use capabilities, and are optimized for efficient deployment on consumer hardware.

Both models use a Mixture-of-Experts (MoE) architecture that dramatically reduces computational requirements. The gpt-oss-120b activates only 5.1 billion parameters per token from its total 117 billion parameters, while the gpt-oss-20b activates 3.6 billion from 21 billion total parameters. This smart design allows the larger model to run on a single 80GB GPU and the smaller one on just 16GB of memory.

The models excel in chain-of-thought reasoning, tool usage, and agentic workflows. They support context lengths of up to 128,000 tokens and use a new tokenizer called o200k_harmony, which OpenAI is also open-sourcing.

Performance that rivals proprietary models

In benchmark testing, these open models punch well above their weight. The gpt-oss-120b outperforms OpenAI’s o3-mini and matches or exceeds o4-mini on competition coding (Codeforces), general problem solving (MMLU and HLE) and tool calling (TauBench). Even more impressive, it beats o4-mini on health-related queries and competition mathematics.

The smaller gpt-oss-20b holds its own against models many times its size. Despite its compact form factor, it matches or exceeds o3-mini on standard evaluations and even outperforms it on competition mathematics and health benchmarks.

Unprecedented safety testing

OpenAI didn’t just release these models, they put them through rigorous adversarial testing first. The company intentionally fine-tuned versions to maximize biology and cybersecurity capabilities, testing what bad actors might do with open-weight access. The results showed that even with malicious fine-tuning, the models couldn’t reach dangerous capability thresholds.

OpenAI gpt-oss test
Image via OpenAI

OpenAI carried out extensive safety training and testing on the open-weight models, filtering out harmful chemical, biological, radiological and nuclear data during pre-training. Three independent expert groups reviewed their methodology and provided recommendations that were incorporated into the final release.

Industry partnerships and availability

The models are immediately available through multiple channels. OpenAI partnered with leading deployment platforms including Azure, Hugging Face, vLLM, Ollama, llama.cpp, LM Studio, AWS, Fireworks, Together AI, Baseten, Databricks, Vercel, Cloudflare, and OpenRouter.

NVIDIA CEO Jensen Huang praised the collaboration, stating that “OpenAI showed the world what could be built on Nvidia AI, and now they’re advancing innovation in open-source software.” The models were trained on NVIDIA H100 GPUs and are optimized for the company’s infrastructure.

Amazon Web Services announced that the models will be available through Amazon Bedrock and Amazon SageMaker AI, with the larger model being “3x more price-performant than the comparable Gemini model, 5x more than DeepSeek-R1, and 2x more than the comparable OpenAI o4 model.”

A strategic shift in AI Access

This release marks a dramatic strategic shift for OpenAI. The company’s business appears to be booming, with $13 billion in annual recurring revenue as of August 2025, up from $10 billion in June, and 700 million weekly active ChatGPT users. Despite this success with proprietary models, OpenAI is returning to its open-source roots.

OpenAI President Greg Brockman told reporters, “It’s been exciting to see an ecosystem develop, and we are excited to contribute to that and really push the frontier and then see what happens from there.”

Real-world applications

The implications extend far beyond technical benchmarks. Developers can now build AI agents, automated reasoning systems, and specialized applications without expensive cloud infrastructure. The models handle multiple programming languages, perform complex mathematical computations, and can be fine-tuned for specific use cases.

gpt-oss dashboard
Image via OpenAI

For small businesses and individual developers, this democratizes access to cutting-edge AI capabilities. The ability to run these models locally means no ongoing API costs, complete data privacy, and the freedom to modify and customize without restrictions.

Competing in the open-weight race

The release of gpt-oss puts OpenAI back in direct competition with Meta’s Llama series, Mistral’s open models, and China’s DeepSeek. However, OpenAI’s models bring unique advantages: state-of-the-art reasoning capabilities, extensive safety testing, and the backing of the company behind ChatGPT.

The timing is strategic. As governments worldwide consider AI regulation, open-weight models that can be inspected and modified offer transparency that closed systems cannot match. OpenAI is positioning itself as a leader in responsible open AI development.

What OpenAI gpt-oss models mean for you

Whether you’re a developer, researcher, or business owner, these models open new possibilities. You can now run sophisticated AI reasoning locally, build custom applications without vendor lock-in, and access capabilities that previously required expensive API subscriptions.

The release includes comprehensive documentation, reference implementations, and a $500,000 red-teaming challenge to identify potential safety issues. OpenAI is clearly committed to making the gpt-oss models as accessible and safe as possible.

This isn’t just another model release – it’s a fundamental shift in how advanced AI capabilities are distributed. For the first time in six years, OpenAI’s most advanced reasoning technology is freely available to anyone willing to download it. The AI landscape just got a lot more interesting.


Read our most recent articles:


Get the latest news and updates from the world of Artificial Intelligence with our weekly newsletter, Artificial Tracker.

Go back

Your message has been sent

Warning
Warning
Warning.

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *