This event has ended. Showing historical data.

$143.93K
1
10

$143.93K
1
10
Trader mode: Actionable analysis for identifying opportunities and edge
This market will resolve according to the company that owns the model with the third-highest arena score based on the Chatbot Arena LLM Leaderboard when the table under the "Leaderboard" tab is checked on January 31, 2026, 12:00 PM ET. Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text set to default (style control on) will be used to resolve this market. If two models are tied for the third-highest arena score at this market's check time, reso
AI-generated analysis based on market data. Not financial advice.
This prediction market asks which company will own the third-ranked artificial intelligence model by the end of February 2026, according to a specific public benchmark. The resolution is based on the Chatbot Arena LLM Leaderboard, a crowdsourced evaluation platform run by the Large Model Systems Organization (LMSYS Org). The leaderboard ranks AI models based on an 'Arena Score' derived from anonymous, randomized user votes where two models compete to produce the better response to a prompt. For this market, the ranking will be determined by checking the leaderboard on February 28, 2026, at 12:00 PM Eastern Time, using the default settings with 'style control' enabled. This setting is designed to filter out stylistic preferences and focus on the substantive quality of model outputs. The market specifically tracks the third-place position, a competitive tier just below the top two contenders. Interest in this market stems from the intense competition and rapid evolution in the generative AI field. Companies invest billions in developing these models, and their public ranking on independent benchmarks like the Chatbot Arena influences investor confidence, developer adoption, and public perception. The third-place spot is particularly contested, as it often represents a battle between established tech giants and well-funded startups vying for market relevance.
The Chatbot Arena LLM Leaderboard was launched in May 2023 by LMSYS Org to address limitations in existing AI benchmarks. Traditional benchmarks often used static question sets that models could be overtrained on, a phenomenon known as 'benchmark contamination.' The Arena introduced a dynamic, crowdsourced evaluation method where real users vote on blind model outputs, providing a more organic measure of model capability and user preference. In its first year, the leaderboard saw OpenAI's GPT-4 consistently hold the top position. A significant shift occurred in March 2024 with the release of Anthropic's Claude 3 model family, where Claude 3 Opus briefly surpassed GPT-4 Turbo to claim the number one spot, demonstrating that the top rank was contestable. This event validated the leaderboard's role as a real-time indicator of a fast-moving competitive landscape. The 'style control' feature was added to the platform to address user feedback that voting could be influenced by writing style (e.g., verbose vs. concise answers) rather than factual accuracy or reasoning quality. By early 2025, the leaderboard had evaluated over 1 million human votes across more than 100 models, cementing its status as a primary reference for comparing large language models.
The ranking of AI models has tangible economic and strategic consequences. For companies, a top-three position on a respected public leaderboard can attract venture capital, enterprise customers, and top AI research talent. It serves as a powerful marketing tool, signaling technical prowess in a crowded market. For developers and businesses choosing which AI models to build upon, the leaderboard offers a comparative snapshot of performance that informs costly integration decisions. Beyond commercial interests, these rankings influence the geopolitical AI race. National governments monitor the leaderboard as a proxy for technological leadership, with the United States and China keenly aware of which companies and, by extension, which countries are producing the most capable models. The concentration of top AI talent and resources in a handful of companies also raises important questions about market control, the future of open-source AI, and the equitable distribution of a transformative technology.
As of late 2024 and early 2025, the Chatbot Arena leaderboard shows a tightly contested top tier. OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet are closely ranked near the top. Google's Gemini models and Meta's latest Llama iterations continue to be evaluated. The competitive landscape remains fluid, with all major companies publicly roadmapping more advanced model releases throughout 2025, setting the stage for the February 2026 ranking that this market will resolve.
The Chatbot Arena is a public benchmark created by LMSYS Org that ranks large language models based on anonymous user votes. Users are presented with two blind model responses to the same prompt and choose which is better. These votes generate an Elo-based 'Arena Score' for each model.
Style control is a filtering mechanism on the leaderboard. When enabled, it attempts to identify and downweight user votes that may be based primarily on stylistic preferences, like verbosity or tone, rather than the factual correctness or reasoning quality of the model's answer. The goal is to make the score reflect substantive performance.
The leaderboard updates continuously as new votes are cast and processed. However, a model's displayed score typically stabilizes after it receives a significant volume of votes, often in the thousands. The market uses a single snapshot taken at a specific date and time for resolution.
The ranking changes as new models are released and evaluated. As of early 2025, the third position has shifted between companies like Google (Gemini) and Anthropic (Claude), depending on the specific model version and the leaderboard's update cycle. The market speculates on who will hold that position on a future date.
The Elo system, originally designed for chess, is a method for calculating relative skill levels. In the Chatbot Arena, each model starts with a base rating. When a user prefers one model over another, rating points are transferred from the loser to the winner, with the amount based on the expected outcome.
Educational content is AI-generated and sourced from Wikipedia. It should not be considered financial advice.
10 markets tracked

No data available
| Market | Platform | Price |
|---|---|---|
![]() | Poly | 100% |
![]() | Poly | 0% |
![]() | Poly | 0% |
![]() | Poly | 0% |
![]() | Poly | 0% |
![]() | Poly | 0% |
![]() | Poly | 0% |
![]() | Poly | 0% |
![]() | Poly | 0% |
![]() | Poly | 0% |





No related news found
Add this market to your website
<iframe src="https://predictpedia.com/embed/f1-6g2" width="400" height="160" frameborder="0" style="border-radius: 8px; max-width: 100%;" title="Which company has the #3 AI model end of January? (Style Control On)"></iframe>