Currently evaluating OpenAI’s o1-preview. Although it is said that it is not always ...

2024-09-17 10:21:42

Currently evaluating OpenAI’s o1-preview. It is said to be not always more accurate than GPT-4o, but when tested against the “World Model”, a collection of problems that LLMs struggle with, o1-preview correctly solves questions like the following, which GPT-4o gets wrong.

Q1. What happens if you push...

https://blog.yostos.org/2024/09/17/currently-evaluating-openais.html

Author Public Key

npub1y0st0svvu5xg6dvswx7dz5m2p7004kmvsx6n22w4yjp3l6fa3mvsef2zz7

Seen on

wss://relay.nos.social wss://nos.lol wss://relay.damus.io

yostos on Nostr: Currently evaluating OpenAI’s o1-preview. Although it is said that it is not always ...